Raft Region Size: The Invisible Lever for Distributed Database Performance

If you have ever tuned a distributed database, you have probably adjusted obvious knobs: CPU, memory, replication factor, concurrency limits. But there is a quieter setting, one that rarely gets headlines, that has an outsized impact on performance, reliability, and operational sanity: Raft region size. In TiDB, via TiKV, region size refers to the size […]

The Hidden Cost of Database Over-Provisioning

According to the 2025 Kubernetes Cost Benchmark Report, average CPU utilization across Kubernetes clusters is approximately 10 percent. For many teams, that raises an obvious question: why is database utilization so low? The answer is not poor engineering. It is structural. Modern cloud-native database scaling models are built around peak capacity planning. To protect availability, […]

How to Build an AI Advisor That Shows College ROI (Not Rankings)

I know too many people drowning in student debt. A friend got her Master’s in Music from an expensive private school. She’s $120k in debt and works at Starbucks. Another guy I know got a business degree from a school that costs $50k a year. He’s doing the same job he could’ve gotten without the […]

OpenClaw Memory Architecture: Building a Local-First RAG with SQLite

Editor’s Note: At PingCAP, we build distributed systems (TiDB), but we appreciate elegant engineering at any scale. OpenClaw demonstrates a perfect use case for local-first RAG databases: when you need zero-ops, total data privacy, and instant startup for a single user, SQLite is a highly pragmatic choice. We believe in choosing the right tool for […]

Teaching AI Agents to Speak “Production” SQL: Introducing TiDB Skills

AI coding agents are excellent at producing code that “works on my machine”. But as every database engineer knows, there is a massive gap between a query that runs in a local Docker container and one that survives in a high-concurrency production environment. We keep seeing the same issues arise when agents generate SQL based […]

How to Build a Voice-to-Text App That Learns Your Style (Without Storing Your Words)

I’m a fast talker, but standard tools treat every platform like a dry JIRA ticket. To fix this, I dived into Chrome extension development to create Speak It: a voice-to-text app that learns your style without recording your secrets. Using privacy-first AI, the system maps a “fingerprint” of your speech—focusing on formality and sentence length—rather […]

Memory Fragmentation in Linux: What It Is, Why It Hurts, and How to Fix It

Managing memory in a high-performance database environment isn’t just about having enough RAM; it’s about how that RAM is organized. For SREs and DBAs, understanding the nuances of the Linux kernel’s memory management can be the difference between a smooth-running system and unpredictable tail latency. In this post, we’ll break down the core mechanics of […]

MySQL Alternatives at Scale: Why TiDB Beats MariaDB

Dive deep into two popular MySQL alternatives and discover why TiDB is a better option for extreme scalability and real-time analytics.

How Bling Migrated a Mission-Critical 25TB MySQL Database to TiDB

Every fast-growing SaaS platform eventually faces the same reality: scaling a single massive database becomes increasingly risky and expensive. At Bling (Part of the LWSA Group), a leading SaaS ERP platform serving the e-commerce markets in Brazil and Mexico, we hit this ceiling hard. Founded in 2009, Bling powers over 300,000 daily active users and […]

Welcome to the (Agentic) Machine: The Database Trends That Will Define 2026

Last year, I talked a lot about the unification of workloads, or how to bring disparate data processes together. But as we move into 2026, the theme has shifted. It is now unmistakably all about scaling agentic AI.  However, we must be careful not to interpret this through the lens of the last decade. In […]

The Making of TiDB X: Origins, Architecture, and What’s to Come

When we unveiled TiDB X, the new core engine for TiDB Cloud, at our recent TiDB SCaiLE annual event, the response was immediate and enthusiastic. Many people reached out afterward with technical and non-technical questions about where TiDB X came from, why we decided to build it, and how it connects to the future of […]

Distributed SQL Database: Architecture, Scale, and High Availability

A distributed database is any system that spreads data across multiple nodes. However, a distributed SQL database is a stricter subset: it keeps full SQL semantics and ACID transactions, automatically partitions data for horizontal scale, and uses consensus replication (e.g., Raft) so writes are consistent and failover is predictable. In short, distributed SQL gives you […]
123410