What companies are using TiDB in production?

TiDB is trusted by over 3000 global enterprises across a variety of industries, such as financial services, gaming, and e-commerce. Users include Square (US), Shopee (Singapore), and China UnionPay (China).

How is TiDB different from other relational databases like MySQL?

TiDB is a next-generation, distributed relational database that can independently scale both computing and storage capacity by adding new nodes. Unlike traditional relational databases that only scale vertically, TiDB offers horizontal scalability, high availability with automatic failover, HTAP capabilities for both OLTP and OLAP workloads, and MySQL protocol compatibility so you can replace MySQL without changing application code.

What is the relationship between TiDB and TiDB Cloud?

TiDB is an open-source database best suited for organizations that want to run it on-premises or in their own data centers. TiDB Cloud is a fully managed cloud Database-as-a-Service (DBaaS) built on TiDB, with an easy-to-use web-based management console for managing TiDB clusters in mission-critical production environments.

Is TiDB compatible with MySQL?

TiDB is highly compatible with the MySQL protocol and the common features and syntax of MySQL 5.7 and MySQL 8.0. Ecosystem tools for MySQL such as phpMyAdmin, Navicat, MySQL Workbench, and DBeaver can all be used with TiDB. Some MySQL features are not supported in TiDB due to architectural differences in a distributed system.

What programming languages can I use to work with TiDB?

You can use any programming language supported by the MySQL client or driver, including Java, Go, Python, Ruby, PHP, and more.
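As a minimal sketch of what this looks like from Python, the snippet below assembles connection settings for a MySQL-protocol driver. It assumes the third-party PyMySQL package and TiDB's default SQL port of 4000; the helper names and the host, user, and database values are placeholders chosen for illustration.

```python
def tidb_connection_kwargs(host, user, password, database, port=4000):
    """Build keyword arguments for a MySQL-protocol driver.

    Port 4000 is TiDB's default SQL port; every other value is a placeholder
    supplied by the caller.
    """
    return {
        "host": host,
        "port": port,
        "user": user,
        "password": password,
        "database": database,
    }


def connect(kwargs):
    # Assumption: PyMySQL is installed (pip install pymysql). Any driver that
    # speaks the MySQL protocol should accept equivalent settings.
    import pymysql

    return pymysql.connect(**kwargs)
```

Calling `connect(tidb_connection_kwargs("127.0.0.1", "root", "", "test"))` would then open a session in the same way as against a MySQL server.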

How does TiDB support strong consistency?

TiDB implements Snapshot Isolation and exposes it as the REPEATABLE-READ isolation level for MySQL compatibility. Data is redundantly copied between TiKV nodes using the Raft consensus algorithm to ensure recoverability in the event of node failure. Raft follows a replicated-log, state-machine model: write requests go to a Leader node, which replicates each command to Followers as a log entry; once a majority of nodes have received the entry, it is committed and applied.
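The majority rule can be illustrated with a toy model. This is a sketch of the commit condition only, not TiKV's implementation: the class and method names are invented for illustration, and elections, terms, and log matching are all left out.

```python
class ToyRaftGroup:
    """A toy model of majority commit; not TiKV's actual implementation."""

    def __init__(self, replica_count):
        self.replica_count = replica_count
        self.applied = []  # commands applied to the state machine

    def quorum(self):
        # A majority of replicas, for example 2 of 3 or 3 of 5.
        return self.replica_count // 2 + 1

    def propose(self, command, reachable_followers):
        # The leader holds the entry itself, so it counts as one acknowledgement.
        acks = 1 + min(reachable_followers, self.replica_count - 1)
        if acks >= self.quorum():
            self.applied.append(command)  # committed, then applied
            return True
        return False  # too few replicas received the log entry
```

With three replicas, a write still commits when one follower is unreachable, but not when both are.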

Where can I run TiDB?

TiDB is available for bare metal, cloud-based, or hybrid installations. A Kubernetes Operator is available, and you can also use TiUP to quickly deploy a test environment on your laptop or a full production cluster across many nodes.

How does TiDB ensure high availability?

TiDB uses the Raft consensus algorithm to ensure data is highly available and safely replicated throughout storage in Raft Groups. Data is redundantly copied between TiKV nodes across different Availability Zones to protect against machine or data center failure. Automatic failover ensures your service stays online continuously.

What support is available for TiDB customers?

TiDB is supported by a team with experience running mission-critical use cases for over 3000 global enterprises across financial services, e-commerce, enterprise applications, and gaming. 24/7 support is available for TiDB Enterprise Subscription users.

What are PD, TiDB, TiKV, and TiFlash nodes in a TiDB Cluster?

PD (Placement Driver) is the brain of the TiDB cluster, storing metadata and sending data scheduling commands to TiKV nodes. TiDB is the SQL computing layer that aggregates query results and is horizontally scalable. TiKV is the transactional store for OLTP data, maintained in multiple replicas with native high availability. TiFlash is the analytical storage layer that replicates data from TiKV in real-time to support OLAP workloads using columnar storage.

How does TiDB replicate data between TiKV nodes?

TiKV divides the key-value space into key ranges called Regions. Data is distributed across all nodes using Regions as the basic unit, with PD responsible for spreading Regions evenly. TiDB uses the Raft consensus algorithm to replicate data by Regions — multiple replicas of a Region form a Raft Group, and each data change is recorded as a Raft log that is reliably replicated across nodes.
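To make the idea of key ranges concrete, here is a small sketch of routing a key to its Region. It assumes each Region owns a left-closed, right-open range identified by its start key; the `RegionMap` class and the sample keys are hypothetical and do not reflect PD's real data structures.

```python
import bisect


class RegionMap:
    """Toy routing table: each Region owns the key range [start, next_start)."""

    def __init__(self, start_keys):
        # Keep start keys sorted; the first Region starts at the empty key.
        self.start_keys = sorted(start_keys)

    def region_for(self, key):
        # Index of the last Region whose start key is <= key.
        return bisect.bisect_right(self.start_keys, key) - 1


regions = RegionMap([b"", b"g", b"p"])
print(regions.region_for(b"apple"))  # falls in the first range
```

A binary search over sorted start keys keeps lookups logarithmic in the number of Regions, which matters once a cluster holds many of them.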

How do I make use of TiDB HTAP capabilities?

As a Hybrid Transactional Analytical Processing (HTAP) database, TiDB automatically replicates data between the OLTP store (TiKV) and OLAP store (TiFlash) in real-time. This eliminates the need for a separate data warehouse and supports real-time analytics on transactional data. Typical HTAP use cases include user personalization, AI recommendations, fraud detection, business intelligence, and real-time reporting.

Is there an easy migration path from another RDBMS to TiDB?

Yes. TiDB provides TiDB Lightning and TiDB Data Migration (DM) to migrate data from MySQL databases. Since TiDB implements the MySQL wire protocol, you can use the MySQL client directly. TiKV APIs are also available for Java, Go, Rust, and Python.

What is the difference between TiDB Community Edition and the Enterprise Subscription?

Some features such as audit logging are not included in the Community Edition. The most significant difference is the inclusion of Enterprise Support at the Enterprise Subscription level, providing 24/7 professional support for production environments.

How does TiDB protect data privacy and ensure security?

TiDB includes Transport Layer Security (TLS) and Transparent Data Encryption (TDE) for encryption at rest. It operates across two network planes: one for application-to-TiDB server communication and one for internal data communication. TiDB also supports extended syntax for Subject Alternative Name verification and TLS context for internal communication.

What companies are using TiDB Cloud in production?

TiDB Cloud is trusted by enterprises including Catalyst (US), KNN3 Network (Singapore), and CAPCOM (Japan), alongside thousands of other global organizations across financial services, SaaS, Web3, gaming, and e-commerce.

TiDB Cloud is a fully managed cloud Database-as-a-Service (DBaaS) built on TiDB. It allows developers and DBAs to deploy on Amazon Web Services or Google Cloud through an intuitive console, handling infrastructure management and cluster deployment so teams can focus on building applications. Clusters can be scaled in or out with a simple click.

Is TiDB Cloud compatible with MySQL?

TiDB Cloud is highly compatible with the MySQL protocol and the common features and syntax of MySQL 5.7 and MySQL 8.0. MySQL ecosystem tools including phpMyAdmin, Navicat, MySQL Workbench, and DBeaver can all be used with TiDB Cloud.

Where can I run TiDB Cloud?

TiDB Cloud is currently available on Amazon Web Services (AWS) and Google Cloud.

How does TiDB Cloud ensure high availability?

TiDB Cloud uses the Raft consensus algorithm to replicate data safely across TiKV nodes in different Availability Zones, protecting against machine or data center failure. As a SaaS provider, PingCAP meets SOC 2 Type 2, ISO 27001, ISO 27701, PCI DSS, GDPR, and HIPAA standards to ensure data security, availability, and confidentiality.

What support is available for TiDB Cloud customers?

TiDB Cloud is supported by the same team behind TiDB, with experience running mission-critical workloads for over 3000 global enterprises. 24/7 support is available for all TiDB Cloud users.

How do I make use of TiDB Cloud HTAP capabilities?

TiDB Cloud automatically replicates data between the OLTP store (TiKV) and OLAP store (TiFlash) in real-time, enabling real-time analytics on transactional data without a separate data pipeline. Typical use cases include AI recommendations, fraud detection, business intelligence, and real-time reporting.

Is there an easy migration path from another RDBMS to TiDB Cloud?

Yes. TiDB provides TiDB Lightning and TiDB Data Migration (DM) for migrating from MySQL. TiDB Cloud implements the MySQL wire protocol so existing MySQL clients work directly. TiKV APIs are also available for Java, Go, Rust, and Python.

TiDB Cloud Traffic Replay: Seamless Production Upgrades

Database upgrades are often a source of “performance anxiety.” Even with extensive testing, the gap between a sterile staging environment and the chaotic reality of production—characterized by shifting SQL parameters, bursty concurrency, and complex execution contexts—often leads to unexpected post-upgrade regressions.

Traffic Replay on TiDB Cloud, an internal-only tool currently in development for a wider release, bridges this gap. It allows you to upgrade with production-fidelity confidence by catching execution plan regressions and performance “cliffs” before they ever reach your users.

Key Terminology:

CPS (Commands Per Second): The volume of SQL commands executed per second. Replay fidelity is measured by how closely the test CPS matches the production curve.

Prepared Statements & Plan Cache: Replay preserves statement IDs so that prepared statements are simulated 1:1, which ensures Plan Cache efficiency is accurately modeled.

99% Accuracy: The statistical correlation between production and replay traffic in CPS shape and query mix.
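As a rough illustration of these terms, the sketch below buckets command timestamps into a per-second CPS series and compares two series with a Pearson correlation coefficient. The article does not say which statistic Traffic Replay uses, so Pearson is an assumption here, and the function names are illustrative.

```python
from collections import Counter
from math import sqrt


def cps_series(timestamps, start, seconds):
    """Count commands in each one-second bucket of [start, start + seconds)."""
    counts = Counter(
        int(t - start) for t in timestamps if start <= t < start + seconds
    )
    return [counts.get(i, 0) for i in range(seconds)]


def pearson(xs, ys):
    """Pearson correlation of two equal-length series.

    Undefined (division by zero) if either series is constant.
    """
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)
```

A value close to 1.0 means the replayed CPS curve rises and falls with the production curve. It does not by itself show that the absolute volumes match, so a magnitude check such as the relative error per bucket would be a natural companion.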

The Problem with Traditional Database Testing

Common synthetic benchmarking (client-side simulation) often fails to reflect actual production pressure. Traffic Replay addresses these specific “fidelity gaps”:

Feature	Synthetic Simulation (Traditional)	Traffic Replay (TiDB Cloud)
Load Patterns	Static: Relies on fixed scripts or simple randomization.	Dynamic: Captures mixed request types and fluctuating frequencies.
Data Distribution	Uniform: Often misses hotspots or data skewness.	Authentic: Validates cache and indexing under real-world data skew.
Concurrency	Fixed: Linear or static concurrency models.	Real-world: Replicates bursts and interrelated session states.
Execution Context	Oversimplified: Often misses session variables or Plan Cache state.	High Fidelity: 1:1 mapping of connections and prepared statements.

When Should You Use TiDB Cloud Traffic Replay?

If your application meets any of the following criteria, Traffic Replay should be a mandatory step in your maintenance lifecycle:

Major Version Upgrades: Moving across significant architectural changes (e.g., TiDB 6.x to 8.x).
Optimizer Changes: When enabling new features like optimizer enhancements.
High P99/P999 Sensitivity: For latency-critical applications where even a 5ms regression is unacceptable.
Workload Volatility: For systems with highly bursty traffic or complex query patterns.

How TiDB Cloud Traffic Replay Works

TiDB Cloud Traffic Replay is an integrated operational workflow, not just a tool.

1. Storage & Security

To begin, you must enable audit logging via the web console, which records requests directly to object storage (e.g., S3).

Encryption: Audit logs are stored in encrypted S3 buckets. Data is encrypted in transit via TLS.
Sensitive Data Exclusion: Audit logs contain original SQL statements. To ensure high fidelity, the data is not masked during replay. However, users can manage data privacy at the source: during the recording phase, you can selectively exclude specific databases, tables, or sensitive SQL types.
Retention: Ensure your log retention window covers the “peak traffic” period you wish to replay.

2. The Operational Flow

Environment Setup: Before replaying, create a test cluster and use the TiDB Cloud BR tool to restore production data to this test cluster (usually taking just a few hours). The test cluster size is a trade-off:
- 1:1 Cluster: For highest fidelity (validating P99 latency and resource contention).
- Scaled-down Cluster: For cost-effective directional testing. Note: Expect different absolute latency and hotspot behavior.
Inputs: Select the backup snapshot’s timestamp as the replay’s starting point. You can manually terminate the session at any time once you have gathered sufficient data.
Execution: The replay tool then begins reading audit logs from that timestamp, parsing them into SQL statements, and executing them on the test cluster. The engine maps production connection IDs 1-to-1 to test connections, preserving transaction states.
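The connection mapping described above can be sketched as follows. This is a simplified illustration, not the replay engine itself: the audit record layout (timestamp, connection ID, SQL text) and the `open_connection` factory are assumptions, and the sketch runs sessions one after another rather than concurrently with their original timing.

```python
from collections import defaultdict


def group_by_connection(audit_records):
    """Split audit records into per-connection statement lists, in time order.

    Each record is a hypothetical (timestamp, connection_id, sql) tuple.
    """
    sessions = defaultdict(list)
    for _ts, conn_id, sql in sorted(audit_records, key=lambda r: r[0]):
        sessions[conn_id].append(sql)
    return dict(sessions)


def replay(audit_records, open_connection):
    """Replay each production connection on its own test connection.

    open_connection is a caller-supplied factory that takes a production
    connection ID and returns an object with an execute(sql) method.
    """
    for conn_id, statements in group_by_connection(audit_records).items():
        test_conn = open_connection(conn_id)
        for sql in statements:
            test_conn.execute(sql)
```

Keeping one test connection per production connection is what lets statements such as BEGIN and COMMIT land in the same session, so transaction state carries over as it did in production.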

3. Outputs

Once complete, TiDB Cloud generates a report that compares the health of the two clusters based on:

Key Metric Diffs: CPS, Latency, and CPU deltas.
Slow Query Diffs: Queries that were fast in production but slow in the new version.
Top SQL Diffs: Queries that consume more CPU resources in the new version.
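To show the kind of comparison a slow query diff involves, here is a minimal sketch that flags statements that were fast in production but slow in the replay. The input shape (a mapping from SQL digest to latency in milliseconds) and the function name are assumptions; the 300 ms default mirrors TiDB's default slow-log threshold but is not taken from the report itself.

```python
def slow_query_regressions(prod_latency_ms, replay_latency_ms,
                           slow_threshold_ms=300.0):
    """Return digests under the threshold in production but not in replay.

    Both arguments map a SQL digest to a latency in milliseconds.
    """
    regressions = {}
    for digest, replay_ms in replay_latency_ms.items():
        prod_ms = prod_latency_ms.get(digest)
        if prod_ms is None:
            continue  # no production baseline to compare against
        if prod_ms < slow_threshold_ms <= replay_ms:
            regressions[digest] = (prod_ms, replay_ms)
    return regressions
```

Statements that were already slow in production are deliberately left out, since they are not regressions introduced by the new version.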

The chart below compares the CPS, component CPU usage, and latency results from a customer’s pre-upgrade traffic replay:

The chart shows:

CPS Overlap: The CPS margin of error is within 1%, and the comparison curves nearly overlap, so the workload fidelity is nearly perfect. The replay exercises the actual pressure of your business.
P999 Latency: The max P999 latency decreases by more than 20%. Tail latency matters because upgrades often affect complex edge-case queries.
CPU Utilization: The CPU usage of TiDB, TiKV, and TiFlash decreases by more than 10%, which points to a clear performance gain or cost-saving opportunity.

Next, we have a comparison of slow queries and resource-intensive SQL statements:

It is evident that the total number of slow queries decreased significantly, and optimized SQL statements far outnumbered those that showed regression. For the identified regressions, we can download the details and bind their execution plans to prevent any business impact after the upgrade.

The Regression Playbook: What if Things Go Wrong?

Finding a regression is a “win”—it means you caught it early. Follow this sequence:

Identify: Locate the specific SQL Digest in the “Slow Query” section.
Confirm: Compare execution plans. Is it a stats drift or an optimizer change?
Remediate: Use SQL Plan Binding to lock in a known “good” plan or update statistics.
Validate: Re-run the replay session to ensure the fix holds under production pressure.
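For the Remediate step, the statements involved might look like the ones this sketch generates. It uses TiDB's `CREATE ... BINDING FOR ... USING ...` and `ANALYZE TABLE` syntax; the helper names, the `orders` table, and the `idx_customer` index are hypothetical examples, not objects from this article.

```python
def plan_binding_sql(original_sql, hinted_sql, scope="GLOBAL"):
    """Build a statement that binds original_sql to the plan of hinted_sql."""
    if scope not in ("GLOBAL", "SESSION"):
        raise ValueError("scope must be GLOBAL or SESSION")
    return f"CREATE {scope} BINDING FOR {original_sql} USING {hinted_sql}"


def analyze_sql(table):
    """Build a statement that refreshes optimizer statistics for a table."""
    return f"ANALYZE TABLE {table}"


# Hypothetical regression: force the index the old version used to pick.
print(plan_binding_sql(
    "SELECT * FROM orders WHERE customer_id = 1",
    "SELECT /*+ USE_INDEX(orders, idx_customer) */ * FROM orders "
    "WHERE customer_id = 1",
))
print(analyze_sql("orders"))
```

Refreshing statistics is usually the lighter fix when the cause is stats drift, while a binding pins the plan when the optimizer's choice itself changed.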

Limitations to Keep in Mind

Users should be aware of the following technical constraints inherent in the replay process:

Inter-Connection Temporal Drifts: The absolute temporal alignment between independent sessions may vary, potentially leading to a different global execution order than seen in production.
DML Result Divergence: The results of DML statements may differ from production. For example, if a table uses Auto-increment columns, the generated IDs in the replay environment may not match those in production due to differences in concurrency timing or environment variables.
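The auto-increment case can be reproduced with a toy model. The sketch assumes a single shared counter that hands out IDs in arrival order, which is simpler than a real TiDB cluster's ID allocation; the row labels are made up.

```python
from itertools import count


def assign_ids(insert_order):
    """Give each insert the next auto-increment value in arrival order."""
    next_id = count(1)
    return {row: next(next_id) for row in insert_order}


# Two sessions insert one row each; only the arrival order differs.
production = assign_ids(["order-A", "order-B"])
replayed = assign_ids(["order-B", "order-A"])
print(production["order-A"], replayed["order-A"])
```

The same two rows exist in both environments, but each carries a different generated ID, so any later statement in the log that refers to a literal ID may touch a different row during replay.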

Conclusion

By achieving 99% accuracy in workload reproduction, Traffic Replay on TiDB Cloud transforms upgrades from a “risky event” into a “validated routine.”

Ready to secure your next upgrade? Run a 30-minute traffic session from your last peak period, and compare your P999 latency and Top SQL CPU time.
