{"id":7436,"date":"2022-06-29T07:53:52","date_gmt":"2022-06-29T14:53:52","guid":{"rendered":"https:\/\/en.pingcap.com\/?p=7436"},"modified":"2024-07-02T09:39:27","modified_gmt":"2024-07-02T16:39:27","slug":"analytics-on-tidb-cloud-with-databricks","status":"publish","type":"post","link":"https:\/\/www.pingcap.com\/ko\/blog\/analytics-on-tidb-cloud-with-databricks\/","title":{"rendered":"Analytics on TiDB Cloud with Databricks"},"content":{"rendered":"<p>Author: <a href=\"http:\/\/github.com\/Daemonxiao\">Qiang Wu<\/a> (TiDB Cloud Engineer at PingCAP)<br>Editors: <a href=\"http:\/\/github.com\/dcalvin\">Calvin Weng<\/a>, Tom Dewan<\/p>\n\n\n\n<p><a href=\"https:\/\/docs.pingcap.com\/tidbcloud\/public-preview?utm_source=ossinsight&amp;utm_medium=referral\"><strong>TiDB Cloud<\/strong><\/a> is a fully-managed Database-as-a-Service (DBaaS) for TiDB, an open source distributed SQL database.<\/p>\n\n\n\n<p><a href=\"https:\/\/docs.databricks.com\/getting-started\/introduction\/index.html\"><strong>Databricks<\/strong><\/a><strong> <\/strong>is a web-based data analytics platform that works with Spark. It combines the best of data warehouses and data lakes into a lake-house architecture.<\/p>\n\n\n\n<p>With a built-in JDBC driver in Databricks, you can now connect TiDB Cloud to Databricks in a few minutes and use Databricks to analyze the data in TiDB. 
In this article, we will walk you through how to create a TiDB Cloud Developer Tier cluster, connect TiDB to Databricks, and process TiDB data with Databricks.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Set_up_your_TiDB_Cloud_Dev_Tier_cluster\"><\/span><strong>Set up your TiDB Cloud Dev Tier cluster<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n\n\n\n<p>To get started with TiDB Cloud, do the following:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><a href=\"https:\/\/tidbcloud.com\/free-trial?_ga=2.44480749.81743823.1655813347-1323813425.1649663190&amp;_gac=1.62137694.1655195688.CjwKCAjw46CVBhB1EiwAgy6M4uPB5Z-WGqBSRdHFbQizHZMF--Hu8xSvHjX2F5nV17QRfBrI3kfyVhoC9u0QAvD_BwE\"><strong>Sign up<\/strong><\/a> for a TiDB Cloud account and <a href=\"https:\/\/tidb.auth0.com\/login?state=hKFo2SBTemR5aTNBQTBPMkdCUDlqT2wzX1BZQXhJVVJNOW9CaaFupWxvZ2luo3RpZNkgTDlRWmdXNEItaXBVTEZVVG1IVkRON0l2aUZWWXI5WmSjY2lk2SA2SVp0aENmbVJLSVBFblFTVDhhRGJ0TTdTR2RNbmlSbA&amp;client=6IZthCfmRKIPEnQST8aDbtM7SGdMniRl&amp;protocol=oauth2&amp;response_type=token%20id_token&amp;redirect_uri=https%3A%2F%2Ftidbcloud.com%2Fauth_redirect%3Fprev%3D%2Fconsole%2Fclusters%3Futm_source%3Dossinsight%26utm_medium%3Dreferral&amp;scope=openid%20email&amp;nonce=loJ9qTOSY7MDGQKoQ3Kbh7lKi2DFZ-9s&amp;auth0Client=eyJuYW1lIjoiYXV0aDAuanMiLCJ2ZXJzaW9uIjoiOS4xOS4wIn0%3D\"><strong>log in<\/strong><\/a>.<\/li>\n\n\n\n<li>Under <strong>Create Cluster &gt; Developer Tier<\/strong>, select <strong>1 year Free Trial<\/strong>.<\/li>\n\n\n\n<li>Set your cluster name and choose the region for your cluster.<\/li>\n\n\n\n<li>Click <strong>Create<\/strong>. A TiDB Cloud cluster will be created in approximately 1 to 3 minutes.<\/li>\n\n\n\n<li>In the <strong>Overview<\/strong> panel, click <strong>Connect<\/strong> and create the traffic filter. Here we add the IP address range 0.0.0.0\/0, which allows access from any IP address. 
<\/li>\n\n\n\n<li>Take note of your JDBC URL, which you will use later in Databricks.<\/li>\n<\/ol>\n\n\n\n<h2 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Import_sample_data_to_TiDB_Cloud\"><\/span><strong>Import sample data to TiDB Cloud<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n\n\n\n<p>After you create a cluster, it\u2019s time to <a href=\"https:\/\/docs.pingcap.com\/tidbcloud\/import-sample-data\"><strong>migrate the sample data<\/strong><\/a> to TiDB Cloud. For demonstration purposes, we will use a sample system dataset from Capital Bikeshare, a bicycle-sharing platform. The sample data is released under the Capital Bikeshare Data License Agreement.<\/p>\n\n\n<ol>\n<li>In the cluster information pane, click <strong>Import<\/strong>. The <strong>Data Import Task<\/strong> page is displayed.<\/li>\n<li>Configure the import task as follows:\n<ul>\n<li>Data Source Type: <code>Amazon S3<\/code><\/li>\n<li>Bucket URL: <code>s3:\/\/tidbcloud-samples\/data-ingestion\/<\/code><\/li>\n<li>Data Format: <code>TiDB Dumpling<\/code><\/li>\n<li>Role-ARN: <code>arn:aws:iam::385595570414:role\/import-sample-access<\/code><\/li>\n<\/ul>\n<\/li>\n<li>For <strong>Target Database<\/strong>, enter the <strong>Username<\/strong> and <strong>Password<\/strong> of the TiDB cluster.<\/li>\n<li>To start importing the sample data, click <strong>Import<\/strong>. 
The process takes about 3 minutes.<\/li>\n<li>Return to the <strong>Overview<\/strong> panel and click <strong>Connect<\/strong> to get the MyCLI URL.<\/li>\n<li>Use the MyCLI client to verify that the sample data was imported:\n<pre><code>$ mycli -u root -h tidb.xxxxxx.aws.tidbcloud.com -P 4000<\/code><br \/><br \/><code>(none)&gt; SELECT COUNT(*) FROM bikeshare.trips; <\/code><br \/><code>+----------+<\/code><br \/><code>| COUNT(*) |<\/code><br \/><code>+----------+<\/code><br \/><code>| 816090   |<\/code><br \/><code>+----------+<\/code><br \/><code>1 row in set<\/code><br \/><code>Time: 0.786s<\/code><\/pre>\n<\/li>\n<\/ol>\n<h2><span class=\"ez-toc-section\" id=\"Connect_to_TiDB_Cloud_on_Databricks\"><\/span><a id=\"connect\"><\/a>Connect to TiDB Cloud on Databricks<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Before you continue, make sure you have logged in to your workspace on Databricks with your own account. If you don\u2019t have a Databricks account, sign up for a free one first. If you are an experienced Databricks user and want to import the notebook directly, you can skip to <a href=\"#optional\">(Optional) Import the TiDB Cloud example notebook to Databricks<\/a>.<\/p>\n<p>In this section, we will create a new notebook on Databricks, attach it to a Spark cluster, and then use the JDBC URL to connect it to TiDB Cloud.<\/p>\n<ol>\n<li>In the Databricks workspace, create and attach a Spark cluster as shown below:\u00a0<br \/><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-7438 size-full\" src=\"https:\/\/www.pingcap.com\/core\/uploads\/2022\/06\/databricks-workspace.png\" alt=\"\" width=\"1080\" height=\"698\" srcset=\"https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-workspace.png 1080w, https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-workspace-300x194.png 300w, https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-workspace-1024x662.png 1024w, https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-workspace-768x496.png 768w\" 
sizes=\"auto, (max-width: 1080px) 100vw, 1080px\" \/><\/li>\n<li>Configure JDBC in the Databricks notebook. TiDB can use the default JDBC driver in Databricks, so we don&#8217;t need to configure the driver parameter:\n<pre><code>%scala\nval url = \"jdbc:mysql:\/\/tidb.xxxx.prod.aws.tidbcloud.com:4000\"\nval table = \"bikeshare.trips\"\nval user = \"root\"\nval password = \"xxxxxxxxxx\"<\/code><\/pre>\n<div>where<br \/><strong>\u00a0 \u00a0\u00a0 url<\/strong>: JDBC URL used to connect to TiDB Cloud<br \/><strong>\u00a0 \u00a0\u00a0 table<\/strong>: Specify the table, such as ${database}.${table}<br \/><strong>\u00a0 \u00a0 \u00a0user<\/strong>: The username to use to connect to TiDB Cloud<br \/><strong>\u00a0 \u00a0\u00a0 password<\/strong>: The password of the user<\/div>\n<\/li>\n<li>Check the connectivity to TiDB Cloud:\n<pre><code>%scala\nimport java.sql.DriverManager\nval connection = DriverManager.getConnection(url, user, password)\nconnection.isClosed()\nres2: Boolean = false<\/code><\/pre>\n<\/li>\n<\/ol>\n<h2><span class=\"ez-toc-section\" id=\"Analyze_your_data_in_Databricks\"><\/span>Analyze your data in Databricks<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Once the connection is established, you can load TiDB data as a Spark DataFrame and analyze the data in Databricks.<\/p>\n<ol>\n<li>Load TiDB data by creating a DataFrame for Spark. Here we will reference the variables we defined in the previous step:\n<pre><code>%scala\nval remote_table = spark.read.format(\"jdbc\")\n.option(\"url\", url)\n.option(\"dbtable\", table)\n.option(\"user\", user)\n.option(\"password\", password)\n.load()<\/code><\/pre>\n<\/li>\n<li>Query the data. 
Databricks provides a chart display function that lets you choose the type of chart you want:\n<pre><code>%scala\ndisplay(remote_table.select(\"*\"))<\/code><\/pre>\n<img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-7440 size-full\" src=\"https:\/\/www.pingcap.com\/core\/uploads\/2022\/06\/databricks-chart.png\" alt=\"\" width=\"1280\" height=\"650\" srcset=\"https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-chart.png 1280w, https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-chart-300x152.png 300w, https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-chart-1024x520.png 1024w, https:\/\/static.pingcap.com\/files\/2022\/06\/databricks-chart-768x390.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/>\n<\/li>\n<li>Create a view or a table for the DataFrame. In our example, we create a temporary view named &#8220;trips&#8221;:\n<pre><code>%scala\nremote_table.createOrReplaceTempView(\"trips\")<\/code><\/pre>\n<\/li>\n<li>Query the data using SQL statements. 
The following statement counts the trips for each bike type:\n<pre><code>%sql\nSELECT rideable_type, COUNT(*) count FROM trips GROUP BY rideable_type ORDER BY count DESC<\/code><\/pre>\n<\/li>\n<li>Write the analytic results back to TiDB Cloud. The following cell recomputes the per-type counts from the <code>trips<\/code> view and appends them to the <code>bikeshare.type_count<\/code> table:\n<pre><code>%scala\nimport org.apache.spark.sql.SaveMode\n\n\/\/ Aggregate trips per bike type, then append the result to TiDB Cloud over JDBC\nspark.sql(\"SELECT rideable_type, COUNT(*) AS count FROM trips GROUP BY rideable_type\")\n.write\n.format(\"jdbc\")\n.option(\"url\", url)\n.option(\"dbtable\", \"bikeshare.type_count\")\n.option(\"user\", user)\n.option(\"password\", password)\n.option(\"isolationLevel\", \"NONE\")\n.mode(SaveMode.Append)\n.save()<\/code><\/pre>\n<\/li>\n<\/ol>\n<h2 id=\"optional\"><span class=\"ez-toc-section\" id=\"Optional_Import_the_TiDB_Cloud_example_notebook_to_Databricks\"><\/span>(<a id=\"optional\"><\/a>Optional) Import the TiDB Cloud example notebook to Databricks<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>This is a TiDB Cloud sample notebook that contains the steps from <a href=\"#connect\"><strong>Connect to TiDB Cloud on Databricks<\/strong><\/a> and <a href=\"#Analyze_your_data_in_Databricks\"><strong>Analyze your data in Databricks<\/strong><\/a>. 
You can import it directly to focus on the analysis itself.<\/p>\n<ol>\n<li>In your Databricks workspace, click <strong>Create<\/strong> &gt; <strong>Import<\/strong> and paste the <a href=\"https:\/\/databricks-prod-cloudfront.cloud.databricks.com\/public\/4027ec902e239c93eaaa8714f173bcfc\/609149376287780\/2904026767880673\/444172601226222\/latest.html\">TiDB Cloud example URL<\/a> to download the notebook to your own Databricks workspace.\n<img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-7442 size-full\" src=\"https:\/\/www.pingcap.com\/core\/uploads\/2022\/06\/DB-workspace-1.png\" alt=\"\" width=\"1280\" height=\"715\" srcset=\"https:\/\/static.pingcap.com\/files\/2022\/06\/DB-workspace-1.png 1280w, https:\/\/static.pingcap.com\/files\/2022\/06\/DB-workspace-1-300x168.png 300w, https:\/\/static.pingcap.com\/files\/2022\/06\/DB-workspace-1-1024x572.png 1024w, https:\/\/static.pingcap.com\/files\/2022\/06\/DB-workspace-1-768x429.png 768w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/>\n<\/li>\n<li>Attach this notebook to your Spark cluster.<\/li>\n<li>Replace the JDBC configuration in the example with that of your own TiDB Cloud cluster.<\/li>\n<li>Follow the steps in the notebook and try TiDB Cloud with Databricks.<\/li>\n<\/ol>\n<h2><span class=\"ez-toc-section\" id=\"Conclusion\"><\/span>Conclusion<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>This article shows how to use TiDB Cloud with Databricks. You can click <a href=\"https:\/\/tidbcloud.com\/free-trial?_ga=2.44480749.81743823.1655813347-1323813425.1649663190&amp;_gac=1.62137694.1655195688.CjwKCAjw46CVBhB1EiwAgy6M4uPB5Z-WGqBSRdHFbQizHZMF--Hu8xSvHjX2F5nV17QRfBrI3kfyVhoC9u0QAvD_BwE\">here<\/a> to try TiDB Cloud now in just a few minutes. 
In the meantime, we are working on another tutorial about how to connect to TiDB from Databricks via <a href=\"https:\/\/github.com\/pingcap\/tispark\">TiSpark<\/a>, a thin query layer built for running Apache Spark on top of TiDB\/TiKV. Subscribe to our blog to stay tuned.<\/p>\n\n\n<p><strong>Keep reading:<\/strong><br><a href=\"https:\/\/www.pingcap.com\/ko\/blog\/analytics-on-tidb-cloud-with-databricks\/\">Using Airbyte to Migrate Data from TiDB Cloud to Snowflake<\/a><br><a href=\"https:\/\/www.pingcap.com\/ko\/blog\/how-to-achieve-high-performance-data-ingestion-to-tidb-in-apache-flink\/\">How to Achieve High-Performance Data Ingestion to TiDB in Apache Flink<\/a><br><a href=\"https:\/\/www.pingcap.com\/ko\/blog\/data-transformation-on-tidb-made-easier\/\">Data Transformation on TiDB Made Easier<\/a><\/p>","protected":false},"excerpt":{"rendered":"<p>In this article, we will walk you through how to create a TiDB Cloud Developer Tier cluster, connect TiDB to Databricks, and process TiDB data with Databricks.\u00a0<\/p>","protected":false},"author":179,"featured_media":7450,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ub_ctt_via":"","footnotes":""},"categories":[13],"tags":[163,31,29],"class_list":["post-7436","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-product","tag-app-developer","tag-tidb-cloud","tag-tutorial"],"acf":[],"featured_image_src":"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg","author_info":{"display_name":"Qiang Wu","author_link":"https:\/\/www.pingcap.com\/ko\/blog\/author\/wuqiang\/"},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.9 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Analytics on TiDB Cloud with Databricks | TiDB<\/title>\n<meta name=\"description\" content=\"In this article, we will walk you through how to create a TiDB Cloud Developer Tier cluster, 
connect TiDB to Databricks, and process TiDB data with Databricks.\u00a0\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.pingcap.com\/ko\/blog\/analytics-on-tidb-cloud-with-databricks\/\" \/>\n<meta property=\"og:locale\" content=\"ko_KR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Analytics on TiDB Cloud with Databricks | TiDB\" \/>\n<meta property=\"og:description\" content=\"In this article, we will walk you through how to create a TiDB Cloud Developer Tier cluster, connect TiDB to Databricks, and process TiDB data with Databricks.\u00a0\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.pingcap.com\/ko\/blog\/analytics-on-tidb-cloud-with-databricks\/\" \/>\n<meta property=\"og:site_name\" content=\"TiDB\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/facebook.com\/pingcap2015\" \/>\n<meta property=\"article:published_time\" content=\"2022-06-29T14:53:52+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-07-02T16:39:27+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"853\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Qiang Wu\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud_databricks-scaled.jpg\" \/>\n<meta name=\"twitter:creator\" content=\"@PingCAP\" \/>\n<meta name=\"twitter:site\" content=\"@PingCAP\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Qiang Wu\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5\ubd84\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/\"},\"author\":{\"name\":\"Qiang Wu\",\"@id\":\"https:\/\/www.pingcap.com\/#\/schema\/person\/6d5c9724aac811f92d08b1680044d3d4\"},\"headline\":\"Analytics on TiDB Cloud with Databricks\",\"datePublished\":\"2022-06-29T14:53:52+00:00\",\"dateModified\":\"2024-07-02T16:39:27+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/\"},\"wordCount\":812,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.pingcap.com\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg\",\"keywords\":[\"App Developer\",\"TiDB Cloud\",\"Tutorial\"],\"articleSection\":[\"Product\"],\"inLanguage\":\"ko-KR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/\",\"url\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/\",\"name\":\"Analytics on TiDB Cloud with Databricks | 
TiDB\",\"isPartOf\":{\"@id\":\"https:\/\/www.pingcap.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg\",\"datePublished\":\"2022-06-29T14:53:52+00:00\",\"dateModified\":\"2024-07-02T16:39:27+00:00\",\"description\":\"In this article, we will walk you through how to create a TiDB Cloud Developer Tier cluster, connect TiDB to Databricks, and process TiDB data with Databricks.\u00a0\",\"breadcrumb\":{\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#breadcrumb\"},\"inLanguage\":\"ko-KR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage\",\"url\":\"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg\",\"contentUrl\":\"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg\",\"width\":2560,\"height\":853},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.pingcap.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Analytics on TiDB Cloud with Databricks\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.pingcap.com\/#website\",\"url\":\"https:\/\/www.pingcap.com\/\",\"name\":\"TiDB\",\"description\":\"TiDB | SQL at 
Scale\",\"publisher\":{\"@id\":\"https:\/\/www.pingcap.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.pingcap.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ko-KR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.pingcap.com\/#organization\",\"name\":\"PingCAP\",\"url\":\"https:\/\/www.pingcap.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/www.pingcap.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/static.pingcap.com\/files\/2021\/11\/pingcap-logo.png\",\"contentUrl\":\"https:\/\/static.pingcap.com\/files\/2021\/11\/pingcap-logo.png\",\"width\":811,\"height\":232,\"caption\":\"PingCAP\"},\"image\":{\"@id\":\"https:\/\/www.pingcap.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/facebook.com\/pingcap2015\",\"https:\/\/x.com\/PingCAP\",\"https:\/\/linkedin.com\/company\/pingcap\",\"https:\/\/youtube.com\/channel\/UCuq4puT32DzHKT5rU1IZpIA\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.pingcap.com\/#\/schema\/person\/6d5c9724aac811f92d08b1680044d3d4\",\"name\":\"Qiang Wu\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ko-KR\",\"@id\":\"https:\/\/www.pingcap.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/static.pingcap.com\/files\/2022\/10\/17234942\/avatar.jpg\",\"contentUrl\":\"https:\/\/static.pingcap.com\/files\/2022\/10\/17234942\/avatar.jpg\",\"caption\":\"Qiang Wu\"},\"url\":\"https:\/\/www.pingcap.com\/ko\/blog\/author\/wuqiang\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Analytics on TiDB Cloud with Databricks | TiDB","description":"In this article, we will walk you through how to create a TiDB Cloud Developer Tier cluster, connect TiDB to Databricks, and process TiDB data with Databricks.\u00a0","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.pingcap.com\/ko\/blog\/analytics-on-tidb-cloud-with-databricks\/","og_locale":"ko_KR","og_type":"article","og_title":"Analytics on TiDB Cloud with Databricks | TiDB","og_description":"In this article, we will walk you through how to create a TiDB Cloud Developer Tier cluster, connect TiDB to Databricks, and process TiDB data with Databricks.\u00a0","og_url":"https:\/\/www.pingcap.com\/ko\/blog\/analytics-on-tidb-cloud-with-databricks\/","og_site_name":"TiDB","article_publisher":"https:\/\/facebook.com\/pingcap2015","article_published_time":"2022-06-29T14:53:52+00:00","article_modified_time":"2024-07-02T16:39:27+00:00","og_image":[{"width":2560,"height":853,"url":"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg","type":"image\/jpeg"}],"author":"Qiang Wu","twitter_card":"summary_large_image","twitter_image":"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud_databricks-scaled.jpg","twitter_creator":"@PingCAP","twitter_site":"@PingCAP","twitter_misc":{"Written by":"Qiang Wu","Est. 
reading time":"5\ubd84"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#article","isPartOf":{"@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/"},"author":{"name":"Qiang Wu","@id":"https:\/\/www.pingcap.com\/#\/schema\/person\/6d5c9724aac811f92d08b1680044d3d4"},"headline":"Analytics on TiDB Cloud with Databricks","datePublished":"2022-06-29T14:53:52+00:00","dateModified":"2024-07-02T16:39:27+00:00","mainEntityOfPage":{"@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/"},"wordCount":812,"commentCount":0,"publisher":{"@id":"https:\/\/www.pingcap.com\/#organization"},"image":{"@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage"},"thumbnailUrl":"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg","keywords":["App Developer","TiDB Cloud","Tutorial"],"articleSection":["Product"],"inLanguage":"ko-KR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/","url":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/","name":"Analytics on TiDB Cloud with Databricks | TiDB","isPartOf":{"@id":"https:\/\/www.pingcap.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage"},"image":{"@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage"},"thumbnailUrl":"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg","datePublished":"2022-06-29T14:53:52+00:00","dateModified":"2024-07-02T16:39:27+00:00","description":"In this article, we will walk you through how to create a TiDB Cloud 
Developer Tier cluster, connect TiDB to Databricks, and process TiDB data with Databricks.\u00a0","breadcrumb":{"@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#breadcrumb"},"inLanguage":"ko-KR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/"]}]},{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#primaryimage","url":"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg","contentUrl":"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg","width":2560,"height":853},{"@type":"BreadcrumbList","@id":"https:\/\/www.pingcap.com\/blog\/analytics-on-tidb-cloud-with-databricks\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.pingcap.com\/"},{"@type":"ListItem","position":2,"name":"Analytics on TiDB Cloud with Databricks"}]},{"@type":"WebSite","@id":"https:\/\/www.pingcap.com\/#website","url":"https:\/\/www.pingcap.com\/","name":"\ud2f0DB","description":"TiDB | SQL at 
Scale","publisher":{"@id":"https:\/\/www.pingcap.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.pingcap.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ko-KR"},{"@type":"Organization","@id":"https:\/\/www.pingcap.com\/#organization","name":"PingCAP","url":"https:\/\/www.pingcap.com\/","logo":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/www.pingcap.com\/#\/schema\/logo\/image\/","url":"https:\/\/static.pingcap.com\/files\/2021\/11\/pingcap-logo.png","contentUrl":"https:\/\/static.pingcap.com\/files\/2021\/11\/pingcap-logo.png","width":811,"height":232,"caption":"PingCAP"},"image":{"@id":"https:\/\/www.pingcap.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/facebook.com\/pingcap2015","https:\/\/x.com\/PingCAP","https:\/\/linkedin.com\/company\/pingcap","https:\/\/youtube.com\/channel\/UCuq4puT32DzHKT5rU1IZpIA"]},{"@type":"Person","@id":"https:\/\/www.pingcap.com\/#\/schema\/person\/6d5c9724aac811f92d08b1680044d3d4","name":"Qiang Wu","image":{"@type":"ImageObject","inLanguage":"ko-KR","@id":"https:\/\/www.pingcap.com\/#\/schema\/person\/image\/","url":"https:\/\/static.pingcap.com\/files\/2022\/10\/17234942\/avatar.jpg","contentUrl":"https:\/\/static.pingcap.com\/files\/2022\/10\/17234942\/avatar.jpg","caption":"Qiang Wu"},"url":"https:\/\/www.pingcap.com\/ko\/blog\/author\/wuqiang\/"}]}},"grav_blocks":false,"card_markup":"<a class=\"card-resource bg-white\" href=\"https:\/\/www.pingcap.com\/ko\/blog\/analytics-on-tidb-cloud-with-databricks\/\"><div class=\"card-resource__image-container\"><img class=\"card-resource__image\" alt=\"tidbcloud-databricks\" src=\"https:\/\/static.pingcap.com\/files\/2022\/06\/tidbcloud-databricks-scaled.jpg\" loading=\"lazy\" width=2560 height=853 \/><\/div><div class=\"card-resource__content-container\"><div 
class=\"card-resource__content-head\"><div class=\"card-resource__category\">Product<\/div><\/div><h5 class=\"card-resource__title\">Analytics on TiDB Cloud with Databricks<\/h5><\/div><\/a>","_links":{"self":[{"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/posts\/7436","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/users\/179"}],"replies":[{"embeddable":true,"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/comments?post=7436"}],"version-history":[{"count":40,"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/posts\/7436\/revisions"}],"predecessor-version":[{"id":18004,"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/posts\/7436\/revisions\/18004"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/media\/7450"}],"wp:attachment":[{"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/media?parent=7436"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/categories?post=7436"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.pingcap.com\/ko\/wp-json\/wp\/v2\/tags?post=7436"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}