Jump to content

Data build tool

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Citation bot (talk | contribs) at 21:27, 16 November 2021 (Alter: url. URLs might have been anonymized. | Use this bot. Report bugs. | Suggested by AManWithNoPlan | #UCB_webform 554/1581). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
dbt
Developer(s)dbt-Labs
Initial release[1]
Stable release
0.19.2 / June 29, 2021; 4 years ago (2021-06-29)
Repository
Written inPython
Operating systemMicrosoft Windows, macOS, Linux
Available inPython
TypeData analytics, data management
LicenseApache License 2.0
Websitedocs.getdbt.com

dbt is an open-source command line tool that helps analysts and engineers transform data in their warehouse more effectively.[1] It started at RJMetrics in 2016 as a solution to add basic transformation capabilities to Stitch (acquired by Talend in 2018.[2]. The earliest versions of dbt allowed analysts to contribute to the data transformation process following the best practices of software engineering [3].

From the beginning, dbt was open source. In 2018, the dbt Labs team (then called Fishtown Analytics) released a commercial product on top of dbt Core.[4] In April 2020, dbt Labs announced its Series A led by Andreessen Horowitz.[5] In November, dbt Labs announced its Series B led by Andreessen Horowitz and Sequoia.[6] And in June 2021, dbt Labs raised its Series C led by Altimeter, Sequoia, and Andreessen Horowitz.[7]

Overview

dbt enables analytics engineers to transform data in their warehouses by simply writing select statements. dbt handles turning these select statements into tables and views. dbt does the transformation (T) in extract, load, transform (ELT) processes – it doesn’t extract or load data, but it’s extremely good at transforming data that’s already loaded into your warehouse. dbt also enables analysts to work more like software engineers, in line with the dbt viewpoint [8].

Developers

dbt-Labs (previously Fishtown Analytics) is on a mission to help analysts create and disseminate organization knowledge. dbt Labs pioneered the practice of analytics engineering, built the primary tool in the analytics engineering toolbox, and has been fortunate enough to see a fantastic community coalesce to help push the boundaries of the analytics engineering workflow. Today there are 5,500 companies using dbt every week, 15,000 folks in the dbt Community Slack, and 1,000 companies paying for dbt Cloud.

Notes

References

  1. ^ Atwal, Harvinder (9 December 2019). Practical DataOps: Delivering Agile Data Science at Scale. Apress. p. 223. ISBN 978-1-4842-5104-1.
  2. ^ "Stitch is joining Talend". Stitch Data. 2018-11-07.
  3. ^ "Goodbye RJMetrics, Hello Fishtown Analytics". dbt Blog. 2016-08-01.
  4. ^ "Sinter Release Notes, August 2018: pull request builder, fine-grained GitHub permissions, and more". 2018-07-31.
  5. ^ "Fishtown Analytics raises $12.9M Series A for its open-source analytics engineering tool". TechCrunch. 2020-04-22.
  6. ^ "Fishtown Analytics raises $29.5M Series B for its data engineering platform". TechCrunch. 2020-11-11.
  7. ^ "Of the Community, By the Community, For the Community". dbt Blog. 2021-06-30.
  8. ^ "dbt viewpoint".