Datasaur is the leading NLP data-labeling platform that helps annotators train AI algorithms.

Datasaur creates the most intelligent, efficient and productive tools for all your data labeling needs.

Datasaur was founded in 2019 by Ivan Lee. The company is headquartered in Sunnyvale, California.


Datasaur helps financial, legal, and healthcare companies turn raw unstructured data into valuable ML datasets. The company uses semi-automated labeling and some pretrained models to speed up the data-labeling process and fuel the improvement of natural language processing (NLP) models.


Datasaur automatically fits into your existing workflows with automatic project creation and export, API access to plug in your existing model, object storage (AWS, GCP, etc.) and much more.


Datasaur launched a new feature, Datasaur Dinamic, which allows users to effortlessly train custom NLP models. Dinamic automatically learns and becomes more accurate and powerful. With this streamlined process, teams can quickly build and iterate on models. 


Datasaur has helped support companies such as Spotify, Google, and Qualtrics label a vast array of text data ranging from Word documents to PDFs to audio clips.


Datasaur is backed by Initialized Capital, HNVR, Gold House Ventures, TenOneTen, Segment CTO Calvin French-Owen, and others. The company raised $4M in a Seed round on Aug 03, 2023. This brings Datasaur's total funding to $7.9M to date.



  • Year founded: 2019
  • Funding Info: $7.9M over 3 Rounds (Latest Funding Type: Seed- Series Unknown)
  • Yearly Revenue: NA
  • Employee Size: 51-200
  • Business Valuation: NA
  • City/Town: Sunnyvale
  • State: California
  • Country: United States
Related businesses