Scale AI Company Overview

Scale AI

● Active
AI / Machine Learning · San Francisco, California · Est. 2016
Tags: AI · Data Labeling · Enterprise · Government · ML Infrastructure
Valuation: $13.8B
Revenue: $870M
Employees: 2k+
Founded: 2016

Scale AI is a data platform company that provides high-quality training data and AI evaluation infrastructure for machine learning models. It serves enterprise customers including the U.S. Department of Defense, OpenAI, Meta, Microsoft, and General Motors by combining proprietary AI-assisted tooling with a global network of human annotators and domain experts.

Recent Press
5 items
Product Launch·2026-03-03
Scale AI Releases SEAL Leaderboard for Enterprise AI Model Evaluation
Scale AI publicly launched SEAL (Scale Evaluation and Alignment Leaderboard), a human-evaluation-based benchmark ranking frontier AI models on coding, reasoning, instruction following, and safety tasks, providing enterprises a trusted signal for model selection beyond automated metrics.
Press Release·2026-01-09
Scale AI Appoints Alexandr Wang as Executive Chairman, Names New CEO
Scale AI founder Alexandr Wang transitioned to Executive Chairman as Scale appointed a new CEO to lead day-to-day operations, reflecting the company's maturation from a startup to an enterprise and government AI infrastructure provider serving Fortune 500 and U.S. defense customers.
Partnership·2025-10-14
Scale AI Launches Frontier Data Partnership Program with Leading AI Labs
Scale AI announced a structured data partnership program providing AI labs with curated pre-training datasets, RLHF preference data pipelines, and safety red-teaming services under multi-year agreements, with Meta, Google DeepMind, and Mistral among inaugural partners.
Partnership·2025-08-05
Scale AI Wins U.S. Army AI Task Force Contract Worth Up to $249M
The U.S. Army selected Scale AI to provide AI data labeling, model evaluation, and AI-assisted analysis services under a multi-year contract through the Army Futures Command AI Task Force, expanding Scale Donovan's footprint across defense operations.
Funding·2025-05-22
Scale AI Raises $1B at $13.8B Valuation to Accelerate Government and Enterprise AI
Scale AI closed a $1 billion Series F funding round led by Accel and Tiger Global, valuing the company at $13.8 billion. The capital will be used to expand Scale Donovan government deployments and grow the enterprise GenAI Platform business.
Company History
9 milestones
Alexandr Wang (19) and Lucy Guo founded Scale AI in San Francisco after Wang left MIT, launching with an API for human-in-the-loop data annotation tasks to help early self-driving car companies label sensor data.

Scale AI Organization Structure & Team

Org Chart
1,500 employees across 12 departments
Chief Executive Officer
Alexandr Wang
Jason Droege · Departments: 1,500 people across 12 departments

Scale AI Financials, Revenue & Market Share

Annual Revenue: $870M (+28% vs. prior year)
YoY Growth: +28% (from $680M in 2024 to $870M in 2025; up from $250M in 2021)
Revenue / Employee: $580K (annual revenue per full-time employee)
Revenue Growth
2021: $250M
2022: $400M
2023: $520M
2024: $680M
2025: $870M
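The growth and per-employee figures above can be re-derived from the yearly revenue numbers. The following is a minimal sketch, not the page's own method: the variable and function names are hypothetical, the 1,500 headcount is the one shown in the org chart section, and the compound annual growth rate is an extra derived number that the page itself does not report.

```python
# Revenue figures (in $M) copied from the Revenue Growth list on this page.
# All names here are illustrative assumptions, not part of any Scale AI API.
revenue_musd = {2021: 250, 2022: 400, 2023: 520, 2024: 680, 2025: 870}

# Headcount as shown in the org chart section (the header card says "2k+").
employees = 1500


def yoy_growth(series):
    """Return {year: percent growth over the prior year} for consecutive years."""
    years = sorted(series)
    return {
        curr: (series[curr] - series[prev]) / series[prev] * 100
        for prev, curr in zip(years, years[1:])
    }


def cagr(series):
    """Compound annual growth rate (percent) between the first and last year."""
    years = sorted(series)
    first, last = years[0], years[-1]
    periods = last - first
    return ((series[last] / series[first]) ** (1 / periods) - 1) * 100


growth = yoy_growth(revenue_musd)
cagr_pct = cagr(revenue_musd)

# Convert $M to $K before dividing by headcount.
revenue_per_employee_kusd = revenue_musd[2025] * 1000 / employees

for year, pct in growth.items():
    print(f"{year}: {pct:+.1f}% YoY")
print(f"2021-2025 CAGR: {cagr_pct:.1f}%")
print(f"Revenue per employee: ${revenue_per_employee_kusd:.0f}K")
```

Under these assumptions the 2025 year-over-year figure rounds to the +28% shown on the page, and $870M divided by 1,500 employees gives the $580K per-employee figure. Using the "2k+" headcount from the header card instead would give a lower number.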
Market Share
AI Training Data Services
Scale AI: 35%
Appen: 18%
Labelbox: 14%
Snorkel AI: 8%
Others: 25%
TAM: $280B
SAM: $45B
SOM: $870M
Revenue Streams
Data Annotation & Labeling: 55%
Government & Defense AI: 30%
Enterprise GenAI Platform: 15%
Business Units
Commercial AI & Enterprise: 55%
Serves AI labs, autonomous vehicle companies, and enterprise customers with data annotation, RLHF services, and the GenAI Platform through Scale Data Engine and Scale Evaluation products.
Government & Defense: 45%
Serves U.S. Department of Defense, military commands, and intelligence community customers through Scale Donovan and classified data annotation programs, with FedRAMP-aligned security infrastructure.

Scale AI Internal Tools & Processes

Internal Tools
12 departments
Engineering · 480 people · 3 roles
Standards & Certifications
10 standards
Compliance frameworks, security audits, and quality certifications this company maintains.
Security
SOC 2 Type II
Certified
Scale AI maintains SOC 2 Type II certification across its data annotation and AI evaluation platforms, demonstrating rigorous controls over security, availability, and confidentiality of the sensitive customer training datasets processed through Scale Data Engine.
Security
ISO 27001
Certified
Scale AI's information security management system is ISO 27001 certified, covering the policies, processes, and controls that govern how Scale handles confidential AI training data, model outputs, and customer intellectual property across its global annotation workforce.
Regulatory
FedRAMP Moderate
In Progress
Scale AI is pursuing FedRAMP Moderate authorization for Scale Donovan to meet U.S. federal procurement requirements, enabling broader deployment of its generative AI platform across civilian government agencies beyond current DoD program offices.
Security
CMMC Level 2
Compliant
Scale AI's government division complies with CMMC (Cybersecurity Maturity Model Certification) Level 2 requirements, enabling it to handle Controlled Unclassified Information (CUI) for U.S. Department of Defense contracts awarded through Scale Donovan and defense data programs.
Privacy
GDPR
Compliant
Scale AI complies with GDPR for annotation work performed by its European-based workforce and for processing personal data contained within training datasets provided by EU-based customers, with data processing agreements and data residency controls embedded in Scale Data Engine.
Privacy
CCPA
Compliant
Scale AI adheres to CCPA requirements governing the collection and use of personal information belonging to California residents who work as taskers in Scale's annotation network or whose data appears in customer-provided training datasets processed on the platform.
Regulatory
NIST AI RMF
Compliant
Scale AI aligns Scale Donovan and Scale Evaluation to the NIST AI Risk Management Framework, providing government customers with structured AI risk identification, measurement, and governance documentation required by recent federal AI executive orders.
Regulatory
ITAR
Compliant
Scale AI's government division complies with International Traffic in Arms Regulations (ITAR) for defense-related AI programs, maintaining a U.S.-person-only workforce and access controls for annotation and evaluation tasks involving export-controlled technical data.
Privacy
HIPAA
Compliant
Scale AI signs Business Associate Agreements (BAAs) and applies HIPAA-compliant data handling controls when processing healthcare training datasets for medical AI customers, including de-identification verification and access logging through Scale Data Engine.
Regulatory
EU AI Act
In Progress
Scale AI is adapting its platform compliance posture to meet EU AI Act obligations as a provider of data and evaluation services used in high-risk AI system development, implementing transparency documentation and conformity assessment support for EU enterprise customers.

Scale AI Interview Preparation

Interview Prep
Role-specific interview questions and keywords.
Engineering · 3 roles

Scale AI Products & Competitors

Product Suite
4 products
AI / Machine Learning
Data & Analytics
Scale Data Engine
The data foundation for AI

Scale Data Engine is an end-to-end platform for curating, annotating, and managing the training datasets that power large language models and multimodal AI systems. Enterprise and government teams use it to run annotation pipelines at scale, combining AI-assisted labeling with expert human review to achieve the data quality required for frontier model training.

Use Cases
Curating and filtering web-scale text corpora for pre-training large language models
Running multi-modal annotation pipelines for image, video, and 3D point cloud data
Managing human expert review queues for complex reasoning and RLHF preference data
Key Customers
OpenAI
Meta
Microsoft
Competitive Intelligence
vs. Appen
Them: Appen is a data annotation and AI training data company that provides crowd-sourced labeling services for text, image, audio, and video datasets used in machine learning model development.
Edge: A purpose-built platform with an AI-assisted labeling pipeline significantly reduces annotation cost and turnaround time compared with Appen's crowd model.

vs. Labelbox
Them: Labelbox is a data-centric AI platform offering annotation tools, model-assisted labeling, and data management workflows for computer vision and NLP training datasets.
Edge: Scale's enterprise-grade quality management and government-cleared workforce give it an advantage on regulated and high-stakes annotation projects.

vs. Surge AI
Them: Surge AI is a human-in-the-loop data labeling platform that focuses on high-complexity NLP tasks, including preference labeling and RLHF data collection for language model fine-tuning.
Edge: Scale Data Engine integrates annotation with data pipeline orchestration and version management, which Surge's more narrowly scoped tooling does not offer.

Related Companies