Top California Big Data Startups (275)
Our mission is to remove inefficiency from the foundation of AI. By combining new research in information theory, probabilistic modeling, and distributed systems, we’re creating self-optimizing data infrastructure that continuously improves how information is represented and used by intelligent systems.
Hex is changing the way people work with data. Our platform makes analytics workflows more powerful, collaborative, and shareable. Hex solves key pain points with today's data and analytics tooling, and is loved by thousands of users all over the world for the beautiful UI, new superpowers, and boundless flexibility. We are a tight-knit crew of engineers, designers, and data aficionados....
Product.ai (formerly Demand.io) is the truth layer for commerce. Built on Axiomatic Intelligence — a proprietary adversarial reasoning methodology that stress-tests product claims against physics, economics, and engineering constraints — Product.ai delivers verified purchase verdicts, not summaries. Product.ai tells consumers when NOT to buy. Product.ai emerges from Demand.io, a profitable, bootstrapped AI commerce company whose SimplyCodes platform processes over $1B...
Proven at global web scale in production for modern data services, Alluxio is the developer of open source data orchestration software for the cloud. Alluxio moves data closer to big data and machine learning compute frameworks in any cloud across clusters, regions, clouds and countries, providing memory-speed data access to files and objects. Intelligent data tiering and data management deliver...
InfluxData is the creator of InfluxDB, the leading time series database used by millions of developers building real-time systems. InfluxDB captures and analyzes massive streams of high-resolution data, giving AI models the context for predictive maintenance and anomaly detection so systems can detect, alert, and adapt as conditions change. Built for high performance at any scale and in any environment,...
At Poggio Labs, we’re on a mission to help sellers win with an AI workspace. Poggio builds dynamic, realtime account plans for all of your customers, tailored to your unique value props, and generated in minutes. Our general view of team building is that good, smart people want big problems to solve with great teammates, and our job is to bring...
Last year at Loka, our remote team of engineers, developers, and designers helped route out hate speech on Twitter, eliminate $1 billion in food waste, launch 4 projects in LokaLabs™ (our own incubator), accelerate 3 startups to acquisition, and still enjoyed every other Friday off. We are using modern technologies to support an incredible variety of meaningful projects, while...
VergeSense is a Workplace Analytics Platform trusted by enterprises across the globe. Businesses use VergeSense to transform their static office into a dynamic workplace that matches today's employee needs and expectations. Its AI-driven platform includes intelligent sensors that collect real-time data, dashboards and insights that drive workplace strategy, and integrations with the leading workplace technologies. Today VergeSense analyzes over 40...
Element Critical owns and operates data centers in Chicago, Austin, Houston, Silicon Valley, and Northern Virginia. Our Tier III, hybrid IT-ready facilities are network-rich and concurrently maintainable. Element Critical cares as much about the people we serve as the servers we house. Our local teams offer robust colocation with highly customizable deployment space in a variety of deployment sizes and...
SHEIN Technology is a U.S. technology company. Founded in 2012, SHEIN is a leading global online retailer with operations in Guangzhou, Los Angeles and Singapore, along with other key markets. SHEIN reaches consumers across more than 150 countries and regions around the world. We place a premium on choice, delivering more than 6,000 new fashion, beauty and lifestyle products daily...
Measurabl is the world’s most widely adopted ESG (environmental, social, governance) data management solution for commercial real estate. With more than 53,000 commercial buildings representing nearly 10 billion square feet across 78 countries, Measurabl helps innovative companies measure, manage and disclose their ESG performance, assess their portfolio’s exposure to physical climate risk, and gain access to additional services such as...
Airbyte specializes in open-source data integration, designed to centralize data from diverse sources into storage solutions like data warehouses and lakes. Supporting over 400 connectors and a self-serve, extensible framework, Airbyte enables organizations to move both structured and unstructured data seamlessly for uses like AI, analytics, and business intelligence. Airbyte’s flexibility in deployment—whether cloud, hybrid, or on-premises—prioritizes data security, compliance,...
Guidewheel is on a mission to empower all the world’s factories to reach sustainable peak performance. Inspired by the simple, universal truth that every machine on the factory floor has a power cord, our plug-and-play FactoryOps platform makes the power of the cloud accessible to any factory. Guidewheel clips onto any machine to turn its real-time “heartbeat” into a connected,...
Cypris is the vertical AI platform for corporate R&D and innovation teams. We centralize scientific papers, patents, market news, and company intelligence into a single platform with over 500M+ global data points. On top of this foundation, we’re building AI Agents tailored to R&D workflows, powered by the latest models from OpenAI, Anthropic, and others. With Cypris, teams can manage projects,...
Fuel Cycle unleashes the power of decision intelligence for legendary brands. We achieve this by enabling brands to rapidly capture and act on the mission-critical insights required to launch new products, acquire customers, and gain market share. By leveraging the Research Engine, brands forge connections with their key audiences and harness actionable insights that drive confident business decisions.
Datawizz is revolutionizing data management with advanced synthetic data solutions. We help businesses unlock the power of their data while ensuring privacy and compliance. Our technology generates realistic synthetic data for machine learning, software testing, data enrichment, and augmentation—all within a click of a button. Enhance your data strategies with Datawizz and drive innovation securely and efficiently.
Tidepool is a 501(c)3 nonprofit organization on a mission to make diabetes data more accessible, actionable, and meaningful for people with diabetes, their care teams, and researchers. Founded in 2013, Tidepool hosts a suite of free software tools for people with diabetes and the clinics that serve them, including Tidepool Web, Tidepool Mobile, Tidepool Uploader, and, pending submission to FDA...
Everstream Analytics sets the global supply chain standard. Through the application of artificial intelligence and predictive analytics to its vast proprietary dataset, Everstream delivers the predictive insights and risk analytics businesses need for a smarter, more autonomous and sustainable supply chain. Everstream’s proven solution integrates with procurement, logistics and business continuity platforms generating the complete information, sharper analysis, and accurate...
Metabase is bringing data tools with the elegance and simplicity of consumer products to the crufty world of enterprise business intelligence. We provide an opinionated open source starting point for how companies should measure, analyze and share their data as well as a suite of tools to deal with the complexity that arises as they grow. We'd hiring and would love...
We are a consulting firm that specializes in engineering and management. We have provided services for some of the largest tech companies in the San Francisco Bay Area. These services include mechanical, electrical, software design, data collection quality assurance, and testing of new hardware and software. We pride ourselves in being a small company, which is able to be extremely...

.png)
_0.png)
.png)



_0.png)
























.png)



