Data Engineer (New Grad) – SingleStore
SingleStore is the world's fastest database for data-intensive applications: a distributed SQL database that combines transactional (OLTP) and analytical (OLAP) workloads in a single system, eliminating the operational complexity of maintaining separate databases for operations and analytics. Used by financial services, gaming, and SaaS companies including NASDAQ, Priceline, and GE Digital, SingleStore delivers sub-millisecond analytical queries on live transactional data at petabyte scale. With $230M+ raised and 500+ enterprise customers, SingleStore is disrupting the $100B database market. We are hiring New Grad Data Engineers to build the data infrastructure and tooling that power SingleStore's cloud database platform.
Responsibilities
- Build SingleStore data ingestion pipelines that load data in real time from Kafka topics and CDC streams into SingleStore's distributed columnar storage engine
- Develop SingleStore's Pipelines feature by building automated connectors that ingest from Kafka, Amazon S3, Azure Blob Storage, and Google Cloud Storage into SingleStore workspaces
- Implement SingleStore's external tables integration, which queries Parquet and ORC files in S3 data lakes directly from SingleStore SQL without data movement
- Build benchmarking data pipelines that compare SingleStore query latency against PostgreSQL, MySQL, and Snowflake for customer proof-of-concept engagements
- Develop SingleStore's vector database capabilities, including embedding storage, HNSW approximate nearest neighbor indexing, and vector similarity search pipelines for AI applications
- Support SingleStore's internal data engineering team in maintaining analytics datasets for product telemetry, customer usage, and revenue operations
Requirements
- Bachelor's degree in Computer Science, Data Engineering, or Software Engineering
- Strong SQL skills and understanding of distributed database systems
- Python proficiency for pipeline development and database benchmarking automation
- Familiarity with Apache Kafka, real-time data streaming, or columnar databases
- Interest in database internals, real-time analytics, or AI vector search applications
Benefits
- Competitive salary with SingleStore pre-IPO equity
- SingleStore Helios cloud database credits for personal development
- Medical, dental, and vision benefits
- 401(k) with company match
- San Francisco hybrid office with fast-paced database startup culture