Data Engineer (New Grad) – Dremio
Dremio is the SQL Lakehouse Platform — providing sub-second query performance directly on data lake storage without requiring a separate data warehouse. Dremio's Apache Arrow-native query engine and data reflections (automated materialization) enable analysts to run Tableau and Power BI dashboards directly against S3 data lakes at data warehouse speeds. 2,000+ organizations including Maersk, Regeneron, and Postmates use Dremio to eliminate expensive data warehouse copies and query data in-place on open-format data lakes. With $440M raised, Dremio is pioneering the open data lakehouse revolution. We are hiring New Grad Data Engineers to build the data infrastructure powering Dremio's lakehouse analytics platform.
Responsibilities
- Build Dremio's data source connectors — implementing JDBC, REST, and native connectors to databases (PostgreSQL, MySQL, MongoDB), cloud storage (S3, ADLS, GCS), and SaaS sources
- Develop Dremio's Data Reflections (materialization) engine — implementing incremental refresh pipelines automatically creating and maintaining Apache Iceberg materialized views
- Implement Dremio's Apache Iceberg catalog service — managing Iceberg table metadata, snapshot lifecycle, and compaction operations on cloud object storage
- Build Dremio Arctic data lakehouse catalog — implementing multi-table transaction support, time-travel queries, and cross-table isolation using Apache Iceberg on Nessie catalog
- Develop Dremio's semantic layer — building virtual datasets, calculated columns, and metric definitions that unify business logic across Dremio's federated data sources
- Implement Dremio Cloud monitoring — tracking query performance, reflection hit rates, and data freshness for Dremio's managed cloud customers
Requirements
- Bachelor's degree in Computer Science, Data Engineering, or Software Engineering
- Strong SQL skills, including knowledge of columnar query optimization
- Java or Python proficiency for connector and platform development
- Familiarity with Apache Iceberg, Apache Arrow, Parquet, or open lakehouse formats
- Interest in data lakehouse architecture, open-source database technology, and query engine development
Benefits
- Competitive salary with Dremio pre-IPO equity
- Dremio Cloud platform access and lakehouse engineering mentorship
- Medical, dental, and vision benefits
- 401(k) with Dremio matching
- Santa Clara, CA headquarters with hybrid flexibility and Silicon Valley innovation culture