Data Engineer – New Grad – Starburst Data
Starburst is the analytics engine for the data lakehouse era. It provides a commercial distribution and managed cloud service for Trino (formerly PrestoSQL), the open-source distributed SQL query engine originally created at Facebook to query petabytes of data across heterogeneous sources without moving it. Used by 300+ enterprises including DoorDash, Lyft, and Bloomberg, Starburst Galaxy lets data teams run SQL queries across data lakes (S3, ADLS, GCS), data warehouses (Snowflake, Redshift), and operational databases, all from a single Trino cluster. With $414M raised, Starburst is defining data lakehouse analytics. We are hiring New Grad Data Engineers to build the data platform that enables federated analytics at enterprise scale.
Responsibilities
- Build Starburst Galaxy data connector integrations, implementing Trino catalog connectors for new data sources including SaaS APIs, operational databases, and custom data formats
- Develop Starburst's data lakehouse accelerator pipelines, building Iceberg table compaction, Z-ordering, and Bloom filter optimization for high-performance Trino queries on S3 data lakes
- Implement Starburst's column-level security and data masking policies, enforcing RBAC and ABAC controls to meet enterprise compliance requirements across federated data queries
- Build Starburst's query performance analytics: collecting query execution metrics, identifying slow queries, and developing automated optimization recommendations
- Develop Starburst's data product marketplace, enabling data teams to publish, discover, and subscribe to certified data products across organizational domains
- Support Starburst's enterprise customers migrating Hive and Presto workloads to Starburst Galaxy's managed cloud platform
Requirements
- Bachelor's degree in Computer Science, Data Engineering, or Software Engineering
- Strong SQL skills, ideally including experience with distributed SQL query engines (Trino, Presto, Spark SQL, BigQuery)
- Python and Java proficiency for connector and platform development
- Understanding of Apache Iceberg, Delta Lake, or Hudi open table formats
- Interest in data lakehouse architecture, federated query, and open-source database technology
Benefits
- Competitive salary with Starburst pre-IPO equity
- Starburst Galaxy cloud platform access and Trino certification support
- Medical, dental, and vision benefits
- 401(k) with Starburst matching
- Remote-first culture with Boston, MA home base and annual team retreats