Data Engineer – New Grad – Cloudera
Cloudera offers the only enterprise data platform that runs natively on both public clouds and on-premises data centers. Cloudera Data Platform (CDP) covers data ingestion (Apache NiFi), stream processing (Apache Flink), batch analytics (Apache Spark), and machine learning at enterprise scale. More than 3,000 enterprise customers in financial services, healthcare, telecommunications, and government trust Cloudera to process their most sensitive data workloads, where public-cloud-only solutions are not viable. We are hiring New Grad Data Engineers to build and support enterprise data pipelines on Cloudera's hybrid cloud platform.
Responsibilities
- Build enterprise data ingestion pipelines using Apache NiFi, connecting on-premises databases, mainframes, and IoT sensors to Cloudera's cloud data platform
- Develop Apache Spark batch processing jobs in PySpark and Scala that transform large-scale structured and unstructured datasets in Cloudera's data lakehouse
- Implement Apache Flink real-time stream processing pipelines for telecommunications event processing, fraud detection, and IoT sensor analytics
- Design and optimize Apache Hive and Impala data warehouse schemas for large-scale analytical queries on Cloudera Data Platform
- Build data governance workflows using Apache Atlas for metadata management and Apache Ranger for security policy enforcement
- Support enterprise customers migrating legacy on-premises Hadoop workloads to Cloudera's hybrid cloud platform
Requirements
- Bachelor's degree in Computer Science, Data Engineering, or Information Systems
- Python and SQL proficiency; Scala or Java for Spark/Flink development is a plus
- Familiarity with big data frameworks such as Apache Spark, Hadoop, Kafka, or Flink
- Understanding of distributed computing, data lakehouse architecture, and cloud storage (S3, ADLS, GCS)
- Interest in enterprise big data, hybrid cloud, and the Apache open-source ecosystem
Benefits
- Competitive salary with Cloudera equity and annual bonus
- Cloudera Data Platform certification training and exam reimbursement
- Medical, dental, and vision benefits
- 401(k) with Cloudera matching
- Hybrid schedule based at the Santa Clara, CA headquarters, with big data engineering mentorship