
Service Co
Data Engineer – PySpark & AWS
Bengaluru (Bangalore), Mumbai, Pune, Hyderabad, Chennai, GurugramPosted 1 month ago₹1,500,000 – ₹1,900,000
Full TimeSeniorIN
See how this job matches your profile
Sign in for an AI-powered fit score, breakdown, and a tailored resume.
Job Description
Pyspark Data Engineer:Hands-on expertise in designing, building, and maintaining Apache Spark pipelines in production environments. Proven experience building and scaling data ingestion frameworks tha
Key Highlights
- Hands-on expertise in designing, building, and maintaining Apache Spark pipelines in production environments.
- Proven experience building and scaling data ingestion frameworks that integrate data from multiple source systems, with a focus on reliability, reusability, and scalability.
- Deep understanding of Spark architecture (driver/executors, DAG, partitioning, shuffles, caching, cluster resource management) and experience operating pipelines at scale, including data transformations on datasets ~500 GB+.
- Strong understanding of Oracle SQL and HDFS, including handling file formats and applying appropriate data cleansing, normalization, and formatting to produce curated output datasets.
- Ability to write Python, Pyspark, and shell scripts to process, transform, and automate data workflows. The Candidate should be good in writing application programs and automation manual data processing steps using python.
Skills & Technologies
PySparkAmazon Web Services (AWS)PythonShell ScriptingOracle SQLHDFS
About the Company
Service Co
View company profile →
Interested in this role?
Sign in or create a free account to see how this job matches your skills, apply with one click, and let our AI tailor your resume.
Sign in to applyAI-powered resume optimization
Save and track your applications
Job Details
Employment Type
Full Time
Experience Level
Senior
Salary Range
₹1,500,000 – ₹1,900,000
Location
Bengaluru (Bangalore), Mumbai, Pune, Hyderabad, Chennai, Gurugram
Posted
1 month ago
Country
IN