Service Co

Data Engineer – PySpark & AWS

Bengaluru (Bangalore), Mumbai, Pune, Hyderabad, Chennai, Gurugram | Posted 1 month ago | ₹1,500,000 – ₹1,900,000

Full Time | Senior | IN

Job Description

PySpark Data Engineer

Key Highlights

Hands-on expertise in designing, building, and maintaining Apache Spark pipelines in production environments.
Proven experience building and scaling data ingestion frameworks that integrate data from multiple source systems, with a focus on reliability, reusability, and scalability.
Deep understanding of Spark architecture (driver/executors, DAG, partitioning, shuffles, caching, cluster resource management) and experience operating pipelines at scale, including data transformations on datasets of roughly 500 GB or more.
Strong understanding of Oracle SQL and HDFS, including handling file formats and applying appropriate data cleansing, normalization, and formatting to produce curated output datasets.
Ability to write Python, PySpark, and shell scripts to process, transform, and automate data workflows. The candidate should be proficient in writing application programs and in automating manual data processing steps using Python.

Skills & Technologies

PySpark, Amazon Web Services (AWS), Python, Shell Scripting, Oracle SQL, HDFS

About the Company

Service Co

Job Details

Employment Type

Full Time

Experience Level

Senior

Salary Range

₹1,500,000 – ₹1,900,000

Location

Bengaluru (Bangalore), Mumbai, Pune, Hyderabad, Chennai, Gurugram

Posted

1 month ago

Country