Service Co

Service Co

Data Engineer – PySpark & AWS

Bengaluru (Bangalore), Mumbai, Pune, Hyderabad, Chennai, GurugramPosted 1 month ago₹1,500,000 – ₹1,900,000
Full TimeSeniorIN

See how this job matches your profile

Sign in for an AI-powered fit score, breakdown, and a tailored resume.

Sign in

Job Description

Pyspark Data Engineer:Hands-on expertise in designing, building, and maintaining Apache Spark pipelines in production environments. Proven experience building and scaling data ingestion frameworks tha

Key Highlights

  • Hands-on expertise in designing, building, and maintaining Apache Spark pipelines in production environments.
  • Proven experience building and scaling data ingestion frameworks that integrate data from multiple source systems, with a focus on reliability, reusability, and scalability.
  • Deep understanding of Spark architecture (driver/executors, DAG, partitioning, shuffles, caching, cluster resource management) and experience operating pipelines at scale, including data transformations on datasets ~500 GB+.
  • Strong understanding of Oracle SQL and HDFS, including handling file formats and applying appropriate data cleansing, normalization, and formatting to produce curated output datasets.
  • Ability to write Python, Pyspark, and shell scripts to process, transform, and automate data workflows. The Candidate should be good in writing application programs and automation manual data processing steps using python.

Skills & Technologies

PySparkAmazon Web Services (AWS)PythonShell ScriptingOracle SQLHDFS

Interested in this role?

Sign in or create a free account to see how this job matches your skills, apply with one click, and let our AI tailor your resume.

Sign in to apply
AI-powered resume optimization
Save and track your applications

Job Details

Employment Type

Full Time

Experience Level

Senior

Salary Range

₹1,500,000 – ₹1,900,000

Location

Bengaluru (Bangalore), Mumbai, Pune, Hyderabad, Chennai, Gurugram

Posted

1 month ago

Country

IN