Data Engineer - R01561112

Posted 3 Hours Ago
Be an Early Applicant
St Louis, MO
Hybrid
Senior level
Information Technology
The Role
Design, build, and maintain scalable Spark/PySpark ETL pipelines and data models. Optimize SQL and Spark jobs, debug production workflows, collaborate with analysts, and document architectures to support large-scale analytics.
Summary Generated by Built In
Senior Software Development Engineer

Primary Skills

  • Spark - Pyspark, SQL, SQL (Basic + Advanced), Python, Hive, Data Modelling Fundamentals

Specialization

  • Data Analysis: Data Engineer

Job requirements

  • Experience Range: 2 - 4 years of experience in software development, with hands-on expertise in data analysis and engineering

  • Key Responsibilities:
  • 1. Design and develop scalable data processing solutions using Spark and PySpark to support advanced analytics initiatives
  • 2. Implement robust data models and optimize SQL queries for efficient data retrieval and transformation
  • 3. Collaborate with data analysts and business stakeholders to translate requirements into technical solutions
  • 4. Build and maintain ETL pipelines leveraging Python, Hive, and SQL for large-scale data integration
  • 5. Conduct thorough code reviews, performance tuning, and debugging to ensure high-quality deliverables
  • 6. Monitor, troubleshoot, and resolve issues in production data workflows, ensuring data accuracy and reliability
  • 7. Document technical processes, data models, and workflow architectures to facilitate knowledge sharing
  • 8. Stay updated with industry trends in big data technologies and proactively recommend improvements

  • Required Skills:
  • 1. Advanced proficiency in Spark and PySpark
  • 2. Strong knowledge of SQL (basic and advanced)
  • 3. Expertise in Python programming for data processing
  • 4. Experience with Hive for data warehousing solutions
  • 5. Solid understanding of data modelling fundamentals
  • 6. Ability to design and optimize ETL pipelines
  • 7. Hands-on experience with large-scale data processing
  • 8. Proficient in performance tuning of SQL and Spark jobs
  • 9. Familiarity with distributed computing concepts
  • 10. Competence in debugging and troubleshooting data workflows

  • Preferred Skills:
  • 1. Experience with cloud-based data platforms such as AWS or Azure
  • 2. Knowledge of Airflow or similar workflow orchestration tools
  • 3. Familiarity with Scala for Spark development
  • 4. Understanding of data governance and security best practices
  • 5. Exposure to machine learning frameworks in Python
  • 6. Experience with continuous integration and deployment for data applications

  • Desired Qualifications:
  • 1. Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field 2. Relevant certifications in big data technologies or data engineering (such as Spark, Python, or SQL) are advantageous

Top Skills

Spark,Pyspark,Sql,Python,Hive
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Chennai
2,676 Employees
Year Founded: 2014

What We Do

Brillio is the leader in global digital business transformation, applying technology with a human touch. We help businesses define internal and external transformation objectives, and translate those objectives into actionable market strategies using proprietary technologies. With 2600+ experts and 13 offices worldwide, Brillio is the ideal partner for enterprises that want to quickly increase their core business productivity, and achieve a competitive edge, with the latest digital solutions.

Similar Jobs

PwC Logo PwC

Senior Associate - Oracle Cloud EPM Analyst

Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Hybrid
36 Locations
370000 Employees
77K-202K Annually

Sprinter Health Logo Sprinter Health

Implementation Manager

Artificial Intelligence • Healthtech • Logistics • Social Impact • Software • Telehealth
Remote or Hybrid
USA
500 Employees
140K-160K Annually
Remote or Hybrid
United States
1750 Employees

LogicMonitor Logo LogicMonitor

GSI Sales Director

Artificial Intelligence • Cloud • Information Technology • Machine Learning • Software
Easy Apply
Remote or Hybrid
USA
1100 Employees

Similar Companies Hiring

Turion Space Thumbnail
Software • Manufacturing • Information Technology • Hardware • Defense • Artificial Intelligence • Aerospace
Irvine, CA
150 Employees
Axle Health Thumbnail
Logistics • Information Technology • Healthtech • Artificial Intelligence
Santa Monica, CA
19 Employees
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
15 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account