Data Processing Quality Engineer

Hyderābād, Telangāna
In-Office
Senior level
Software
The Role

Ensure the quality of the data processing engine in terms of result accuracy, performance fidelity, and robust execution at scale.

Requirements  

BS in EE/CS or equivalent.

5+ years of experience in data processing quality or performance testing for database, data warehouse, or query engine applications. Experience testing platforms such as Apache Spark, Gluten, Velox, or DataFusion is preferred.

Solid knowledge of SQL, Python, and similar data processing languages.

Automation-first mindset, with experience in programming/scripting languages and automation tools.

Strong problem-solving skills and the ability to devise test strategies for complex systems.
Strong debugging and root-cause analysis skills, including narrowing down failures.

Experience in functional, performance, integration, and system-level testing.
Experience with public cloud platforms such as AWS, GCP, and Microsoft Azure.

Good knowledge of tools such as Jira, Confluence, Git, and Jenkins.
Good understanding of SDLC and agile methodologies. 
Good understanding of CI/CD implementations.

Top Skills

Spark
AWS
Confluence
DataFusion
GCP
Git
Gluten
Jenkins
Jira
Microsoft Azure
Python
SQL
Velox

The Company
HQ: Mountain View, California
60 Employees
Year Founded: 2022

What We Do

We are on a mission to make it viable to extract value from all data in the world — so humanity can capture every insight, cure, invention, and opportunity.

Traditional processing solutions based on CPUs and today’s software architectures cannot handle the complexity and volume of data, doubling every two years, with unstructured data now accounting for 90% of all data created. The surge of GenAI and its dependence on huge volumes of unstructured data is compounding the processing challenge. DataPelago is creating a new data processing standard for the accelerated computing era to overcome these performance, cost and scalability limitations.

DataPelago's revolutionary Universal Data Processing Engine accelerates any engine, including open source, on any hardware, using any data type. DataPelago enables organizations to extract value from data at unprecedented price and performance for their GenAI and analytics workloads.
