Big Data Architect

Reposted 5 Days Ago
Be an Early Applicant
Madrid, Comunidad de Madrid
In-Office
7-7 Annually
Senior level
Information Technology
The Role
The Big Data Architect is responsible for designing scalable data architectures using Cloudera and Spark, optimizing data workflows, and ensuring data quality while leading cross-functional teams and maintaining compliance with regulations.
Summary Generated by Built In

Big Data Architect
Position Purpose:
We are seeking a highly skilled and experienced Big Data Architect to join our international team. You will play a pivotal role in shaping our Big Data environments and projects, including the Global Data Lake, while enhancing our Sustainable Estimatics offerings. Sustainable Estimatics is a leading suite within the company, recognized for its substantial impact on the industry. With our innovative and certified algorithms, we provide our customers with significant cost savings by minimizing waste and optimizing resource usage. By embedding sustainability principles into our Estimatics practices, we actively contribute to the industry's collective effort to reduce environmental impact. Our commitment to sustainability goes beyond individual projects; we aim to drive industry-wide innovation through the continuous development of new technologies and practices that create a positive ripple effect for both the environment and society.
As a Big Data Architect, you will be responsible for designing the overall architecture of our data systems, ensuring they are robust, scalable, and efficient. You will develop architectural strategies and frameworks that guide our data processing initiatives, enabling the effective management of large volumes of data from diverse sources worldwide.

What You Will Be Doing:

- Design and implement scalable and efficient data architectures that support data processing pipelines using Cloudera, Spark, and other relevant technologies.

- Lead the development of scalable API solutions to facilitate Data as a Service (DaaS), providing seamless access to data for both external and internal customers.

- Establish best practices for data ingestion, transformation, and storage processes to ensure data quality, integrity, and availability across international locations.

- Collaborate with cross-functional teams to gather business requirements and translate them into comprehensive architectural specifications for data processing and analysis.

- Optimize data workflows and the performance of Spark jobs to ensure they meet stringent latency and throughput requirements while processing massive datasets.

- Conduct troubleshooting and performance tuning of Cloud or On-premises infrastructure to identify performance bottlenecks and enhance resource utilization.

- Leveraging tools like New Relic for performance monitoring and Graylog for log analysis.

- Work closely with data scientists and analysts to ensure timely and reliable data sets for advanced analytics and machine learning models.

- Implement data governance practices and ensure compliance with data privacy and security regulations across various regions.

- Stay abreast of emerging technologies and industry trends related to big data processing, Cloudera, and Spark, and propose innovative architectural solutions to enhance data processing capabilities.

- Provide technical leadership, mentorship, and guidance to engineering teams, fostering a collaborative and innovative culture within the international group.

- Participate in agile development practices, including sprint planning, architecture reviews, and continuous integration and deployment, to ensure high-quality software delivery.
What You Need for this Position:

- Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field.

- A minimum of 7 years of working experience in big data architecture, preferably with Cloudera and Spark technologies.

- Strong understanding of API architectures and best practices, with experience in developing APIs for Data as a Service (DaaS) solutions.

- Strong proficiency in programming languages such as Scala, Python, or Java, with the ability to design and implement complex data solutions.

- In-depth knowledge of distributed computing principles and frameworks, including Hadoop and Spark.

- Extensive experience with Cloudera distribution and tools like HDFS, Hive, Impala, and HBase.

- Strong understanding of data modeling and database design principles, including schema design, partitioning, and indexing.

- Solid understanding of SQL and NoSQL databases, data warehousing concepts, and ETL processes.

- Proven expertise in designing, implementing, and optimizing data pipelines using Spark Streaming, Spark SQL, or other Spark modules.

- Familiarity with data ingestion techniques and tools, such as Kafka, Flume, Sqoop, or Nifi.

- Experience with cloud platforms like AWS or Azure, and knowledge of containerization technologies like Docker or Kubernetes is a plus.

- Understanding of data governance, data privacy, and security practices, particularly in an international context.

- Excellent problem-solving and analytical skills, with the ability to design solutions that optimize data processing workflows.

- English is required, with communication skills to effectively convey complex technical concepts to both technical and non-technical stakeholders.

#LI-JG1

Top Skills

AWS
Azure
Cloudera
Docker
Flume
Hadoop
Hbase
Hdfs
Hive
Impala
Java
Kafka
Kubernetes
Nifi
NoSQL
Python
Scala
Spark
SQL
Sqoop
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Westlake, TX
1,689 Employees
Year Founded: 2005

What We Do

Solera is a leading global provider of integrated vehicle lifecycle and fleet management software-as-a-service, data, and services. Through four lines of business – vehicle claims, vehicle repairs, vehicle solutions and fleet solutions – Solera is home to many leading brands in the vehicle lifecycle ecosystem, including Identifix, Audatex, DealerSocket, Omnitracs, eDriving/Mentor, Explore, CAP HPI, Autodata, and others. Solera empowers its customers to succeed in the digital age by providing them with a “one-stop shop” solution that streamlines operations, offers data-driven analytics, and enhances customer engagement, which Solera believes helps customers drive sales, promote customer retention, and improve profit margins. Solera serves over 300,000 global customers and partners in 100+ countries. For more information, visit www.solera.com.

Similar Jobs

ServiceNow Logo ServiceNow

Director, Renewal Sales

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Madrid, Comunidad de Madrid, ESP
28000 Employees
10-10 Annually

ServiceNow Logo ServiceNow

Architect

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Remote or Hybrid
Madrid, Comunidad de Madrid, ESP
28000 Employees

Datadog Logo Datadog

Field Marketing Manager

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Hybrid
Madrid, Comunidad de Madrid, ESP
6500 Employees

Datadog Logo Datadog

Product Design Intern

Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Easy Apply
Hybrid
Madrid, Comunidad de Madrid, ESP
6500 Employees

Similar Companies Hiring

Axle Health Thumbnail
Logistics • Information Technology • Healthtech • Artificial Intelligence
Santa Monica, CA
17 Employees
Scrunch AI Thumbnail
Software • SEO • Marketing Tech • Information Technology • Artificial Intelligence
Salt Lake City, Utah
Standard Template Labs Thumbnail
Software • Information Technology • Artificial Intelligence
New York, NY
10 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account