Jobs//New York City, NY//Data + Analytics//

Software Engineer, Data Infrastructure

| Remote | Hybrid

Sorry, this job was removed at 1:50 a.m. (CST) on Tuesday, December 14, 2021

View 11220 Jobs

Find out who's hiring in New York City, NY.

See all Data + Analytics jobs in New York City, NY

View 11220 Jobs

Apply

By clicking Apply Now you agree to share your profile information with the hiring company.

Reddit is a network of more than 100,000 communities where people can dive into anything through experiences built around their interests, hobbies and passions. Reddit users submit, vote and comment on content, stories and discussions about the topics they care about the most. From pets to parenting, there’s a community for everybody on Reddit and with more than 50 million daily active people, it is home to the most open and authentic conversations on the internet. For more information, visit redditinc.com.

As a data engineer, you will build and maintain the data infrastructure tools used by the entire company to generate, ingest, and access petabytes of raw data. A focus on performance and optimization will enable you to write scalable / fault tolerant code while collaborating with a team of top engineers, all while learning about and contributing to one of the most powerful streaming event pipelines in the world.

Not only will your work directly impact hundreds of millions of users around the world, but your output will also shape the data culture across all of Reddit!

How you will contribute:

Refine and maintain our data infrastructure technologies to support real-time analysis of hundreds of millions of users.
Consistently evolve data model & data schema based on business and engineering requirements.
Own the data pipeline that surfaces 40B+ daily events to all teams, and the tools we use to improve data quality.
Support warehousing and analytics customers that rely on our data pipeline for analysis, modeling, and reporting.

Qualifications:

4+ years of experience writing clean, maintainable, and well-tested code.
Experience with Python and/or Scala.
Familiarity with large scale distributed real-time tools such as Kafka, Flink, or Spark.
Familiarity with ETL design (both implementation and maintenance).
Bonus points for experience with (or desire to learn) Kubernetes.
Excellent communication skills to collaborate with stakeholders in engineering, data science, and product.

#LI-SAP1

Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at [email protected].

More Information on Reddit

Reddit operates in the Information Technology industry. The company is located in San Francisco, CA. Reddit was founded in 2005. It has 1900 total employees. It offers perks and benefits such as Volunteer in local community, Open door policy, OKR operational model, Team based strategic planning, Open office floor plan and Flexible work schedule. To see all 137 open jobs at Reddit, click here.

Read Full Job Description

Software Engineer, Data Infrastructure

Similar Jobs