This is a remote position.
Please go through the entire job post thoroughly before pressing Apply. Post pressing Apply, you shall reach the assessment page that must be attempted.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Busigence is a Decision Intelligence Company. We create decision intelligence products for real people by combining data, technology, business, and behaviour enabling strengthened decisions.
PySpark Developer
Team: Engineering
Location: Remote
Relevant Exp: 0-4 Years
Background: Been there-Done that
Compensation: Above industry standards
Requirements
Remote position (work-from-anywhere)
Immediate joiners must apply
Data Engineering Experienced - course/competitions/internships/job (<4 years)
Competitive compensation
1. Code in Python3 - Numpy?
2.Code in Python3 - Pandas?
4. Code in PySpark3 - SQL?
5.Developed data engineering pipelines on real-world problem (not just toy projects)?
6.Implemented advanced SQL queries
7.Developed complex logics in PySpark3
8.Confidence to learn PySpark3 -MLlib within two weeks?https://spark.apache.org/docs/latest/api/python/reference/pyspark.ml.html (we shall guide but won't spoon-feed)
===========================================
We are offering one of the most challenging & exciting work on Data Pipelines and Machine Learning Pipelines. You shall be working on sophisticated platforms, products and applications
===========================================
ROLE
We are looking for engineers with real passion for distributed computing with actual hands-on experience developing data application on PySpark. You would be required to work with our data science team on development of several data applications.
Mandatory
2. A firm understanding of the underlying mathematics will be needed to adapt modelling techniques to fit the problem space with large data (1M+ records)
5. Worked on development of data platform
Benefits
For more information, visit http://www.busigence.com
Products: http://busigence.com/offering
Careers: http://careers.busigence.com
Research: http://research.busigence.com
Jobs: http://careers.busigence.com
We work extensively & intensely on big data, data science, machine learning, deep learning, reinforcement learning, data analytics, natural language processing, cognitive computing, and business intelligence.
We offer you: [Greatest work of life]
You shall be working on our revolutionary products which are pioneer in their respective categories. This is a fact.
We try real hard to hire fun loving crazy folks who are driven by more than a paycheque. You shall be working with creamiest talent on extremely challenging problems at most happening workplace
Skills Required
- Proficient coding in Python3 (including NumPy)
- Proficient coding in Python3 (including Pandas)
- Proficient in PySpark3 core APIs
- Proficient in PySpark3 SQL
- Built real-world data engineering pipelines (production)
- Implemented advanced SQL queries
- Developed complex logic in PySpark at scale
- Willingness/confidence to learn PySpark MLlib quickly
- Fetch and ingest data from databases, APIs, and flat files
- Strong functional programming skills in Python and data structures
- Convert existing Python code to functional/distributed style
- Implemented complex mathematical logic using PySpark on clusters
- Ability to identify parallelism vs memory constraints and apply best practices
- Experience with PySpark performance tuning, optimization, configuration, and scheduling
- Integrated APIs, streams, databases, and various file formats through PySpark
- Familiarity with functional programming concepts (higher-order functions, immutability)
- Understanding of underlying mathematics for large-data modeling (1M+ records)
- Experience with PySpark MLlib and PySpark ML
- Configured checkpointing and DAGs on PySpark clusters
- Experience developing data platforms
What We Do
Busigence is a Decision Intelligence company that creates decision intelligence products for real people by combining data, technology, business, and behavior to enable strengthened decisions. Founded by IIT alumni, the company focuses on highly disruptive big data technologies, utilizing artificial intelligence and machine learning to deliver actionable business and relationship intelligence solutions.

.png)





