Implify, Inc is a global IT solutions and services firm. Since its inception, Implify, Inc has been providing best-quality and cost-effective IT solutions to Fortune 1000 companies, mid-range companies and upcoming companies via its onsite, offshore and in-house service models.
IMPLIFY is an IT consulting services and software development firm dedicated to business success through long-term relationships with our clients and staff. IMPLIFY has built a dynamic, profitable, service-oriented enterprise, and is positioned to successfully respond to trends and changes in the information technology industry.
Job Title: Big Data Engineer / Sr. Engineer
Location: Jersey City NJ
Full Time Permanent
RESPONSIBILITIES
Our Big Data capability team needs hands-on developers who can produce beautiful & functional code to solve complex analytics problems. If you are an exceptional developer with an aptitude for learning and applying new technologies, and you love to push the boundaries to solve complex business problems innovatively, then we would like to talk with you.
• You would be responsible for evaluating, developing, maintaining and testing big data solutions for advanced analytics projects
• The role would involve big data pre-processing & reporting workflows including collecting, parsing, managing, analyzing and visualizing large sets of data to turn information into business insights
• The role would also involve testing various machine learning models on Big Data, and deploying learned models for ongoing scoring and prediction. An appreciation of the mechanics of complex machine learning algorithms would be a strong advantage.
QUALIFICATIONS & EXPERIENCE
• 3+ years of demonstrable experience designing technological solutions to complex data problems, and developing & testing modular, reusable, efficient and scalable code to implement those solutions.
Ideally, this would include work on the following technologies:
• Expert-level proficiency in at least one of Java, C++ or Python (preferred). Scala knowledge is a strong advantage.
• Strong understanding of and experience with distributed computing frameworks, particularly Apache Hadoop 2.0 (YARN; MR & HDFS) and associated technologies -- one or more of Hive, Sqoop, Avro, Flume, Oozie, Zookeeper, etc.
• Hands-on experience with Apache Spark and its components (Streaming, SQL, MLlib) is a strong advantage.
• Operating knowledge of cloud computing platforms (AWS, especially EMR, EC2, S3, SWF services and the AWS CLI)
• Experience working within a Linux computing environment, and use of command line tools including knowledge of shell/Python scripting for automating common tasks
• Ability to work in a team in an agile setting, familiarity with JIRA and a clear understanding of how Git works
In addition, the ideal candidate would have great problem-solving skills, and the ability & confidence to hack their way out of tight corners.
Must Have (hands-on) experience:
• Java or Python or C++ expertise
• Linux environment and shell scripting
• Distributed computing frameworks (Hadoop or Spark)
• Cloud computing platforms (AWS)
Desirable (would be a plus):
• Statistical or machine learning DSL like R
• Distributed and low latency (streaming) application architecture
• Row store distributed DBMSs such as Cassandra
• Familiarity with API design
EDUCATION
• B.E/B.Tech in Computer Science or related technical degree
All your information will be kept confidential according to EEO guidelines.
What We Do
Implify, Inc is a global IT solutions and services firm. We continue to earn an excellent reputation and trust by providing best-quality and cost-effective IT solutions to Fortune 1000 companies, mid-range and upcoming companies through our onsite, offshore and nearshore service models. Our projects vary in scope and complexity, from assignments of a few person-hours, such as troubleshooting, to complete SDLC enterprise-level engagements.
Our services are categorized into Consulting Services, E2E Application Development, Re-engineering & Integration Services, and Professional Services (Smart and Scalable).
Our team consists primarily of full-timers with high standards of accountability, integrity & proficiency. We have demonstrated our focus on long-term relationships with our resources through transparency, performance-based recognition & advancement.
Our project portfolio encompasses a diverse spectrum of technologies and services, including on-site Business Process Review & Analysis, Product Evaluation and Recommendations, Package-specific SDLC implementation, Strategic Planning, Software Configuration Management (SCM), Program Management, Web-based application development, Web Design, Data Warehousing, Systems Integration using middleware products like MQSeries, WebLogic and TIBCO, document management and workflow, and computer-based training and development.
Our deep domain knowledge in Banking, Life Sciences, Retail/Supply Chain, etc., blended with software design, development & user experience, has helped us stay ahead of the curve.
For Life Sciences, we have tailored end-to-end Quality, Compliance and Regulatory solutions. Our team comprises best-in-class resources across functional areas/services such as Computer Systems Validation, Equipment Validation, Process Validation and Cleaning Validation. We understand how vital Validation Engineers and Quality Assurance are in making sure clients' products meet regulatory standards and guidelines.







