Top Tech Jobs & Startup Jobs

Reposted 9 Days AgoSaved
In-Office or Remote
San Francisco, CA, USA
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
As a Machine Learning Engineer, you'll design and implement distributed systems for training large-scale ML models while optimizing performance under challenging conditions.
Top Skills: DeepspeedDistributed SystemsFsdpGrpcMegatronModel ParallelismPython
Reposted 9 Days AgoSaved
In-Office or Remote
Melbourne, Victoria, AUS
Expert/Leader
Expert/Leader
Artificial Intelligence • Information Technology • Software
The Research Scientist will tackle advanced research problems in Protocol Learning, publish findings, and work collaboratively in a distributed model training environment.
Top Skills: Distributed TrainingFederated LearningLarge Language ModelsMachine LearningPyTorch
Reposted 9 Days AgoSaved
In-Office or Remote
2 Locations
Entry level
Entry level
Artificial Intelligence • Information Technology • Software
Pluralis Research invites established researchers for collaboration on decentralized AI through joint publications and shared insights, focusing on model-parallel training.
Top Skills: Distributed MlFederated LearningModel Parallelism
Reposted 13 Days AgoSaved
In-Office or Remote
Sydney, New South Wales, AUS
Expert/Leader
Expert/Leader
Artificial Intelligence • Information Technology • Software
Conduct research in Protocol Learning, publish findings, and engage with distributed machine learning on consumer-grade devices. Transform ideas into foundational papers.
Top Skills: PyTorch
Reposted 13 Days AgoSaved
In-Office or Remote
Melbourne, Victoria, AUS
Internship
Internship
Artificial Intelligence • Information Technology • Software
The Research Scientist Intern will conduct novel research in Protocol Learning, aimed at publishing in top-tier machine learning conferences, and will gain mentorship from senior scientists.
Top Skills: PyTorch
Reposted 20 Days AgoSaved
In-Office
2 Locations
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
Design and implement robust, large-scale distributed ML training systems optimized for low-bandwidth, high-latency environments. Build model-parallel training strategies, checkpointing and recovery, GPU and memory optimizations, P2P networking, NAT traversal, and monitoring to ensure resilient, efficient multi-participant training.
Top Skills: DeepspeedFsdpGrpcMegatronNat TraversalP2PPython
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account