100% Remote Golang Developer with Devops/LLM exp. W2 Consultant

Remote Full-time
Hi, Hope you are doing well, Please find the job description given below and let me know your interest. Position: 100% Remote Golang Developer with Devops/LLM exp. || W2 Consultant Location: Remote Duration: 6-12 months Visa: Only USC, GC Job Description We are looking for devs with general cloud services / distributed services experience, with LLM experience as a secondary skill. GPU experience is now low on the list of preferred skills: Dedicated Inference Service Required Skills- Proficiency in Golang for building scalable and performant backend services. Deep experience building services in modern cloud environments on distributed systems (i.e., containerization (Kubernetes, Docker), infrastructure as code, bolthires/CD pipelines, APIs, authentication and authorization, data storage, deployment, logging, monitoring, alerting, etc.) Experience working with Large Language Models (LLMs), particularly hosting them to run inference Strong verbal and written communication skills. Your job will involve communicating with local and remote colleagues about technical subjects and writing detailed documentation. Experience with building or using benchmarking tools for evaluating LLM inference for various models, engine, and GPU combinations. Familiarity with various LLM performance metrics such as prefill throughput, decode throughput, TPOT, and TTFT Experience with one or more inference engines: e.g., vLLM, SGLang, and Modular Max Familiarity with one or more distributed inference serving frameworks: e.g., llm-d, NVIDIA Dynamo, and Ray Serve etc. Experience with AMD and NVIDIA GPUs, using software like CUDA, ROCm, AITER, NCCL, RCCL, etc. Knowledge of distributed inference optimization techniques - tensor/data parallelism, KV cache optimizations, smart routing etc. What You'll Be Working On- Develop and maintain an inference platform for serving large language models optimized for the various GPU platforms they will be run on. Work on complex AI and cloud engineering projects through the entire product development lifecycle (PDLC) - ideation, product definition, experimentation, prototyping, development, testing, release, and operations. Build tooling and observability to monitor system health, and build auto tuning capabilities. Build benchmarking frameworks to test model serving performance to guide system and infrastructure tuning efforts. Build native cross platform inference support across NVIDIA and AMD GPUs for a variety of model architectures. Contribute to open source inference engines to make them perform better on DigitalOcean cloud., Gaurav Gaur Email: | Phone LinkedIn: DMS Vision,INC 4645 Avon Lane, Suite 210 Frisco, TX 75033 Apply tot his job Apply tot his job Apply tot his job
Apply Now →

Similar Jobs

Freelance Graphic Designer, Brochures & Infographics (Healthcare)

Remote Full-time

Digital Ethics & Data Privacy Officer, Oncology Business Unit

Remote Full-time

CONTRACT Ethics, Compliance, and Risk Management Consultant

Remote Full-time

[Remote] Computer Forensic Analyst

Remote Full-time

Remote Merchandise Planning Manager

Remote Full-time

NSW Houston, TX - Retail Merchandising - Part-Time & Flexible

Remote Full-time

Field Merchandiser; part-time - West Palm , FL

Remote Full-time

Experienced Strategy Consultant for AI-Focused Proposal Development

Remote Full-time

Account Executive - bolthires Solutions / Digital Transformation Consulting

Remote Full-time

Sage 100 Consulting Manager

Remote Full-time

Specialized Talent Sourcer

Remote Full-time

remote

Remote Full-time

Remote Customer Support Assistant - Delivering Exceptional Customer Experiences from the Comfort of Your Own Home at blithequark

Remote Full-time

Experienced Remote Customer Service Representative in Healthcare – Delivering Exceptional Patient Experience and Technical Support

Remote Full-time

Experienced International WebChat Outbound Sales Representative – Late Night/Early Morning Shift for a Leading Cruise Line Company

Remote Full-time

Construction Project Manager Entry Level Sales

Remote Full-time

Chewy Customer Support - Fresher Job $24/Hour

Remote Full-time

Bill Review Nurse Specialist - REMOTE! - LVN/LPN/RN

Remote Full-time

[Remote] Remote Sr Pharmacovigilance Associate III

Remote Full-time

Experienced Quality Control Standards Data Analyst and Information Expert – Remote Data Entry Opportunities with arenaflex

Remote Full-time
← Back to Home