AI Engineer - LLM Architect Job at Musing Ai, Pittsburgh, PA

  • Musing Ai
  • Pittsburgh, PA

Job Description


AI Engineer (LLM Architect), Emotional Companion

 

The Role:

This position will help design and ship an emotionally intelligent conversational companion that reduces loneliness and improves daily life for older adults. You will architect the end-to-end AI stack, move fast with real users, and set the technical bar for the team.

What you will do:

  • Architecture: Design the conversational system from intake to response. Own policy, generation, tool use, long-term memory, personalization, and retrieval.
  • Model selection and training: Choose base models, build data pipelines, and run instruction tuning, safety tuning, and preference optimization. Use techniques such as LoRA, DPO, distillation, and quantization to reach latency and cost targets.
  • Prompt and agent design: Create robust system prompts, function-calling schemas, and tool APIs. Stand up an A/B framework to test prompts, policies, and safety rules with real users.
  • Evaluation: Build an automated and human-in-the-loop eval harness for empathy, helpfulness, safety, groundedness, latency, and cost. Define success metrics and wire them into dashboards.
  • Safety and ethics: Implement guardrails for prompt injection, jailbreaks, self-harm, medical boundaries, and misinformation. Add escalation, deflection, and human handoff paths that respect user consent.
  • Data and privacy: Set standards for PII handling, redaction, consent management, anonymization, and secure storage. Curate, generate, and label data that reflects diverse seniors and scenarios.
  • Serving and MLOps: Ship models to production using efficient inference stacks. Add observability, tracing, rollback, canary releases, and a model registry. Keep the system fast, stable, and affordable.
  • Voice pipeline: Integrate ASR and TTS with expressive prosody, and handle barge-in, turn-taking, and latency budgets for a natural feel.
  • Collaboration: Work with design and research to translate user studies into product requirements. Mentor teammates and help make pragmatic build-vs-buy decisions.

Required skills & experience:

  • Deep Python: Production-grade code, profiling, testing, and packaging.
  • LLM implementation: Strong PyTorch and experience training or fine-tuning open models (e.g., Llama, Mistral, Qwen), including tokenizer issues, data curation, and distributed training with FSDP or DeepSpeed.
  • Inference and optimization: Quantization (GGUF, GPTQ, AWQ), serving stacks (vLLM, TensorRT-LLM, llama.cpp), caching, KV-reuse, streaming, and throughput tuning.
  • Prompt engineering and tool use: System and developer prompts, function calling, tool orchestration, and failure handling. Ability to make prompts measurable and testable.
  • Retrieval-augmented generation: Indexing, chunking, reranking, and grounding. Experience with FAISS, Milvus, Vespa, or Pinecone. Understanding of hallucination mitigation.
  • Evaluation and experimentation: Human ratings at scale, rubric design for empathy and safety, statistical testing, and online A/B testing. Comfort turning qualitative findings into quantitative KPIs.
  • Security and privacy: PII handling, threat modeling for LLMs, prompt-level defenses, rate limiting, and abuse detection. Familiarity with HIPAA-adjacent expectations and SOC 2 practices.
  • Product mindset: Ability to ship thin slices, instrument them, and iterate quickly based on user feedback.

Nice-to-have:

  • Affective computing: Emotion and intent classifiers, prosody features, conversation state tracking, de-escalation strategies.
  • Speech: ASR, diarization, VAD, latency-aware pipelines, expressive TTS.
  • Reinforcement and preference learning: DPO, PPO, ORPO, reward modeling, red-teaming loops.
  • On-device and edge: GPU and CPU constraints, memory mapping, mixed precision, mobile or embedded deployment.
  • Compliance awareness: Experience in healthcare or aging tech, consent UX, accessibility standards.
  • HCI and conversation design: Persona, turn-taking, long-term rapport, and evaluation methods suited for vulnerable users.

What success looks like in 90 days:

  • A production-ready conversational MVP with safety guardrails and memory that passes internal red-team checks.
  • An eval harness with live dashboards for empathy, safety, groundedness, latency, and cost per session.
  • A prompt and policy library with A/B tests running weekly and clear learnings.
  • A data pipeline with redaction, consent flags, and a high-quality instruction-tuning set sourced from real use.

Tools you might use:

 

Python, PyTorch, vLLM or TensorRT-LLM, llama.cpp, Weights & Biases, Ray, FAISS or Milvus, Redis, Postgres, Kubeflow or Flyte, Grafana or OpenTelemetry, Whisper or similar ASR, high-quality TTS, and standard MLOps tooling.

 

About us:

We are an exciting, new, funded, and stealthy AI startup focused on addressing the negative effects of isolation. You will be working with a group of experienced tech entrepreneurs and AI technologists.

 

What we offer:

  • Competitive base salary
  • Cash bonus 
  • Equity stack 
  • Unlimited PTO Plan
  • Dental, Vision, and Health Insurance
  • Hybrid Work Schedule in Pittsburgh, PA
  • We sponsor OPT and STEM OPT only

Job Tags

Full time
