AI Engineer — Public Job Posting

External recruiting copy for the Senior AI Engineer hire. The posting requires production LLM systems experience, treats cost discipline as an engineering surface, and rules out LangChain. Process: 30-minute CEO screen → 60–90 minute technical conversation with Seth → paid week-long trial → offer, about 2–3 weeks end to end.

Caseproof, LLC · Remote (US-friendly hours preferred)

Caseproof is the company behind MemberPress, the most widely used WordPress membership platform. We’re privately held, profitable, and bootstrapped. We’re building a new AI-powered product, and we’re hiring the engineer who will own the AI substrate that powers it.

Internal companions: internal AI Engineer JD (decision rights, success measures, reporting cadence) · Seth — Lead Architect JD (the role this position reports to).


The Role

You’ll own the AI core of a new product that will be used weekly by thousands of paying customers. Inference pipeline, retrieval, prompt versioning, eval suite, cost discipline, model-improvement loop… all of it.

This is a shipping role, not a research role. We’re not training models or publishing papers. We’re building a production system that has to be accurate, cheap to run, and steadily better month over month. You’ll report to our senior engineering lead and work alongside a small, focused team.


What You’ll Do

  • Design and run the inference pipeline. Retrieval-augmented generation with structured tool calls, citation-grounded responses, tier-aware model routing.
  • Own prompt versioning and the eval suite. Real evals, with adversarial cases and release-blocking gates. Vibes-based evals don’t ship here.
  • Own cost telemetry and cost discipline. Per-user caps, model routing enforcement, caching, abuse detection. The product has a free tier; you’re accountable for keeping it profitable at scale.
  • Build the feedback loop that makes the system improve over time.
  • Iterate on quality continuously based on real-customer signals.

What We Need

  • You have personally shipped at least one production LLM-powered product or feature that real users rely on. You can describe its prompts, evals, and cost telemetry in detail because you built them.
  • You’ve experienced and recovered from at least one of: prompt drift, model version regression, retrieval quality degradation, cost overrun, hallucination incident, eval-suite failure that blocked a release.
  • Strong full-stack engineering. Comfortable in Python or TypeScript, comfortable with Postgres and SQL, comfortable owning integrations end-to-end.
  • Vector store experience (pgvector, Pinecone, or equivalent).
  • Hands-on Anthropic API experience preferred; OpenAI API also fine.
  • You think about prompt drift, eval coverage, cost discipline, and feedback loops as engineering surfaces. Not afterthoughts.

What We Don’t Need

  • Research scientists. We’re not training models.
  • LangChain wrappers. We own our orchestration in-house.
  • Resumes heavy on AI buzzwords and light on shipped systems.

Compensation

Generous compensation and bonus structure. Health, dental, and vision for US employees. Remote-first, with US Mountain Time overlap preferred.


How to Apply

Send the following to hiring@caseproof.com:

  1. A short cover note (under 300 words) describing the most interesting LLM-powered system you’ve shipped, what was hard about it, and what you’d do differently next time.
  2. A link to a portfolio piece, GitHub repo, or write-up that shows your work.
  3. Resume.
  4. Your expected compensation range and your earliest start date.

Process

We move fast. About 2–3 weeks end to end.

  1. 30-minute screen with the CEO.
  2. 60–90 minute technical conversation with the engineering lead. Walk us through one of your shipped LLM systems in depth.
  3. Paid week-long trial project on a scoped problem.
  4. Final conversation. Offer typically within 48 hours.

For: AI Engineer, Blair Williams, Seth Shoultes