Skip to main content
Cerebras Systems

AI Engineering Intern - Growth Team

3w

Cerebras Systems

Toronto, CA · Internship · $65,000 – $85,000

About this role

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip. This delivers industry-leading training and inference speeds for large-scale ML applications.

The Growth Team drives AI adoption across Cerebras as a multi-disciplinary group owning product, engineering, and marketing. We build agentic workflows, internal knowledge systems, and developer infrastructure using Claude Code, MCP, RAG pipelines, and multi-agent architectures. You'll embed with kernel, design verification, and cloud platform engineers to ship AI tooling.

Own end-to-end workstreams by scoping problems with engineering teams, building agentic systems, iterating on user feedback, and shipping internal tools. Projects target AI agents for design verification and ASICs, kernel and model bringup, and cloud platform SRE workflows. Work directly with teams building Cerebras chips and inference platform.

This 12-week paid in-person internship runs June through August 2026 in Toronto or Sunnyvale offices. Capstone involves shipping a real tool like a RAG-powered knowledge base, MCP integration, or multi-agent system. Accelerate actual hardware and software development workflows that engineers use daily.

Requirements

  • Experience building agentic workflows and multi-agent architectures
  • Familiarity with RAG pipelines and internal knowledge systems
  • Knowledge of Claude Code and MCP (Model Context Protocol)
  • Ability to develop developer infrastructure for engineering teams
  • Interest in AI hardware, silicon design verification, and cloud platforms
  • Skills in accelerating chip development and model bringup processes

Responsibilities

  • Scope problems with engineering teams and own end-to-end workstreams
  • Build AI agents for design verification and ASICs including automated test generation and debug triage
  • Develop agentic workflows to accelerate kernel and model bringup on Cerebras hardware
  • Create AI agents for cloud platform and SRE using automated log analysis and intelligent runbook execution
  • Integrate Claude Code, MCP, and RAG systems to speed up engineering processes
  • Iterate on user feedback and ship a working internal tool by internship end

Benefits

  • 12-week paid internship
  • Ship internal tools that engineers actually use
  • Embed with teams building Cerebras chips and inference platform
  • Work on world's fastest Generative AI inference solution