AI Engineering Intern - Growth Team

2 months ago

Cerebras Systems

Toronto, CA · Internship · $65,000 – $85,000

About this role

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip. This delivers industry-leading training and inference speeds for large-scale ML applications.

The Growth Team drives AI adoption across Cerebras as a multi-disciplinary group owning product, engineering, and marketing. We build agentic workflows, internal knowledge systems, and developer infrastructure using Claude Code, MCP, RAG pipelines, and multi-agent architectures. You'll embed with kernel, design verification, and cloud platform engineers to ship AI tooling.

You'll own end-to-end workstreams: scoping problems with engineering teams, building agentic systems, iterating on user feedback, and shipping internal tools. Projects target AI agents for design verification and ASICs, kernel and model bringup, and cloud platform SRE workflows. You'll work directly with the teams building Cerebras chips and the inference platform.

This 12-week, paid, in-person internship runs June through August 2026 at our Toronto or Sunnyvale office. The capstone involves shipping a real tool, such as a RAG-powered knowledge base, an MCP integration, or a multi-agent system. Your work will accelerate the hardware and software development workflows that engineers use daily.

Requirements

Experience building agentic workflows and multi-agent architectures
Familiarity with RAG pipelines and internal knowledge systems
Knowledge of Claude Code and MCP (Model Context Protocol)
Ability to build developer infrastructure for engineering teams
Interest in AI hardware, silicon design verification, and cloud platforms
Skills in accelerating chip development and model bringup processes

Responsibilities

Scope problems with engineering teams and own end-to-end workstreams
Build AI agents for design verification and ASICs, including automated test generation and debug triage
Develop agentic workflows to accelerate kernel and model bringup on Cerebras hardware
Create AI agents for cloud platform and SRE workflows using automated log analysis and intelligent runbook execution
Integrate Claude Code, MCP, and RAG systems to speed up engineering processes
Iterate on user feedback and ship a working internal tool by internship end