AI / Agentic Systems Architect

Nikita Alexeev

AI / Agentic Systems Architect & ML Engineer

Stavanger, Norway

I design, build and ship AI systems to production — agent and voice systems, RAG pipelines, computer vision, MCP tool servers and n8n automation — with evaluation, observability and guardrails, not notebook prototypes.

Portrait of Nikita Alexeev
Shipping to production
How I work

Principles I don't compromise on.

01

Evaluation-driven

I measure before I ship — precision / recall / F1, cost-per-unit. Numbers decide what's good enough, not vibes.

02

Safe by design

Kill-switch and audit trail on anything autonomous. Governance and guardrails are part of the architecture, not an afterthought.

03

Production, not prototypes

Version control, observability, reproducibility. Systems that run reliably for real users — not notebooks that work once.

Selected work

Systems in production and the ideas behind them.

In production VLM + RAG Orchestrator

Tilsyn.AI — AI inspection platform

A new approach to automated inspection in a regulated industrial domain. A VLM + RAG architecture with an orchestrator, eval-driven quality and a kill-switch.

The core idea is novel and kept private — live walkthrough on request.

Novel Working mobile app

Viskr — real-time AI call coaching

Live transcription, in-call AI coaching and speaker identification on Android. A product category nobody else is shipping.

Kotlin / Compose · Node / WebSocket · Deepgram · Claude
Mobile game

PanteMania

A Norway-first mobile game with an original real-world-to-game mechanic: collect-and-battle gameplay tied to a real national system.

React Native · Express · Prisma
pantemania.no
15th / 380 · solo NM i AI 2026

Multi-agent coordination

Norwegian AI Championship. A MAPF warehouse-robot controller built as a measured optimization ladder — one-way corridors (+129%), PIBT collision resolution, a central dispatcher and an anti-deadlock layer. 15th of 380 teams, entered solo.

MAPF · PIBT · measured step-by-step
Agentic orchestration Pilot 2027

Orkestrator AI

A visual workflow editor over a typed agent-coordination backend — composing and running multi-step agent pipelines. Built for Stavanger kulturskole, with a pilot project planned for next year.

Next.js · tRPC · Claude SDK
Live ML platform SkatteFUNN · Forskningsrådet

JobSwap — commute-matching ML

An AI platform that cuts commutes by matching "mirror twins" and chaining job/home swaps. I built the ML: 21 ONNX-exported models — a traffic GNN, a Siamese compatibility net, a LambdaMART ranker and an ILP chain solver (the kidney-exchange algorithm) — trained on 1M synthetic profiles. R&D approved under SkatteFUNN (Research Council of Norway).

ONNX · XGBoost · GNN · ILP · React + Mapbox (v2)
Shipped product

VibeVox — voice dictation

Windows voice dictation that runs entirely on-device — no audio leaves the machine. Push-to-talk transcription in any app, plus AI text rewriting, transformation and translation. Works fully offline across 26 languages, with near-instant translation.

Python · on-device ASR (Parakeet) · 26 languages · INT8
vibevox.io
Live WebGL

GlobalPrice — prices on a globe

A 36-country electricity, fuel and transport price tracker on an interactive WebGL globe — with markerless head-tracked perspective: a webcam shifts the 3D viewpoint in real time as you move, giving the scene real depth.

Node / Express · WebGL · webcam face-tracking
Live AI product FINN.no

PrisSnap — AI pricing for FINN.no

Point your phone at an item and get an instant price estimate plus a ready-to-post Norwegian listing — Gemini Vision reads the photo, prices it and writes the ad in seconds. Web + Android.

Gemini Vision · Android · web
prissnap.no
Live Edge CV Privacy-first

SkiPuls.AI — crowd estimation

Crowd estimation for ski resorts that runs object detection on CPU at the edge with ROI masking — and discards every frame after inference. No facial recognition, no stored images.

FastAPI · Next.js · Postgres · Docker
skiai-demo.netlify.app
Motion sim Sensor fusion

SIMO — drone-cinema motion engine

The motion pipeline behind Heimdall Innovation's flying-cinema pod: real FPV-drone gyroscope/IMU data is replayed through a classical washout filter into a motorized chair, so the seat banks, climbs and dives in sync with first-person flight footage. Quaternion telemetry → washout → chair rotation, synced to the video clock.

Unity 6 · C# · IMU quaternions · washout filter · Yaw VR
AI voice agents

Voice agents you can call right now.

Live ElevenLabs Conversational AI agents I built — each speaks Norwegian and English. Open one and talk to it in the browser.

Live ElevenLabs NO / EN

SIMO — in-cabin agent

The in-cabin voice assistant for SIMO, a motion-simulator platform (Heimdall Innovation) — answers questions and guides the session by voice.

Talk to it
Live ElevenLabs NO / EN

Stavanger Kulturskole — secretary

A front-desk voice agent for a municipal music school: handles inquiries and booking-style requests in natural speech.

Talk to it
Live ElevenLabs NO / EN

Visam AI agent

A customer-facing voice agent for Visam AS — bilingual and conversational, available around the clock.

Real-time Twilio + OpenAI Realtime

VoxBridge — live call translation

Two-way phone-call translation in real time: each caller hears the other in their own language, synchronously — built on Twilio Media Streams and OpenAI's Realtime API.

Twilio Media Streams · OpenAI Realtime · WebSocket
AI automation & integrations

Agents that do the work, end to end.

n8n-orchestrated pipelines with AI agents in the loop — scraping, scoring, generating and publishing, with a human approving the calls that matter.

Production n8n backend

FinnFunn — AI-driven social automation

A social-media automation system driven by AI agents with n8n in the backend: it scrapes, scores, generates and schedules content daily, with a Telegram human-in-the-loop approval step before anything goes live. It runs in production — publishing daily to its Facebook community: daily FINN.no finds with a short price analysis.

n8n · Claude / Gemini / GPT · Deepgram · Supabase
FinnFunn on Facebook
200+ workflows Integrations

n8n workflows & integrations

Production automation across products — lead scraping and enrichment, cold-email outreach, video generation and booking chatbots — wired into Brevo, Google Calendar, Places and Supabase, with retries, throttling and idempotency.

n8n · Brevo · Google Calendar / Places · webhooks
Stack

What I build with.

Languages

PythonTypeScriptSQLKotlinC# / UnityGo

AI / ML

ClaudeOpenAIGemini / GemmaVLM / multimodalRAG (pgvector)NN trainingObject detectionSTT (Parakeet, Deepgram)OpenAI RealtimeQuantization (FP8/INT8)ONNXvLLMEval (P/R/F1)

Agents

MCP servers & tool-useMulti-agent orchestrationVoice agents (ElevenLabs)n8n orchestrationHuman-in-the-loopReAct / plan-executeGuardrailsMulti-provider LLM gateway

Backend / infra

FastAPINext.jsNode / Expressn8nSupabase / PostgresTwilioBrevoGoogle APIsDockervast.ai GPUVercel / Netlify / Hetzner
Get in touch

Let's talk.

Open to senior AI roles in Norway. The fastest way to reach me is LinkedIn — happy to give a live walkthrough of any of the work above.