I build & operate
production systems
at scale.

Systems Engineer & Founder.
Based in Los Angeles.

Keeping hundreds of environments running.

Selected Work

RLA Studios

Founder - 2024

Full-stack business operations platform for a real estate videography company. Lead scraping, invoicing, CRM, commission tracking. One admin panel.

AI Ticketing System

Internal Tool - 2024

AI-powered system that transforms unstructured emails into structured, prioritized tickets with auto-generated task breakdowns and knowledge base integration.

About

streamGuidance.ts

import { OpenAI } from 'openai';

import { WikiService } from './WikiService';

const ai = new OpenAI({

baseURL: "https://generativelanguage.googleapis.com",

apiKey: process.env.GEMINI_KEY,

});

async function *streamGuidance(ticket) {

const wiki = WikiService.getInstance(ticket.workspaceId);

const pages = await wiki.findRelevantPages(ticket.subject);

const docs = await docService.findRelevant(ticket.subject);

const context = buildPrompt({

ticket,

wikiPages: pages.slice(0, 5),

documents: docs.slice(0, 5),

});

const stream = await ai.chat.completions.create({

model: "gemini-2.0-flash",

stream: true,

messages: context.messages,

});

for await (const chunk of stream) {

const token = chunk.choices[0]?.delta?.content;

if (token) yield token;

}

export async function handleGuidanceRequest(req, res) {

res.setHeader("Content-Type", "text/plain");

res.setHeader("Transfer-Encoding", "chunked");

const ticket = await TicketService.getById(req.params.id);

for await (const token of streamGuidance(ticket)) {

res.write(token);

}

res.end();

}

import { OpenAI } from 'openai';

import { WikiService } from './WikiService';

const ai = new OpenAI({

baseURL: "https://generativelanguage.googleapis.com",

apiKey: process.env.GEMINI_KEY,

});

async function *streamGuidance(ticket) {

const wiki = WikiService.getInstance(ticket.workspaceId);

const pages = await wiki.findRelevantPages(ticket.subject);

const docs = await docService.findRelevant(ticket.subject);

const context = buildPrompt({

ticket,

wikiPages: pages.slice(0, 5),

documents: docs.slice(0, 5),

});

const stream = await ai.chat.completions.create({

model: "gemini-2.0-flash",

stream: true,

messages: context.messages,

});

for await (const chunk of stream) {

const token = chunk.choices[0]?.delta?.content;

if (token) yield token;

}

export async function handleGuidanceRequest(req, res) {

res.setHeader("Content-Type", "text/plain");

res.setHeader("Transfer-Encoding", "chunked");

const ticket = await TicketService.getById(req.params.id);

for await (const token of streamGuidance(ticket)) {

res.write(token);

}

res.end();

}

Dashboard

prod-web-03

Engineer building
systems that
run themselves.

system status

all operational

nginx-proxy

99.98%↑14d

api-gateway

99.99%↑14d

postgres-main

100%↑31d

redis-cache

100%↑31d

traefik

99.97%↑61d

Dashboard

prod-web-03

streamGuidance.ts

import { OpenAI } from 'openai';

import { WikiService } from './WikiService';

const ai = new OpenAI({

baseURL: "https://generativelanguage.googleapis.com",

apiKey: process.env.GEMINI_KEY,

});

async function *streamGuidance(ticket) {

const wiki = WikiService.getInstance(ticket.workspaceId);

const pages = await wiki.findRelevantPages(ticket.subject);

const docs = await docService.findRelevant(ticket.subject);

const context = buildPrompt({

ticket,

wikiPages: pages.slice(0, 5),

documents: docs.slice(0, 5),

});

const stream = await ai.chat.completions.create({

model: "gemini-2.0-flash",

stream: true,

messages: context.messages,

});

for await (const chunk of stream) {

const token = chunk.choices[0]?.delta?.content;

if (token) yield token;

}

export async function handleGuidanceRequest(req, res) {

res.setHeader("Content-Type", "text/plain");

res.setHeader("Transfer-Encoding", "chunked");

const ticket = await TicketService.getById(req.params.id);

for await (const token of streamGuidance(ticket)) {

res.write(token);

}

res.end();

}

import { OpenAI } from 'openai';

import { WikiService } from './WikiService';

const ai = new OpenAI({

baseURL: "https://generativelanguage.googleapis.com",

apiKey: process.env.GEMINI_KEY,

});

async function *streamGuidance(ticket) {

const wiki = WikiService.getInstance(ticket.workspaceId);

const pages = await wiki.findRelevantPages(ticket.subject);

const docs = await docService.findRelevant(ticket.subject);

const context = buildPrompt({

ticket,

wikiPages: pages.slice(0, 5),

documents: docs.slice(0, 5),

});

const stream = await ai.chat.completions.create({

model: "gemini-2.0-flash",

stream: true,

messages: context.messages,

});

for await (const chunk of stream) {

const token = chunk.choices[0]?.delta?.content;

if (token) yield token;

}

export async function handleGuidanceRequest(req, res) {

res.setHeader("Content-Type", "text/plain");

res.setHeader("Transfer-Encoding", "chunked");

const ticket = await TicketService.getById(req.params.id);

for await (const token of streamGuidance(ticket)) {

res.write(token);

}

res.end();

}

I work on production infrastructure and applications that support real users at scale. That includes everything from cloud environments and DNS to backend services, automation pipelines, and legacy systems that need to stay online no matter what.

Most of my experience comes from operating live systems, not just building them. Debugging broken payment flows, tracing down infrastructure issues, and keeping hundreds of environments stable has shaped how I approach engineering: keep it simple, make it reliable, and remove as many failure points as possible.

I tend to focus on turning messy, manual processes into clean, repeatable systems. Whether it's internal tools, data pipelines, or full application workflows, the goal is always the same: make it predictable, scalable, and low maintenance.

RLA Studios came out of that same mindset. What started as creative work evolved into building systems behind it, automating everything from client intake to delivery so it can scale without becoming operational overhead.

I'm less interested in perfect architecture diagrams and more in systems that actually hold up in production, under load, with real users.

Outside of work, I'm usually watching tennis or F1, which probably explains why I care a bit too much about performance, consistency, and things working exactly the way they should.

I build & operateproduction systemsat scale.

RLA Studios

AI Ticketing System

I build & operate
production systems
at scale.