AI Platform Engineer

Designing the infrastructure of intelligence.

I build high-throughput inference engines, speculative inference pipelines, and autonomous vision systems that run on the edge.

01

Full Scale Platform

CongressTrading.app

An automated transparency system that tracks public legislative records. Its Natural Language Query Engine parses unstructured government filings into structured records.

FastAPI, HTMX / Alpine.js, Vector DB
02

Open Source AI

03

Research & Feasibility

Infrastructure
2.1x

Throughput Speedup

High-Frequency Risk Engine

Migrated a fraud detection system from CPU-bound Python to NVIDIA Triton. Reached 1,088 RPS by bypassing Python's serialization overhead with the C++ FIL Backend.

NVIDIA Triton, FIL Backend, Fintech
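
A throughput comparison like the one behind the speedup figure above could be measured roughly as follows. This is a minimal sketch, not the project's actual harness: `measure_rps` and `speedup` are hypothetical helper names, the two handlers are stand-ins for a CPU-bound Python scorer and a Triton-backed one, and requests are issued sequentially, whereas a real load test would likely use concurrent clients.

```python
import time


def measure_rps(handler, n_requests):
    """Call handler(i) n_requests times and return requests per second."""
    if n_requests <= 0:
        raise ValueError("n_requests must be positive")
    start = time.perf_counter()
    for i in range(n_requests):
        handler(i)
    elapsed = time.perf_counter() - start
    return n_requests / elapsed


def speedup(baseline_rps, candidate_rps):
    """Ratio of candidate throughput to baseline throughput."""
    return candidate_rps / baseline_rps


# Hypothetical stand-ins for the two serving paths (assumptions, not real models).
def slow_handler(i):
    sum(range(2000))  # simulates a heavier per-request cost


def fast_handler(i):
    sum(range(200))  # simulates a lighter per-request cost


if __name__ == "__main__":
    base = measure_rps(slow_handler, 2000)
    cand = measure_rps(fast_handler, 2000)
    print(f"baseline: {base:.0f} RPS, candidate: {cand:.0f} RPS, "
          f"speedup: {speedup(base, cand):.1f}x")
```

Reporting the ratio alongside the absolute RPS keeps the headline number tied to a concrete baseline.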
Infrastructure
33x

Throughput Increase

Inference Throughput Benchmark

A stress test for the proposed "Real-Time News" feature. Benchmarked vLLM (with PagedAttention) against HuggingFace on a Google Colab T4 to test whether a single GPU could keep up with the global news cycle.

vLLM / PagedAttention, Inference Profiling
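
The idea behind PagedAttention in the benchmark above can be illustrated with a toy memory count: instead of reserving KV-cache space for the maximum sequence length per request, the cache is handed out in fixed-size blocks as tokens arrive. The sketch below is only an illustration of that bookkeeping, not vLLM's implementation. The function names, the sequence lengths, and the 2048-token limit are hypothetical, and a block size of 16 tokens is assumed.

```python
import math


def contiguous_slots(seq_lens, max_len):
    """Token slots reserved if every request preallocates max_len slots."""
    return len(seq_lens) * max_len


def paged_slots(seq_lens, block_size=16):
    """Token slots reserved if each request holds only the whole blocks it needs."""
    return sum(math.ceil(n / block_size) * block_size for n in seq_lens)


if __name__ == "__main__":
    seq_lens = [37, 120, 512, 9]  # hypothetical per-request token counts
    max_len = 2048                # hypothetical model context limit
    used = sum(seq_lens)
    for name, reserved in [
        ("contiguous", contiguous_slots(seq_lens, max_len)),
        ("paged", paged_slots(seq_lens)),
    ]:
        print(f"{name}: {reserved} slots reserved, "
              f"{used / reserved:.1%} actually used")
```

With paging, waste is bounded by at most one partially filled block per request, which is what lets more concurrent sequences fit on a single memory-constrained GPU.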