Nairobi · Production Systems · Africa-First

Real AI & Data Systems, Built Under African Constraints.

Most technical content assumes stable power, cheap GPUs, and fast internet. I document what actually works in production. Systems engineered in Kenya, tested against the constraints most tutorials never mention.

Read by engineers building real systems.

Scroll

Signal Feed

Live signals from X, Hacker News, GitHub, and Reddit

Data Engineering

Pipelines & ETL Systems

Most failures are not the model. They are the pipeline.

Live Now

Artificial Intelligence

LLMs & Deep Learning

Agents are overhyped. Reliability is the real bottleneck.

Live Now

Machine Learning Systems

Model Training & Deployment

Training metrics lie. Production metrics tell the truth.

Live Now

AI Automation

Agents, MLOps & Workflows

The hardest part is not building the agent. It is knowing when not to.

Live Now

From the Field

Systems analysis, architecture deep dives, production postmortems, and applied research. Written from experience, not speculation.

The Hidden Cost of Switching Embedding Models
In Progress

The Hidden Cost of Switching Embedding Models

7 min read·Mar 2025·MLOps

We changed our embedding model mid-project. Here's what broke, what we didn't anticipate, and what re-indexing at scale actually costs.

Coming soon

12 published · 1 in progress

Ongoing

Signature Series

A 3-part production series. Read in order. Each part builds on the last.

PART 01

Signature Series

Ongoing

How It Actually Runs

Real systems, explained without the marketing diagram.

Every system that runs at scale has a gap between the architecture diagram and what's actually deployed. This series tears down real systems — payments infrastructure, data platforms, ML pipelines — and explains exactly how they work in production, constraints included.

System internalsArchitecture tradeoffsAfrican deployment contexts

In this series

01

How M-Pesa Handles 10 Million Transactions a Day

02

Inside a Real-Time Fraud Detection Pipeline

03

How Recommendation Engines Work at Low Data Volume

Starting soon

PART 02

Signature Series

Ongoing

If I Built This Today

Retrospective system design. What experience actually changes.

Looking back at systems I actually built in production. Not a highlight reel — the real decisions I made, why I made them given the constraints at the time, and what I would do completely differently with what I know now.

Decision archaeologyConstraint-driven tradeoffsHard-won redesigns

In this series

01

The Data Pipeline I Regret Building

02

Our First ML System: Every Wrong Assumption, Documented

03

Rebuilding a Feature Store After It Failed at Scale

Starting soon

PART 03

Signature Series

Ongoing

What Breaks in Production

Honest postmortems. No blame, no hand-waving.

Production failures documented in full: what the symptoms were, what we initially diagnosed, what was actually wrong, and what changed so it would not happen again. Written as engineering records, not incident theater.

Real failure modesRoot cause analysisWhat changed afterward

In this series

01

The Silent Data Loss We Found Three Months Late

02

When Our Model Passed All Tests and Failed in Production

03

The Latency Regression We Shipped to 100k Users

Starting soon

Start with Part 01 and read in sequence for the full picture.

Applied System Design

Research

Working papers on AI reliability, uncertainty quantification, and trustworthy systems. Built from production experience, not theoretical defaults.

Working Paper2024

A Reliability-First Framework Combining Adaptive AI with Hallucination Safeguards

Autonomous AI agents in enterprise create a specific tension: the system needs to adapt, but it also needs to produce outputs you can stake a business decision on. This framework combines Reinforcement Learning, RAG, and uncertainty quantification. The core contribution is the H-Score, a metric for measuring hallucination risk as a system property across IT, healthcare, and logistics.

H-Score

Hallucination risk metric for enterprise AI agents

CWAS

Confidence-Weighted Action Selection for reliable RL

REAC

Enterprise Agent Corpus for benchmarking

Reinforcement LearningRAGHallucination MitigationEnterprise AIAI Safety
Read Full Paper
Working Paper2025

CRISIS-LLM 2.0: Trustworthy AI for Disaster Response

Existing disaster AI fails on three measurable fronts: false alerts from unverified social data, no real-time multimodal verification, and black-box decisions responders cannot interrogate. CRISIS-LLM 2.0 addresses all three through Cascading Trust Propagation, fusing satellite imagery, drone feeds, IoT sensors, and LLMs into one uncertainty-aware pipeline targeting a 30% false alert reduction.

Cascading Trust

30% false alert reduction vs text-only baselines

Multimodal Fusion

Satellite, drone, IoT, and social data unified

Uncertainty UI

Confidence scoring for emergency operators

LLMsMultimodal AIDisaster ResponseUncertainty QuantificationAI Safety
Read Full Paper
Working Paper2025

Adaptive Multimodal AI for Real-Time Disaster Response Optimization

Kenya loses an estimated KES 300 billion annually to climate-related disasters. The AI tools used in response still rely on text-based social media analysis, static GIS maps, and delayed satellite feeds. This framework fuses IoT sensors, satellite imagery, and social media into a Hybrid Transformer and RL architecture that generates probabilistic risk maps and optimizes resource allocation in real time, validated against 2024 Kenyan flood and landslide events.

Hybrid Transformer-RL

Multimodal fusion feeding a Deep Q-Network for real-time resource optimization

30% Response Reduction

Target improvement validated against real Kenyan disaster events

Edge-First Design

Lightweight inference for field devices in low-connectivity East Africa

Multimodal AIReinforcement LearningDisaster ResponseEast AfricaEdge Computing
Read Full Paper
Working Paper2025

Dynamic Bias Mitigation in Social Media Recommender Systems

Algorithmic bias in recommender systems is not a bug you fix once. It drifts continuously as user behavior shifts, social events unfold, and platforms update their models. Static fairness audits catch yesterday's bias. This framework uses bias-aware Gated Recurrent Units to detect fairness drift in real time and online constrained optimization to recalibrate recommendation weights continuously, targeting 40% disparity reduction without full model retraining.

Temporal Bias GRU

Real-time drift detection across gender, language, and geographic protected attributes using rolling recommendation outcome sequences

Online Fairness Optimizer

Lagrangian recalibration via gradient descent maintaining equal opportunity difference below 0.1 continuously

Bias Drift Benchmark

First empirical dataset of fairness drift trajectories across election cycles, viral events, and algorithm updates

Algorithmic FairnessRecommender SystemsGRUOnline LearningAI Ethics
Read Full Paper
Working Paper2025

Explainable AI for Carbon Accountability in Decentralized Energy Systems across Sub-Saharan Africa

Sub-Saharan Africa contributes 3% of global emissions but bears the worst climate consequences. The continent's 3,000+ minigrids displace diesel with solar but have no carbon accounting layer. This framework combines LSTM, SHAP, and Multi-Objective RL to optimize battery health, energy dispatch, and carbon attribution simultaneously, with explainability designed for community cooperative operators in Swahili, not just data scientists.

LSTM-SHAP Battery AI

Hot-climate battery degradation prediction calibrated for East African PAYG solar conditions

MORL Carbon Optimizer

Multi-objective RL balancing cost and diesel displacement across P2P minigrid nodes

Community-Trusted XAI

SHAP explanations legible to cooperative operators and EPRA regulators, not only analysts

Explainable AIReinforcement LearningClean EnergyEast AfricaCarbon Accounting
Read Full Paper
Working Paper2025

Privacy-Preserving Generative AI for Secure Healthcare Data Sharing in Kenya

Kenya's health AI research is stuck in a standoff: researchers need patient records to train clinical models, but hospitals cannot legally share them under the Data Protection Act 2019. This framework combines Wasserstein GANs with differential privacy to generate synthetic EHRs that are statistically realistic, clinically useful for malaria and maternal health tasks, and provably private, designed for Kenya's 47-county devolved health infrastructure.

DP-GAN Framework

Wasserstein GAN with differential privacy for synthetic EHR generation under Kenya's DPA 2019

Privacy-Utility Balance

Jensen-Shannon divergence metrics preserving clinical signal for malaria and maternal health AI

County-Ready Deployment

Lightweight Docker container integrating with KHIS/DHIS2 across 47 county health systems

Generative AIDifferential PrivacyHealthcare AIEast AfricaData Governance
Read Full Paper

Experiments · Prototypes · Real Implementations

System Lab

Real systems built under real constraints. Each entry documents the problem, the design decisions, and what actually happened — including what failed. Not a portfolio. An engineering record.

QdrantBGE-smallFastAPIPython
SQLiteDuckDBAirflowPython
PythonSentence TransformersNLI ModelsPrometheus
FaustIsolation ForestRedisFastAPI

About the Author

Sam Odongo

Sam Odongo

Senior Data Engineer · Applied Data Scientist

Open to remote roles

I build AI and data systems where most architectures fail. Low bandwidth, unstable infrastructure, real users, real constraints.

Most technical content assumes stable power, cheap GPUs, and fast internet. I work in Nairobi, Kenya, where none of those are guaranteed. That gap between theory and production is where I live and where I write from.

I hold an MSc in Artificial Intelligence and have spent years building production data systems across fraud detection, real-time streaming, RAG pipelines, and MLOps infrastructure. Every system I document here has been tested against constraints most tutorials never mention.

If you are building systems that have to work in the real world, not just on a benchmark, this is where you should be.

Newsletter

Join engineers learning how to build AI systems that don't break in production.

Real systems. Real constraints. Written from Nairobi, tested against the infrastructure most tutorials never mention. No spam. No theory. Unsubscribe anytime.

Written by a practitioner, not a marketer.