Serkan Altuntaş

Software Architect in Istanbul, with a computational biology background. Learning LLMs in public.

Latest in Journal

GCTX passes its first data-readiness gate

Jun 28, 2026 · note · GCTX now has enough reviewed DEV and locked REPORT data to start the first 60M-100M proof language-model run.

Starting GCTX: a small model for Git diffs

Jun 21, 2026 · note · A starting note for GCTX, a from-scratch small language model project for reading Git diffs and writing Conventional Commit messages.

chess-zero: the platform's done, the intelligence isn't

Jun 16, 2026 · note · A month in, chess-zero has a complete, verified chess platform and no learned intelligence yet — the hand-written ML pipeline is what comes next.

A Research-Use Preview of the Gene-Disease Evidence Index

Jun 14, 2026 · note · A short note announcing the public preview of the Gene-Disease Evidence Index.

Nexus and the Shape of Information

Jun 7, 2026 · note · A note on Yuval Noah Harari's Nexus and how information networks shape trust, power, and AI.

Starting chess-zero

May 19, 2026 · note · Starting a hand-built chess engine and AlphaZero-style RL pipeline as a learning project for the ML side.

Featured Projects
AI Usage Standard (AIUS) · Alpha

Deterministic AI-involvement disclosure standard for written artifacts. Six dimensions × four levels → five-tier glance-readable badge, with strict frontmatter validation. Live across this site; mirror repo for adopters.

Fresh 54% · Next: Phase 2, the npm package (@serkanaltuntas/ai-usage-badge)
GCTX: a language model for Git diffs · Building

A from-scratch small language model family for understanding Git diffs and writing Conventional Commit messages, with gitctx as the CLI and product shell around the model.

Fresh 58% · Next: Train the first specialized commit-message language model
Gene-Disease Evidence Index · Building

A research-use, versioned index of source-attributed gene-disease evidence, with a public preview aggregate covering twelve diseases and 120 associations.

Fresh 91% · Next: Complete the final public-access evidence, the existing-resource comparison, the source freshness/freeze checks, and the DOI/no-DOI archive decision
About

Fifteen years of software, with a computational biology and bioinformatics background. Currently learning LLMs in public — distillation, fine-tuning, RAG, training internals.
