BLOG / ENGINEERING

Engineering

Technical deep dives into dependency graphs, semantic search, and architecture

All Comparisons Guides MCP & AI Agents Engineering Use Cases Concepts Industry AI Coding

Does Code Health Predict Bugs? 21 Repos, 9 Languages, ROC AUC 0.74

A reproducible defect-prediction study: 21 repos, 9 languages, 2,770 labeled files, ROC AUC 0.74, and 2.3x more defects caught under a fixed review budget.

2026-06-26Read →

engineering11 min read

Is AI-written code buggier than human code? We blamed 112,000 commits to find out

We git-blamed 112,382 commits across 28 repos to test whether AI-agent code introduces more bugs than human code. After controlling for size, it doesn't, and its lines last longer.

2026-06-09Read →

engineering14 min read

Does our code-health score actually predict bugs? A leakage-free benchmark

I scored 21 repos six months before their bugs landed to test whether a deterministic code-health score predicts defects. AUC 0.737, and the honest caveats.

2026-06-03Read →

engineering12 min read

Process metrics beat structural metrics for predicting defects

Complexity and code smells are the metrics everyone reaches for. Across 25 markers and 21 repos, the strongest defect predictors were evolutionary, not structural. The numbers, with file size controlled.

2026-06-03Read →

engineering15 min read

How a 25-Biomarker Code Health Scorer Actually Works

A code health scorer works because one number is only useful if it is built from signals that map to real maintenance cost. A file can look clean and still…

2026-05-20Read →

engineering13 min read

Building a Deterministic PR Review Bot With Zero LLM Calls

A deterministic PR review bot can do useful review work without calling an LLM once. That sounds odd until you break “review” into smaller jobs: parse the…

2026-05-20Read →

engineering9 min read

Why vector-only retrieval misses co-change clusters in monorepos

Co-change analysis git shows which files move together, who owns them, and what breaks next. See why vector-only retrieval misses refactor signals.

2026-05-19Read →

engineering10 min read

Our git co-change model failed on a monorepo refactor

Why co-change analysis git overfit a monorepo refactor, and the guardrails we added after 500 commits of history pointed at the wrong coupling.

2026-05-17Read →

engineering11 min read

The dead-code detector we shipped with Pagerank and git history

Code hotspot detection as ranking, not a linter: Pagerank plus git history sorted quiet files from risky ones, so you can ignore the right 3.

2026-05-08Read →

engineering10 min read

Architectural Decision Records: Capture Why Your Code Is the Way It Is

Every seasoned engineer has experienced the 'archaeology phase' of a project. You’re looking at a specific abstraction—perhaps a custom implementation of a m...

2026-04-01Read →

engineering10 min read

Building a Dependency Graph for Any Codebase (Python, TS, Go, Rust, Java)

Every software engineer has experienced the 'wall of code' phenomenon. You join a new project, clone the repository, and find yourself staring at 100,000 lin...

2026-04-01Read →

engineering9 min read

How We Built repowise: Architecture of a Codebase Intelligence Platform

Most software engineers spend upwards of 70% of their time reading code rather than writing it. Yet, the tools we use to navigate these complex systems have ...

2026-04-01Read →

engineering9 min read

Self-Hosting a Codebase Intelligence Platform with Docker

As engineering teams scale, the primary bottleneck isn't usually writing code—it's understanding it. Between technical debt, architectural drift, and the 'bu...

2026-04-01Read →

engineering10 min read

Semantic Search Over Your Codebase: LanceDB + pgvector in Practice

Every developer has experienced the 'grep fatigue.' You’re navigating a 100k+ LOC codebase, trying to find where a specific business logic—say, the grace per...

2026-04-01Read →