repowiserepowise
Sign in

Repository

amithathreya/inference_layer
OverviewDocsArchitectureKnowledge GraphFilesCode HealthRefactoring

People & History

CommitsChange CouplingContributorsDecisions
ChatPro
By the Numbers
repowiserepowise
ExplorePricingDocs
Sign inIndex repoIndex your repo free
repowiseamithathreya/inference_layer

inference_layer

github.com/amithathreya/inference_layer

8c88c66mastersynced 12h ago

Files

12

Symbols

51

Doc Coverage

0%

Dead Exports

0

Entry Points

1

Code HealthView report

Average health
8.9/10
Excellent
Hotspot health
8.2/10
Excellent
Open findings
8findings
high1medium4low3
Worst fileKVCacheManager/kvcache.py8.2/10
Maintainability9.7/10ExcellentPerformance0All clear

Agent savingsView costs →

Trim what your agent reads with repowise distill and the MCP tools — savings show up here.

Open the Costs page →

Attention Needed20

1. Initialize the LLaMA model Note: llama-cpp-python requires GGUF formatted model files, not raw HFNeeds Review

Auto-proposed decision awaiting review

Use smaller quantization if llama-server OOMsNeeds Review

Auto-proposed decision awaiting review

Recommend Q4_K_M quantization for CPU-only setupsNeeds Review

Auto-proposed decision awaiting review

Add authentication and rate limitingNeeds Review

Auto-proposed decision awaiting review

Add SSE streaming for UINeeds Review

Auto-proposed decision awaiting review

No commit history

Commit activity appears once git history is indexed.

Recent DecisionsView all

1. Initialize the LLaMA model Note: llama-cpp-python requires GGUF formatted model files, not raw HFproposed
· code comment
Use smaller quantization if llama-server OOMsproposed
· readme mining
Recommend Q4_K_M quantization for CPU-only setupsproposed
· readme mining
Add authentication and rate limitingproposed
· readme mining
Add SSE streaming for UIproposed
· readme mining
Implement KV cache management via PagedAttentionproposed
· readme mining

Top Hotspots1 of 1View all

KVCacheManager/kvcache.py

4c/90d91%

Commit Activity

Feature Fix Refactor Dependency