The official repo for "LLoCo: Learning Long Contexts Offline"
Updated Jun 15, 2024 - Python
PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
Stop re-explaining your codebase to AI. Infinite speed memory + code graph for Claude Code & Codex CLI. 17 MCP tools, subagent protocol, hybrid search, TUI dashboard, crash recovery. Save 80-200K+ tokens/session.
Biological code organization system with 1,029+ production-ready snippets - 95% token reduction for Claude/GPT with AI-powered discovery & offline packs
Awesome list of papers on vision-based context compression
OpenClaw low-memory optimization guide for resource-constrained servers (2GB RAM)
天将 (Tianjiang) | Runtime resource-control engine for LLM agents: token budget control, cost optimization, stability guard, and multi-tool agent scheduling
Exploring Context Compression techniques for token reduction. Fine-tuning LLMs for efficient text compression and reduced inference costs, analyzing the trade-offs with Q&A accuracy.
Retriever, Summarizer, and Reader for LLM ODQA (Open-Domain Question Answering) to increase information density
Exploring artificial compressed languages to improve efficiency, context usage, and cross-lingual unification in LLMs
LLM context compression proxy — 40-70% token savings, zero code changes
Detecting silent pivot substitution in LLMs under context compression
a technique for compressing verbose AI tool call outputs into concise summaries, reducing token consumption
Infinite context for AI assistants using semantic compression and retrieval with Gemini
Agent memory runtime: short/long-term context, vector persistence, compression, and personalization primitives.
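Several entries above describe the same core idea: shrinking verbose context (tool outputs, chat history) into a compact form before it reaches the model. As a minimal illustrative sketch of one such technique (elision-based compression of a verbose tool output, as in the tool-call summarization entry above; the function name and parameters are hypothetical, not from any listed repo):

```python
def compress_tool_output(text: str, head: int = 3, tail: int = 2) -> str:
    """Hypothetical sketch: keep the first `head` and last `tail` lines
    of a verbose tool output, replacing the middle with an elision
    marker so the model still sees the start and end of the result."""
    lines = text.splitlines()
    if len(lines) <= head + tail:
        return text  # short enough already; nothing to compress
    omitted = len(lines) - head - tail
    marker = f"... [{omitted} lines omitted] ..."
    return "\n".join(lines[:head] + [marker] + lines[-tail:])
```

Real implementations in the repos above go further (semantic summarization, vector retrieval, learned compression), but the token-saving principle is the same: drop or condense the low-value middle while preserving the load-bearing edges.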