Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

This repository contains the core implementation for paper Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control.

Peihao Wang, Shan Yang, Xijun Wang, Tesi Xiao, Xin Liu, Changlong Yu, Yu Lou, Pan Li, Atlas Wang, Ming Lin, Rene Vidal

Overview

TTC (Test-Time Control) is an LLM layer that takes initial memory states as inputs and outputs the solution to a receding-horizon optimal control problem with hardware-efficient LQR solvers.

The codebase provides:

TTCLayer for plugging TTC into neural architectures.
Multiple LQR solver backends:
- riccati (direct PyTorch)
- kkt (dual/KKT PyTorch)
- fused (Triton fused kernel)

Environment

The code in ttc/ depends on:

torch==2.8.0+cu128
triton==3.5.1
fla==0.3.1

Usage

You can use TTC directly from the package:

import torch
from ttc import TTCLayer

x = torch.randn(2, 128, 8, 32, device="cuda", dtype=torch.float32)  # [batch, seq, num_heads, in_dim]

ttc = TTCLayer(
    in_dim=32,
    out_dim=32,
    h_dim=16,
    num_heads=8,
    b_rank=16,
    q_rank=16,
    solver_impl="fused",
).cuda()

y = ttc(x, T=64)

Citation

If you find this repository useful, please cite:

@article{wang2026beyond,
  title   = {Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control},
  author  = {Wang, Peihao and Yang, Shan and Wang, Xijun and Xiao, Tesi and Liu, Xin and Yu, Changlong and Lou, Yu and Li, Pan and Wang, Atlas and Lin, Ming and Vidal, Rene},
  journal = {arXiv preprint arXiv:2603.09221},
  year    = {2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
docs		docs
ttc		ttc
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

Overview

Environment

Usage

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

Overview

Environment

Usage

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages