Home
About
Projects
Ayman's Corner
Ayman's Corner
Loading knowledge graph...
Posts
Post-Training GPT-OSS-20B for ML Research Coding
Post-training • ML Systems • Research Process
May 18, 2026
A concise technical report on evaluator repair, SFT failure analysis, and early post-training results for GPT-OSS-20B on ML research coding tasks.
6 min read
PolyPPO for Bidirectional Decoding: Early Evidence From PushT
Reinforcement Learning • Robotics • Research Process
May 17, 2026
Early PushT evidence for PPO over VQ-BeT RVQ code IDs, including negative pass@1 results and a stronger same-start pass@k signal.
5 min read
Risk-Aware Supervision for OpenPI Robot Policies on LIBERO
Robotics • Risk Supervision • VLM Calibration
April 29, 2026
A technical summary of a project that uses calibrated VLM-based risk estimates to decide when a robot foundation policy should execute or abstain.
7 min read