Fatescript

me.jpeg

I currently work at Moonshot AI. My interests center on large language models, training and inference systems, and the infrastructure that turns models into reliable products.

Before Moonshot, I worked at Megvii on computer vision libraries and deep learning infrastructure. I was one of the contributors to YOLOX, an anchor-free object detector, and helped redesign the library before its public release. I also maintained cvpods, and worked on MegEngine-based classification and detection frameworks such as basecls and basedet.

This blog is mostly where I write down engineering details, model-system observations, and long-term reflections. I care about explanations grounded in practice: code paths, numerical behavior, distributed training details, and the small decisions that decide whether a system actually works.

Please feel free to reach out if you want to discuss deep learning systems, LLM infrastructure, computer vision engineering, or anything you find interesting here.

selected publications

  1. Kimi K2.5: Visual Agentic Intelligence
    Kimi Team, and Feng Wang
    arXiv preprint arXiv:2602.02276, 2026
  2. Kimi K2: Open Agentic Intelligence
    Kimi Team, and Feng Wang
    arXiv preprint arXiv:2507.20534, 2026
  3. Baichuan 2: Open Large-scale Language Models
    Aiyuan Yang, Bin Xiao, Bingning Wang, and 52 more authors
    arXiv preprint arXiv:2309.10305, 2023
  4. YOLOX: Exceeding YOLO Series in 2021
    Zheng Ge, Songtao Liu, Feng Wang, and 2 more authors
    arXiv preprint arXiv:2107.08430, 2021
  5. cvpods: All-in-one Toolbox for Computer Vision Research
    Benjin Zhu*, Feng Wang*, Jianfeng Wang, and 3 more authors
    2020