1.
Fang J. Latency-Bounded Embedding Table Partitioning Across Heterogeneous Accelerators for Large-Scale Recommendation Serving. JCEIM [Internet]. 2026 Jun. 29 [cited 2026 Jun. 29];21(3):8-14. Available from: https://jceim.org/index.php/ojs/article/view/216