(1)
Fang, J. Latency-Bounded Embedding Table Partitioning Across Heterogeneous Accelerators for Large-Scale Recommendation Serving. JCEIM 2026, 21 (3), 8-14. https://doi.org/10.54097/jd5aet13.