[1]
Fang, J. 2026. Latency-Bounded Embedding Table Partitioning Across Heterogeneous Accelerators for Large-Scale Recommendation Serving. Journal of Computing and Electronic Information Management. 21, 3 (Jun. 2026), 8–14. DOI:https://doi.org/10.54097/jd5aet13.