[1]
J. Fang, “Latency-Bounded Embedding Table Partitioning Across Heterogeneous Accelerators for Large-Scale Recommendation Serving”, JCEIM, vol. 21, no. 3, pp. 8–14, Jun. 2026, doi: 10.54097/jd5aet13.