Fang, J. (2026). Latency-Bounded Embedding Table Partitioning Across Heterogeneous Accelerators for Large-Scale Recommendation Serving. Journal of Computing and Electronic Information Management, 21(3), 8-14. https://doi.org/10.54097/jd5aet13