Fang, Jingyi. “Latency-Bounded Embedding Table Partitioning Across Heterogeneous Accelerators for Large-Scale Recommendation Serving.” Journal of Computing and Electronic Information Management, vol. 21, no. 3, June 2026, pp. 8-14, https://doi.org/10.54097/jd5aet13.