Franz Schiller. “OPTIMIZING ATTENTION AND INFERENCE IN LARGE LANGUAGE MODELS: BALANCING EFFICIENCY, INTERPRETABILITY, AND ENERGY CONSUMPTION”. International Multidisciplinary Journal for Research & Development 12, no. 11 (November 27, 2025): 582–588. Accessed March 6, 2026. https://www.ijmrd.in/index.php/imjrd/article/view/4103.