Franz Schiller (2025) “Optimizing Attention and Inference in Large Language Models: Balancing Efficiency, Interpretability, and Energy Consumption”, International Multidisciplinary Journal for Research & Development, 12(11), pp. 582–588. Available at: https://www.ijmrd.in/index.php/imjrd/article/view/4103 (Accessed: 6 March 2026).