Large Language Models and Information Retrieval in the Digital Environment: a theoretical analytical study

Authors

  • Mohammed Abdullah Saeed Al-Amri King Abdulaziz University Author

DOI:

https://doi.org/10.59992/IJCI.2025.v4n6p1

Keywords:

Large Language Models, Smart Language Models, Information Retrieval, Information Retrieval in the Digital Environment

Abstract

This study explores the role of Large Language Models (LLMs) in information retrieval within the digital environment through a theoretical analysis of their concepts, operational mechanisms, and a comparison with traditional methods, alongside identifying key challenges and contemporary applications. The findings reveal that LLMs represent a qualitative shift in processing natural language due to their ability to understand context and generate precise responses. The study highlights their superiority in enhancing retrieval systems through integration with cognitive technologies such as Retrieval-Augmented Generation (RAG) and Knowledge Graphs (KGs), thereby improving the reliability and effectiveness of results, especially in specialized domains. Despite these capabilities, the study identifies technical and methodological challenges, including hallucination and limited interpretability. It emphasizes that LLMs do not replace traditional retrieval methods but complement them, depending on task nature and user behavior. The study recommends developing hybrid models, enhancing multimodal capabilities, and expanding real-world evaluations—particularly in low-resource languages and specialized fields. It concludes that integrating LLMs with structured knowledge representations offers a promising path toward building more accurate, equitable, and intelligent information retrieval systems.

Author Biography

  • Mohammed Abdullah Saeed Al-Amri, King Abdulaziz University

    PhD Researcher, Knowledge Management, King Abdulaziz University, Saudi Arabia

References

المراجع العربية:

• إبراهيم، س. ر. س.، & زكريا، م. ش. (2023). الآثار المترتبة على: ChatGPT نحو تبني مولدات (المحادثات الاسترجاعية) (النماذج المستحدثة لمعالجة اللغة في البيئة الأكاديمية ومجال المكتبات). Arab Journal for Archives, Documentation & Information (AJADI), 54, 107–132. ارجع لهذه الدراسة مفيدة جداً في موضوعنا يمكن استنتاج منها المميزات والعيوب لاسترجاع المعلومات من خلال هذه التقنية.

• أبو غنيمة، عيد محمد عبدالعزيز، والزعليك، محمد السيد عبدالبر. (2024). استخدام نموذج مقترح لتدريس فيزياء المتحكمات الدقيقة قائم على النمذجة العلمية والنماذج اللغوية الكبيرة لتنمية التفكير الحاسوبي والمبادئ والتعميمات العلمية لطلاب الجامعة التكنولوجية. مجلة كلية التربية، مج21، ع122، 100 -178. مسترجع من http://search.mandumah.com.sdl.idm.oclc.org/Record/1512917

• أحمد، ع. ر. ع. ا.، & علا رمضان عبد الكريم. (2023). التطلعات المستقبلية لاستخدام نظم استرجاع المعلومات وأدواتها القائمة على الذكاء الاصطناعي في المكتبات: وفقًا للتحليل الرباعي SWOT. مجلة كلية الآداب بالوادي الجديد، 9(17)، 904-941.

• زقزوق، م. (2024). مقدمة في استخدام نماذج اللغة الكبيرة للمستخدمين غير التقنيين. مزن. mozn.ws/91484

• عوض، إيمان عبده حسن. (2024). تطبيقات هندسة أوامر النماذج اللغوية الكبيرة "LLMs" في التدريب على مهام هندسة البرمجيات: مراجعة منهجية. المجلة السعودية للعلوم التربوية، ع16، 89 -106. مسترجع من http://search.mandumah.com.sdl.idm.oclc.org/Record/1532095

• قناوي، يارة ماهر محمد. (2024). استخدام تقنية ChatGpt كأداة ذكية لتحليل البيانات في المكتبات دراسة استكشافية. Egyptian Journal of Information Sciences, 11(1), 505–540.

• AWS. (n.d.). What is a Large Language Model (LLM)? Retrieved April 29, 2025, from https://aws.amazon.com/what-is/large-language-model/

• Harvard Business Review. (n.d.) النماذج اللغوية الكبيرة. Retrieved April 29, 2025, https://hbrarabic.com/

• Holdsworth, J., & Kosinski, M. (2024). ما المقصود بقاعدة البيانات الموجهة؟ IBM Think. https://www.ibm.com/sa-ar/think/topics/vector-database

• IBM Think. (2023). ما عمليات النماذج اللغوية الكبيرة (LLMOps)? https://www.ibm.com/sa-ar/think/topics/llmops

المراجع الأجنبية:

• Bender, E. M., Gebru, T., McMillan-Major, A., & Shmitchell, S. (2021). On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? In Conference on Fairness, Accountability, and Transparency (FAccT '21), March 3-10, 2021, Virtual Event, Canada. ACM, New York, NY, USA, 14 pages. https://doi.org/10.1145/3442188.3445922.

• Foote, K. D. (2023). A brief history of large language models. DATAVERSITY. https://www.dataversity.net/a-brief-history-of-large-language-models/.

• Gemini Team. (2024). Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. arXiv preprint arXiv:2403.05530v5.

• Guinness, H. (2025). The best large language models (LLMs) in 2025. Zapier. https://zapier.com/blog/best-llm/.

• Haziqa S. (2024, August 10). What is information retrieval? Zilliz Learn. https://zilliz.com/learn/what-is-information-retrieval (Accessed February 16, 2025).

https://medium.com/@daniele.nanni/revolutionizing-information-retrieval-the-role-of large-language-models-in-a-post-search-engine-7dd370bdb62.

• Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W-t., Rocktäschel, T., Riedel, S., & Kiela, D. (2021). Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. arXiv preprint arXiv:2005.11401v4.

• McDonough, M. (2025, April 29). Large language model. [Retrieved from https://www.britannica.com/topic/large-language-model.

• Muhammad, A. (2025, January 7). Best Large Language Models for 2025 and How to Choose the Right One for Your Site. Retrieved from https://www.hostinger.com/tutorials/large-language-models.

• Murel, J., & Syed, M. (2024). What is information retrieval? IBM. Retrieved from https://www.ibm.com/think/topics/information-retrieval.

• Nanni, D. (2023, May 17). Revolutionizing Information Retrieval: The Role of Large Language Models in a Post-Search Engine Era. Medium.

• OpenAI. (2024). GPT-4 Technical Report. https://doi.org/10.48550/arXiv.2303.08774.

• Pakhale, Kalyani. "Large Language Models and Information Retrieval." International Journal for Multidisciplinary Research (IJFMR), vol. 5, no. 6, 2023, pp. 1-12.

• Pan, J. Z., Razniewski, S., Kalo, J.-C., Singhania, S., Chen, J., Dietze, S., Jabeen, H., Omeliyanenko, J., Zhang, W., Lissandrini, M., Biswas, R., de Melo, G., Bonifati, A., Vakaj, E., Saleem, J., & Lehmann, J. (2023). Large Language Models and Knowledge Graphs: Opportunities and Challenges. arXiv preprint arXiv:2307.04022.

• Tyson, M. (2024, May 15). What is information retrieval? Coveo. Retrieved from https://www.coveo.com/blog/information-retrieval/.

• Urista, T. (2024, November 9). Enhancing Information Retrieval with Large Language Models: Techniques and Best Practices. Medium. https://www.google.com/search?q=https://timothy-urista.medium.com/enhancing-information-retrieval-with-large-language-models-techniques-and-best-practices-3f414a37e27f.

• Zhu, Y., Yuan, H., Wang, S., Liu, J., Liu, W., Deng, C., Chen, H., Liu, Z., Dou, Z., & Wen, J.-R. (2024). Large Language Models for Information Retrieval: A Survey.

Downloads

Published

2025-06-15

Issue

Section

Articles

How to Cite

Mohammed Abdullah Saeed Al-Amri. (2025). Large Language Models and Information Retrieval in the Digital Environment: a theoretical analytical study. International Journal of Computers and Informatics, 4(6). https://doi.org/10.59992/IJCI.2025.v4n6p1