Please use this identifier to cite or link to this item: https://dspace.univ-ghardaia.edu.dz/xmlui/handle/123456789/9845
Title: RAG-based Question-Answering System for Algerian Tax law Context
Authors: Hamani, Wissal
Benyounes, Zineb
Bellaouar, Slimane Supervisor
Keywords: Legal QA, semantic retrieval, Retrieval-Augmented Genera- tion, Embeddings, Arabic NLP, Algerian tax law.
Réponse aux questions juridiques, Recherche Sémantique, Génération Augmentée par Recherche, Représentations Vectorielles, Traite- ment Automatique du Langage Naturel en Arabe, Droit Fiscal Algérien.
Issue Date: 2025
Publisher: université Ghardaia
Abstract: While large language models perform well in answering general questions, their deployment in specialized domains such as law faces several challenges, including generating inaccurate answers or responses unsupported by legal texts, and difficulty handling complex questions due to the lack of high-quality specialized data. These challenges are even more pronounced in the Algerian legal context, where Arabic legal texts are often limited and poorly digitized. This thesis aims to develop a legal question-answering system in Arabic based on Algerian tax law by combining dense semantic retrieval with a generative language model. The work includes several phases: collecting legal texts from a reliable source, preprocessing them, segmenting them into legal articles, representing them using models adapted to the Arabic language such as AraBERT and E5, and archiving them using FAISS to facilitate retrieval. Then, a generative model is used to formulate the answer based on the retrieved article. The system was implemented using Python in the Google Colab environment and was evaluated based on retrieval quality and answer accuracy. The experimental results demonstrated that the semantic retrieval approach using the E5 model achieved a recall of 91%, significantly outperforming keyword- based methods such as BM25. Furthermore, the integration of the retrieved content with a fine-tuned generative model led to more legally grounded and fluent answers, especially in handling multi-layered questions. These findings highlight the effec- tiveness of combining semantic search with generative modeling in addressing the unique challenges of Arabic legal question answering in the Algerian tax context.
Description: Specialty: Intelligent Systems for Knowledge Extraction
URI: https://dspace.univ-ghardaia.edu.dz/xmlui/handle/123456789/9845
Appears in Collections:Mémoires de Master

Files in This Item:
File Description SizeFormat 
RAG_based_QA_for_Algerian_Legal_Context__Ghardaia_ - BENYOUNES ZINEB.pdf1.4 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.