Integration of an Arabic Lemmatizer as Part of an Information Retrieval System
Abstract
Natural Language Processing (NLP) is a field of Artificial Intelligence that encompasses several topics: Machine translation, Text summarization, spell checkers, information retrieval, etc. The objective of information retrieval (IR) is to provide a user with easy access to the information of interest, this information being located in a mass of textual documents. To achieve this objective, it must represent, store and organize information, then provide the user with the elements corresponding to the information need expressed by his request. Most of information retrieval systems (IRS) use simple terms to index and retrieve documents based on models such as: Boolean model, vector model, probabilistic model, language model, etc. The aim of our work is to develop a hybrid Arabic system based on four (04) models where the originality of the work lies in incorporating a stemmer in the search process to improve the results of our system.