
Alchemy and computer : a computational analysis of the Jabirian corpus

Abstract : This work presents a novel approach to the study of the Jābirian corpus, taking into account the existing literature and the problems peculiar to this corpus (synonymy, polysemy, dispersion of knowledge, quotations from other authors, hypertextuality). Using modern computational analysis, this thesis aims to digitize edited texts (Muḫtār Rasāʾil, Tadbīr al-iksīr al-aʿẓam, Kitāb al-ahjār) in order to create a digital corpus tagged according to the Text Encoding Initiative (TEI), the most widely used annotation scheme in Natural Language Processing (NLP). Section I introduces the historical setting and the subject matter of the texts studied, including an excursus on the figure of Jābir Ibn Hayyān and the debate over his existence, and explains the methodological framework of this work. Section II is the operational part: it presents the compromises made in building the digital corpus, the strategies used to render the various issues raised in Section I, and the set of choices that attempt to answer the questions posed there. The core of the work is represented by the Appendices, divided into four parts: Appendices A, B and C are extracts from the digital corpus; the first section of each of the three source books is included in order to represent every detail of the digitization strategies and processes. Appendix D comprises a sample of concordances based on the lemmatization of the edition of the first two books of the Tadbīr. Appendix E is the frequency list of the same sample used for the concordances.
Contributor : Abes Star
Submitted on : Friday, October 18, 2019 - 1:05:07 AM
Last modification on : Friday, September 18, 2020 - 2:34:45 PM
Long-term archiving on : Sunday, January 19, 2020 - 12:44:18 PM


Version validated by the jury (STAR)


  • HAL Id : tel-02319443, version 1


Ilaria Cicola. Alchemy and computer : a computational analysis of the Jabirian corpus. Linguistics. École pratique des hautes études - EPHE PARIS; Università degli studi La Sapienza (Rome), 2016. English. ⟨NNT : 2016EPHE5056⟩. ⟨tel-02319443⟩


