CRAN_Status_Badge DOI

An R package for the full text of the Qur’an

This package contains the complete text of the Qur’an in Arabic (with and without vowels) and in English (the Yusuf Ali and Saheeh International translations).

The text for each volume or translation was sourced from Tanzil and Al Quran Cloud.

Each volume is in a data frame with a row for each ayah, along with 18 columns of detailed metadata (including ayah number, surah number, surah name in Arabic and English, juz’, hizb, and so on), ready for text analysis.

The package contains:

Surah titles in Arabic, transliterated English, and translated English are all factors to preserve the correct order.


This package is not released on CRAN yet. You can install the development version from Github:



For some ideas on getting started with analyzing these texts, see Text Mining with R by Julia Silge and David Robinson, or Julia Silge’s blog post on sentiment analysis of Jane Austen’s novels. For help within R, try ?quran_ar or similar for getting started with the datasets.

Here’s how to do some basic data manipulation with the data:


quran_ar %>% 
  group_by(surah_title_en) %>% 
  summarize(total_ayat = n())
#> # A tibble: 114 x 2
#>    surah_title_en total_ayat
#>    <fct>               <int>
#>  1 Al-Faatiha              7
#>  2 Al-Baqara             286
#>  3 Aal-i-Imraan          200
#>  4 An-Nisaa              176
#>  5 Al-Maaida             120
#>  6 Al-An'aam             165
#>  7 Al-A'raaf             206
#>  8 Al-Anfaal              75
#>  9 At-Tawba              129
#> 10 Yunus                 109
#> # ... with 104 more rows

quran_ar %>% 
  filter(surah_title_en == "Al-Faatiha") %>% 
  select(surah, ayah, text)
#> # A tibble: 7 x 3
#>   surah  ayah text                                                
#>   <int> <int> <chr>                                               
#> 1     1     1 بِسْمِ اللَّهِ الرَّحْمَٰنِ الرَّحِيمِ                              
#> 2     1     2 الْحَمْدُ لِلَّهِ رَبِّ الْعَالَمِينَ                               
#> 3     1     3 الرَّحْمَٰنِ الرَّحِيمِ                                       
#> 4     1     4 مَالِكِ يَوْمِ الدِّينِ                                      
#> 5     1     5 إِيَّاكَ نَعْبُدُ وَإِيَّاكَ نَسْتَعِينُ                              
#> 6     1     6 اهْدِنَا الصِّرَاطَ الْمُسْتَقِيمَ                               
#> 7     1     7 صِرَاطَ الَّذِينَ أَنْعَمْتَ عَلَيْهِمْ غَيْرِ الْمَغْضُوبِ عَلَيْهِمْ وَلَا الضَّالِّينَ

Code of Conduct

This project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.


The text and translations are all open source or licensed under Creative Commons by Tanzil and Al Quran Cloud. The licenses have been converted to MIT here, due to CRAN’s policy that the entire package use the same license throughout.

Tanzil Quran License

Text taken from AlQuran.cloud and the Islamic Network is open source


Saheeh International Translation provided by the Qur’an Project with no rights reserved

See https://archive.org/stream/QuranTranslationBySaheehInternational/Quran%20-%20Translation%20by%20Saheeh%20International_djvu.txt

Fatihah background in hex icon from seeklogo.