Indian Scripts Index for Herbs and Spices
अ आ इ ई उ ऊ ऋ ऎ ए ऐ ऒ ओ औ क ख ग घ च छ ज ज़ ट ठ ड ढ त द ध न प फ ब भ म य र ऱ ल व श ष स ह
A Ā I Ī U Ū R̥ Ĕ E Ai Ŏ O Au K Kh G Gh C Ch J Z Ṭ Ṭh Ḍ Ḍh T D Dh N P Ph B Bh M Y R Ṟ L V Ś Ṣ S H
| Unicode Name | Trans- literation | Deva- nagari | Gur- mukhi | Guja- rati | Ben- gali | Oriya | Te- lugu | Kann- ada | Tamil   | Malayalam | Sin- hala |
| Vowels: Independent vowel letters (Change) | |||||||||||
| A | a | अक | ਅਕ | અક | অক | ଅକ | అక | ಅಕ | அக | അക | අක |
| AA | ā | आका | ਆਕਾ | આકા | আকা | ଆକା | ఆకా | ಆಕಾ | ஆகா | ആകാ | ආකා |
| I | i | इकि | ਇਕਿ | ઇકિ | ইকি | ଇକି | ఇకి | ಇಕಿ | இகி | ഇകി | ඉකි |
| II | ī | ईकी | ਈਕੀ | ઈકી | ঈকী | ଈକୀ | ఈకీ | ಈಕೀ | ஈகீ | ഈകീ | ඊකී |
| U | u | उकु | ਉਕੁ | ઉકુ | উকু | ଉକୁ | ఉకు | ಉಕು | உகு | ഉകു | උකු |
| UU | ū | ऊकू | ਊਕੂ | ઊકૂ | ঊকূ | ଊକୂ | ఊకూ | ಊಕೂ | ஊகூ | ഊകൂ | ඌකූ |
| R | r̥ | ऋकृ | ઋકૃ | ঋকৃ | ଋକୃ | ఋకృ | ಋಕೃ | ഋകൃ | ඍකෘ | ||
| RR | r̥̄ | ॠकॄ | ૠકૄ | ৠকৄ | ୠକୄ | ౠకౄ | ೠಕೄ | ൠകൄ | ඎකෲ | ||
| L | l̥ | ऌकॢ | ઌકૢ | ঌকৢ | ଌକୢ | ఌకౢ | ಌಕೢ | ഌകൢ | ඏකෟ | ||
| LL | l̥̄ | ॡकॣ | ૡકૣ | ৡকৣ | ୡକୣ | ౡకౣ | ೡಕೣ | ൡകൣ | ඐකෳ | ||
| CANDRA E | ê | ऍकॅ | ઍકૅ | ||||||||
| SHORT E | ĕ | ऎकॆ | ఎకె | ಎಕೆ | எகெ | എകെ | එකෙ | ||||
| E | e | एके | ਏਕੇ | એકે | একে | ଏକେ | ఏకే | ಏಕೇ | ஏகே | ഏകേ | ඒකේ |
| AI | ai | ऐकै | ਐਕੈ | ઐકૈ | ঐকৈ | ଐକୈ | ఐకై | ಐಕೈ | ஐகை | ഐകൈ | ඓකෛ |
| CANDRA O | ô | ऑकॉ | ઑકૉ | ||||||||
| SHORT O | ŏ | ऒकॊ | ఒకొ | ಒಕೊ | ஒகொ | ഒകൊ | ඔකො | ||||
| O | o | ओको | ਓਕੋ | ઓકો | ওকো | ଓକୋ | ఓకో | ಓಕೋ | ஓகோ | ഓകോ | ඕකෝ |
| AU | au | औकौ | ਔਕੌ | ઔકૌ | ঔকৌ | ଔକୌ | ఔకౌ | ಔಕೌ | ஔகௌ | ഔകൌ | ඖකෞ |
| AE | æ | ඇකැ | |||||||||
| AAE | ǣ | ඈකෑ | |||||||||
| Consonants: | |||||||||||
| KA | k | क | ਕ | ક | ক | କ | క | ಕ | க | ക | ක |
| KHA | kʰ | ख | ਖ | ખ | খ | ଖ | ఖ | ಖ | ഖ | ඛ | |
| GA | g | ग | ਗ | ગ | গ | ଗ | గ | ಗ | ഗ | ග | |
| GHA | gʰ | घ | ਘ | ઘ | ঘ | ଘ | ఘ | ಘ | ഘ | ඝ | |
| NGA | ṅ | ङ | ਙ | ઙ | ঙ | ଙ | ఙ | ಙ | ங | ങ | ඞ |
| CA | c | च | ਚ | ચ | চ | ଚ | చ | ಚ | ச | ച | ච |
| CHA | cʰ | छ | ਛ | છ | ছ | ଛ | ఛ | ಛ | ഛ | ඡ | |
| JA | j | ज | ਜ | જ | জ | ଜ | జ | ಜ | ஜ | ജ | ජ |
| JHA | jʰ | झ | ਝ | ઝ | ঝ | ଝ | ఝ | ಝ | ഝ | ඣ | |
| NYA | ñ | ञ | ਞ | ઞ | ঞ | ଞ | ఞ | ಞ | ஞ | ഞ | ඤ |
| TTA | ṭ | ट | ਟ | ટ | ট | ଟ | ట | ಟ | ட | ട | ට |
| TTHA | ṭʰ | ठ | ਠ | ઠ | ঠ | ଠ | ఠ | ಠ | ഠ | ඨ | |
| DDA | ḍ | ड | ਡ | ડ | ড | ଡ | డ | ಡ | ഡ | ඩ | |
| DDHA | ḍʰ | ढ | ਢ | ઢ | ঢ | ଢ | ఢ | ಢ | ഢ | ඪ | |
| NNA | ṇ | ण | ਣ | ણ | ণ | ଣ | ణ | ಣ | ண | ണ | ණ |
| TA | t | त | ਤ | ત | ত | ତ | త | ತ | த | ത | ත |
| THA | tʰ | थ | ਥ | થ | থ | ଥ | థ | ಥ | ഥ | ථ | |
| DA | d | द | ਦ | દ | দ | ଦ | ద | ದ | ദ | ද | |
| DHA | dʰ | ध | ਧ | ધ | ধ | ଧ | ధ | ಧ | ധ | ධ | |
| NA | n | न | ਨ | ન | ন | ନ | న | ನ | ந | ന | න |
| NNNA | ṉ | ऩ | ன | ||||||||
| PA | p | प | ਪ | પ | প | ପ | ప | ಪ | ப | പ | ප |
| PHA | pʰ | फ | ਫ | ફ | ফ | ଫ | ఫ | ಫ | ഫ | ඵ | |
| BA | b | ब | ਬ | બ | ব | ବ | బ | ಬ | ബ | බ | |
| BHA | bʰ | भ | ਭ | ભ | ভ | ଭ | భ | ಭ | ഭ | භ | |
| MA | m | म | ਮ | મ | ম | ମ | మ | ಮ | ம | മ | ම |
| YA | y | य | ਯ | ય | য | ଯ | య | ಯ | ய | യ | ය |
| RA | r | र | ਰ | ર | র | ର | ర | ರ | ர | ര | ර |
| RRA | ṟ | ऱ | ఱ | ಱ | ற | റ | |||||
| LA | l | ल | ਲ | લ | ল | ଲ | ల | ಲ | ல | ല | ල |
| LLA | ḷ | ळ | ਲ਼ | ળ | ଳ | ళ | ಳ | ள | ള | ළ | |
| LLLA | ḻ | ऴ | ೞ | ழ | ഴ | ||||||
| VA | v | व | ਵ | વ | ଵ | వ | ವ | வ | വ | ව | |
| SHA | ś | श | ਸ਼ | શ | শ | ଶ | శ | ಶ | ஶ | ശ | ශ |
| SSA | ṣ | ष | ષ | ষ | ଷ | ష | ಷ | ஷ | ഷ | ෂ | |
| SA | s | स | ਸ | સ | স | ସ | స | ಸ | ஸ | സ | ස |
| HA | h | ह | ਹ | હ | হ | ହ | హ | ಹ | ஹ | ഹ | හ |
| QA | q | क़ | |||||||||
| KHHA | ķ | ख़ | ਖ਼ | ||||||||
| GHHA | ģ | ग़ | ਗ਼ | ||||||||
| DDDHA/RRA | ṛ | ड़ | ੜ | ড় | ଡ଼ | ||||||
| RHA | ṛʰ | ढ़ | ঢ় | ଢ଼ | |||||||
| FA | f | फ़ | ਫ਼ | ෆ | |||||||
| YYA | ẏ | य़ | য় | ୟ | |||||||
| NNGA | ⁿg | ඟ | |||||||||
| JNYA | ⁿj | ඥ | |||||||||
| NNDDA | ⁿḍ | ඬ | |||||||||
| NDA | ⁿd | ඳ | |||||||||
| MBA | ⁿb | ඹ | |||||||||
| ŕ | ৰ | ||||||||||
| ṿ | ৱ | ||||||||||
This index gives names for all spices in several languages of India. This includes all official Indian languages if written with a Brahmi-derived script (plus some from neighbouring regions); this means that both North-Indian (Aryan) and South-Indian (Dravidian) languages are accounted for. Those Indian languages written in Arabic script (Urdu, Kashmiri, Punjabi in Pakistan) are not included here, but will appear in the Arabic Index in due time.
The Brahmi-derived writing systems have a highly complex (but rather uniform) structure. They work basically alphabetically, i.e., they contain characters for both vowels and consonants; yet these are combined in a nontrivial and varied manner to yield symbols for syllables (a syllable here means a sequence of consonants followed by exactly one vowel). Inside the consonatic part, all consonants signs merge into complex ligatures called conjuncts. The vowel sign may be placed above or below the consonants like diacritics in the Latin alphabet; but some vowel signs follow or precede the consonant cluster (or in the case of two-part vowel signs even both at the same time). The vowel A s called implicit as it has no graphic representation of its own, unless it constitutes a full syllable without any preceding consonants (which is frequent word-initially, but extremely rare in other positions). Further complications arise from the large number of sounds and thus signs, and a small set of diacritical marks that modify the pronunciation.
The basic principles have different implementations in each of the modern scripts: Conjuncts may be formed horizontally, vertically or idiosyncratically, and in some scripts they involve subscripted characters; but Tamil has no conjuncts at all. In some scripts, the shape of a consonant depends on the vowel associated with it. Some scripts use two-part vowel signs one part of which precedes the associated consonant cluster (but in Sinhala, it only preceeds the last consonant of the cluster). And most scripts have one or few peculiarities not shared by the others (for example, the addak-sign in Gurmukhi that indicates double consonants, or the chillu shapes for word-final consonants in Malayalam). Also, some features common to all scripts may have different semantics (e.g., the anusvara or chandrabhindu signs may indicate nasalisation of the previous vowel or of the following consonant). Another puzzling case is the eyelash ra which appears in few Devanagari-written languages but whose precise meaning is not clear.
Each script is used for one or several languages, whose phonetics determine the character repertoire. Thus, one
can usually not spell a language in an alien
script without some dirty tricks. There is an exception,
though: The North Indian Devanagari script was augmented with special characters and is close to a superset of
all Indic scripts. These special characters are, however, rather an academic tool and hardly known to Indian speakers.
The romanizations given here are very close to both the common scientific transliteration (IAST) of Sanskrit and the enlarged ISO 15919 standard. Pure ASCII schemes like Harvard-Kyoto and ITRANS are very different.
Retroflex consonats of Sanskrit are marked by a dot below the letter, (Ṭ, Ḍ, Ṣ, Ṇ). The same dot is used for retroflex variety of L (Ḷ). For R, however, I have to use two different special characters: North Indian ड़ (DDDHA, equivalent to DDA with Nukta) is rendered with a dot (Ṛ); its equivalent letter in Bengali (ড়) and Oriya (ଡ଼) is called RRA, not DDDHA. South Indian RRA ऱ (e.g., Tamil ற) is romanized with an underline, Ṟ.
The Devanagari eyelash ra, which is characteristic of Nepali and Marathi, has no codepoint of its own; instead, it is coded as RA+VIRAMA+ZWJ र् in Unicode 2.0, but Unicode 3.0 interprets the character as half form of RRA (and consequently writes RRA+VIRAMA ऱ्). For transliteration, I use Ř.
Vocalic R and L are usually marked with a dot in scientific Sanskrit romanization (IAST). However, since the dot is already used for retroflex consonants in ISO-15919, I have to use a ring instead: R̥ and L̥. The long versions have an additional macron, R̥̄ and L̥̄. These characters might be poorly supported on some browsers (but fortunately, all except R̥ are rare or even nonexistent in Sanskrit, and do not appear at all in the modern languages except in learned Sanskrit loans).
The velar and palatal nasals are written Ṅ and Ñ, respectively. Note that in most Indic alphabets, both are frequently written as anusvara.
The implicit vowel is always rendered A, even if it is rather pronounced like O. Most North Indian languages often write this vowel even if it is not spoken, and in such cases my transliteration and sorting are often slightly inconsequent.
Superscript H following a consonant (ʰ) marks aspirated sounds (which exist in all languages save Tamil and Sinhala).
Superscript N preceding a voiced stop (ⁿ) indicates prenasalized stops (this feature is restricted to Sinhala).
Long vowels (Ā, Ī, Ū) are distinguished by a macron. E and O are always long in Aryan languages, and thus not specifically marked (as in IAST, but not in ISO-15919). I write the Dravidian short E and O as Ĕ and Ŏ, respectively. There is also a short E and O in some North Indian languages to write English loanwords (transliterated Ê and Ô), but these do not appear in this index.
The Gurmukhi and Devanagari letters KHHA and GHHA represent khah خ and ghain غ, respectively, in some Arabic loanwords. ISO-15919 suggests KH and Ġ as romanization. The former is in my opinion a poor choice, as it is not easily representable in Unicode. Thus, I use Ḫ and Ġ as in Arabic; however, neither of them appears in any spice name I could find. Also, the letters QA and ZA are used only for Arabic or Persian terms, and are consequently rare. FA, on the other side, appears in a few few native words plus Arabic, Persian and English loans.
The Tamil letters LLLA ழ (also in Malayalam ഴ) and NNNA ன are represented by L and N with an underline, in accord with the scientific transliteration of Tamil: Ḻ and Ṉ, respectively.
Additionally, Devanagari transliterations are given for languages that have another native alphabet. These Devanagarizations use a couple of special signs to accommodate sounds alien to most Aryan languages. In general, there is a 1:1 correspondence between Devanagari and native letters; as an exception, the anusvara has been used in Devanagari throughout even if the native alphabet expresses nasalization by means of character, and a few native signs not reprensentable in Devanagari had to be replaced by near-matches. Since the Devanagari transliterations are produced programmatically by sed, some of them might by systematically flawed.
Displaying this page correctly is currently a true challange to any computer system, many of which will fail. You will not only need fonts for all scripts used, but you must also make sure that your browser (or the underlying operating system) can handle the complex rules of Indic typography correctly. Malayalam is an especially tricky case, and many current systems fail the following test: മഞ്ഞള് (maññaḷ) and എള്ള് (ĕḷḷə). You should see only one virama (a breve-like mark) over the last letter of the second word, while on some systems up to four viramas are shown, or characters are replaced with question marks or square boxes. Fortunately, Unicode 5.1 offers a new way to code the first word as മഞ്ഞൾ, which should be easier to interpret for renderers (as soon as the fonts are updated).
The entries are sorted according to the canonical Devanagari collating sequence, which is mimicked by all the other Indic scripts. Anusvara is sorted as if it were written as a nasal. The handling of the implicit vowel is still somewhat unsystematic; it is often ignored in sorting if it is written but not pronounced.
By its very construction, this index can hold only a limited number of scripts and languages; the following table gives an overview. This list is basically identical to the set of all Indian languages filtered by two criteria: (a) written in a Brahmi-derived alphabet and (b) having official or officially recognized status (either by the Union or by a single state); plus Sinhala. Since Bishnupriya Manipuri has at least an active community on the web, I have tentatively added it as a possible candidate for future inclusion. The 200+ more Indian languages are either written in Latin or Arabic alphabet, or have not been reduced to writing at all, so that they make little sense in an index of this kind. Also excluded is Rajasthani, which is in fact a bundle of closely related languages without a standardized orthography, and I am not sure about its official status.
Those languages currently available are written in bold face. You can expect more languages to be included here as time goes. Note that some languages are somewhat problematic to add, because they contain sounds/letters that cannot be mapped to the Devanagari alphabet. There are two supplementary letters in Bengali script (needed for Assamese language) that have no Devanagari counterpart, and Sinhala is even more problematic with its extra vowels and its prenasalisation feature; these letters have been conflated with standard Devanagari letters in sorting. For the same reason, Tibetan was difficult to add, as many ad hoc decisions had to be made how to align Tibetan script with Devanagari; readers are probably better served with the Tibetan Index.
The South East
Asian Brahmi-derived scripts are perhaps impossible to add to this index, as they have evolved a long, long road from
the common ancestor.
| Script | Language | Remarks | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Devanagari | संस्कृत | Sanskrit (sa) β | classical tongue of lore, religion and philosophy | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| हिंदी | Hindi (hi) | lingua franca in Northern India; official language of the Indian Union and many Northern union states | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| मराठी | Marathi (mr) | official state language in Maharashtra | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| कॉशुर | Kashmiri (Koshur, ks) | regional langage in Kashmir, now mostly (in Pakistan always) written in Arabic alphabet [کٲشر] (Dardic) | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| कोंकणी | Konkani (kok) α | official state language in Goa | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| नेपाली | Nepali (ne) | national langage in Nepal; official second language in West Bengal | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| नेपालभाषा | Nepal Bhasa | regional langage in Nepal; the traditional Newari-Script enjoys a partial revival in Kathmandu, but Unicode support is still lacking (Sino-Tibetan)
| बोड़ो | Bodo (brx) | official second language in Assam (Sino-Tibetan)
| मैथिली | Maithili (Bihari, bh) | official (?) regional language in Bihar and strong community in Southern Nepal. In the past, it was written in a variety of Bengali script known as Mithilakshar (there is yet no Unicode support) or in the Kaithi script [𑂍𑂶𑂘𑂲], but today Devanagari is used if the language is written at all.
| डोगरी | Dogri (doi) | regional language in Jammu und Kashmir; spoken in Pakistan and there written in Arabic alphabet [ڈوگرى]. Culinary vocabulary is very close to Hindi.
| सिन्धी | Sindhi (sd) | scattered over North Western India and Pakistan (where it is co-official); recognized by the Indian constitution, but no official status in any union state; in India partially and in Pakistan completely written in a variety of Arabic script with many additional characters [سنڌی]
| Gurmukhi | ਪੰਜਾਬੀ | Punjabi (Panjabi, pa) | official state language in Haryana and Punjab; widely spoken as a vernacular in the eastern part of Pakistan where it is not official (written in | Gujarati | ગુજરાતી | Gujarati (gu) | official state language in Gujarat
| Bengali (Eastern Nagari) | বাংলা | Bengali (Bangla, bn) | official state language in Western Bengal; national language of Bangladesh
| অসমীয়া | Assamese | official state language in Assam
| মনিপুরি | Manipuri (Meitei, mni) | official state language in Manipur; unofficial minority language in Bangla Desh. The native Meitei Mayek script (no Unicode support) was replaced by Bengali in the 18.th century (Sino-Tibetan)
| বিষ্ণুপ্রিয়া মণিপুরী | Bishnupriya Manipuri (bpy) | spoken in scattered communities in North East India and Bangla Desh (not official)
| সিলটী | Sylheti (Siloti) | spoken in Bangla Desh and North East India (not official); there is a native alphabet (Siloti Nagori [ꠡꠁꠟꠐꠡ ꠘꠀꠉꠠꠡ]), which is, however, mostly extinct.
| Oriya | ଓଡ଼ିଆ | Oriya (or) | official state language in Orissa
| Telugu | తెలుగు | Telugu (te) | official state language in Andhra Pradesh (Dravidian)
| Tamil | தமிழ் | Tamil (ta) | official state language in Tamil Nadu; second national language of Sri Lanka (Dravidian)
| Kannada | ಕನ್ನಡ | Kannada (kn) | official state language in Karnataka (Dravidian)
| Malayalam | മലയാളം | Malayalam (ml) | official state language in Kerala (Dravidian)
| Sinhala | සිංහල | Sinhala (Singhalese, si) | national language of Sri Lanka
| Uchen (Tibetan) | བོད་སྐད་ | Tibetan (bo) | a group of interrelated languages spoken in Tibet, China, Nepal, India and Pakistan (Sino-Tibetan)
| གླེ་སྐད་ | Ladakhi (ljb) | sublanguage of Tibetan located mainly on the North Western edge of India; has official status in Jammu & Kashmir state (Sino-Tibetan)
| རྫོང་ཁ་ | Dzonka (dz) | national language of Bhutan. Not part of the Tibetan macrolanguage (Sino-Tibetan)
| |
It should be noted that I obtained many of the names shown in this index from poorly legible, hand-written lists, and (as anyone with some knowledge of Indic scripts will confirm) it is quite difficult and error-prone to digitalize such scribblings, moreover one may easily fall victim to orthographic deficiencies of the indivuals who wrote them. Whenever possible, I checked on the Internet, but due to a scarcity of sources (especially bilingual ones), this was only partially possible for some languages (notably, Gujarati, Bengali and Malayalam). Spellings in Sinhala, Punjabi, Assamese, Nepali and Konkani should therefore taken with a grain of salt. Comments and corrections are of course welcome.
Some of the more exotic languages with null internet coverage were field researched, which poses significant problems with poor alphabetization rates and a general lack of writing tradition. Thus, spice names in Maithili, Dogri and Newari are perhaps to be seen as approximations only (the number of different spellings for a word can well equal the number of people asked); also, Tibetan proved hard work, although I was finally able to cross-check some of the names with written literatur on medicinal herbs.
Sanskrit presents a different problem, as the standard dictionary of Monier Williams contains an unreasonable amount of synonyms and polyvalencies (and invalid scientific plant names); neither ancient Indian writers nor modern linguists give much heed to botany, it seems. For that reason, Sanskrit will probably remain in the β state forever.