Skip to content
Surf Wiki
Save to docs
general/persian-orthography

From Surf Wiki (app.surf) — the open knowledge base

Romanization of Persian

Representation of the Persian language with the Latin script


Representation of the Persian language with the Latin script

The romanization of Persian is the representation of the Persian language (Iranian Persian, Dari and Tajik) with the Latin script. Several different romanization schemes exist, each with its own set of rules driven by its own set of ideological goals.

Romanization is familiar to many Persian speakers. Many use an ad hoc romanization for text messaging and email; road signs in Iran commonly include both Persian and English (in order to make them accessible to foreigners); and websites use romanized domain names.

Romanization paradigms

Because the Persian script is an abjad writing system (with a consonant-heavy inventory of letters), many distinct words in standard Persian can have identical spellings, with widely varying pronunciations that differ in their (unwritten) vowel sounds. Thus a romanization paradigm can follow either transliteration (which mirrors spelling and orthography) or transcription (which mirrors pronunciation and phonology).

Transliteration

Transliteration (in the strict sense) attempts to be a complete representation of the original writing, so that an informed reader should be able to reconstruct the original spelling of unknown transliterated words. Transliterations of Persian are used to represent individual Persian words or short quotations, in scholarly texts in English or other languages that do not use an abjad.

A transliteration will still have separate representations for different consonants of the Persian alphabet that are pronounced identically in Persian.

Transliterations commonly used in the English-speaking world include BGN/PCGN romanization and ALA-LC Romanization.

Non-academic English-language quotation of Persian words usually uses a simplification of one of the strict transliteration schemes (typically omitting diacritical marks) and/or unsystematic choices of spellings meant to guide English speakers using English spelling rules towards an approximation of the Persian sounds.

Transcription

Transcriptions of Persian attempt to straightforwardly represent Persian phonology in the Latin script, without requiring a close or reversible correspondence with the Persian script, and also without requiring a close correspondence to English phonetic values of Roman letters.

Main romanization schemes

  • DMG (1969), a strict scientific system by the German Oriental Society (Deutsche Morgenländische Gesellschaft). It corresponds to Deutsches Institut für Normung standard DIN 31635.
  • ALA-LC (1997), the ALA-LC romanization.
  • BGN/PCGN (1958), the BGN/PCGN romanization.
  • EI (1960), the system used in early editions of Encyclopædia Iranica.
  • EI (2012), its contemporary modification.
  • UN (1967), the Iranian national system (1966), that was approved by the UNGEGN in 1967.
  • UN (2012), its contemporary modification.

Comparison table

UnicodePersian
letterIPADMG (1969)ALA-LC (1997)BGN/PCGN (1958)EI (1960)EI (2012)UN (1967)UN (2012)Pronunciation
U+0627ا
U+0628ب
U+067Eپ
U+062Aت
U+062Bث
U+062Cج
U+0686چ
U+062Dح
U+062Eخ
U+062Fد
U+0630ذ
U+0631ر
U+0632ز
U+0698ژ
U+0633س
U+0634ش
U+0635ص
U+0636ض
U+0637ط
U+0638ظ
U+0639ع
U+063Aغ
U+0641ف
U+0642ق
U+06A9ک
U+06AFگ
U+0644ل
U+0645م
U+0646ن
U+0648و
U+0647ه
U+0629ة
U+06CCی
U+0621ء
U+0623أ
U+0624ؤ
U+0626ئ
UnicodeFinalMedialInitialIsolatedIPADMG (1969)ALA-LC (1997)BGN/PCGN (1958)EI (2012)UN (1967)UN (2012)Pronunciation
U+064Eـَ
U+064Fـُ
U+0648 U+064Fـو
U+0650ـِ
U+064E U+0627ـَا
U+0622ـآ
U+064E U+06CCـَی
U+06CC U+0670ـیٰ
U+064F U+0648ـُو
U+0650 U+06CCـِی
U+064E U+0648ـَو
U+064E U+06CCـَی
U+064E U+06CCـیِ
U+06C0ـهٔ

Antiquity

In antiquity, Old and Middle Persian employed various scripts including Old Persian cuneiform, Pahlavi and Avestan scripts. For each period there are established transcriptions and transliterations by prominent linguists.

IPAOld PersianMiddle Persian
(Pahlavi)Avestan
Consonants
p
f
b
β~ʋ~w
t
θ
d
ð
θr
s
z
ʃ
ʒ
c~tʃ
ɟ~dʒ
k
x
g
ɣ
h
m
ŋ
ŋʲ
n
r
l
w~ʋ~v
j
Vowels
Short
a
ã
ə
e
i
o
u
Long
ɑː~ɒː
ə
əː

Notes:

  • ā â
  • ī, ū i, u
  • x kh, ḵ, ḥ, ḫ
  • c/č ǩ
  • j/ǰ ǧ
  • θ ϑ, þ, th, ṯ, ṭ
  • ç tr, θʳ, ϑʳ, ṙ, s͜s, s̀
  • f
  • y, v j, w.}} A sample romanization (a poem by the Persian poet Hâfez):
Persian RūmiPerso-Arabic scriptEnglish
Yusefê gomgaşte báz áyad be Kanân qam maħor

Other romanization schemes

Baháʼí Persian romanization

Main article: Baháʼí orthography

Baháʼís use a system standardized by Shoghi Effendi, which he initiated in a general letter on March 12, 1923. The Baháʼí transliteration scheme was based on a standard adopted by the Tenth International Congress of Orientalists which took place in Geneva in September 1894. Shoghi Effendi changed some details of the Congress's system, most notably in the use of digraphs in certain cases (e.g. s͟h instead of š).

A detailed introduction to the Baháʼí Persian romanization can usually be found at the back of a Baháʼí scripture.

{{anchor|Fingilish}}ASCII Internet romanizations

PersianFingilish
[آ](a)،[ا]()}}a, â
[ب](b)}}b
[پ](p)}}p
[ت](t)}}t
ث}}s
[ج](j)}}j
چ}}ch, č
[ح](h)}}h
خ}}kh, x
[د](d)}}d
ذ}}z
[ر](r)}}r
[ز](z)}}z
[ژ](j)}}zh, ž
[س](s)}}s
ش}}sh, š
[ص](s)}}s
[ض](d)}}z
[ط](t)}}t
[ظ](z)}}z
[ع]()،[ء]()}}a, ə
[غ](g)}}gh, q
[ف](f)}}f
[ق](q)}}gh, q
ک}}k
[گ](g)}}g
[ل](l)}}l
[م](m)}}m
[ن](n)}}n
[و](w)}}o, u, v, w
[ه](h)}}h
[ی](y)}}i, y

It is common to write Persian language with only the Latin alphabet (as opposed to the Persian alphabet) especially in online chat, social networks, emails and SMS. It has developed and spread due to a former lack of software supporting the Persian alphabet, and/or due to a lack of knowledge about the software that was available. Although Persian writing is supported in recent operating systems, there are still many cases where the Persian alphabet is unavailable and there is a need for an alternative way to write Persian with the basic Latin alphabet. This way of writing is sometimes called Fingilish or Pingilish (a portmanteau of Fârsi or Persian and English). In most cases this is an ad hoc simplification of the scientific systems listed above (such as ALA-LC or BGN/PCGN), but ignoring any special letters or diacritical signs. The details of the spelling also depend on the contact language of the speaker; for example, the vowel is often spelt "oo" after English, but Persian speakers from Germany and some other European countries are more likely to use "u".

Persian alphabet based on Latin in USSR

In the USSR from 1931 to 1938, the Persian alphabet based on Latin was used. It was used for teaching in schools, the Bejraqe Sorx (Red Banner) newspaper was published in Ashgabat, as well as textbooks and other literature.

S s
ث ,س ,صŞ ş
شT t
ت ,طU u
اوV v
وX x
خZ z
ز ,ذ ظ ,ضƵ ƶ
ژ'
ع , ٴ

Tajik Latin alphabet

Main article: Tajik alphabet

The Tajik language or Tajik Persian is a variety of the Persian language. It was written in the Tajik SSR in a standardized Latin script from 1926 until the late 1930s, when the script was officially changed to Cyrillic. As a result of these factors romanization schemes of the Tajik Cyrillic script follow rather different principles. Even though it is largely unused, Google Translate implements the alphabet as the Latin transliteration for Tajik.

Variation proposed by Mir Shamsuddin Adib-Soltani

A variation (that is sometimes called "Pârstin") proposed by linguist Mir Shamsuddin Adib-Soltani in 1976 has seen some use by other linguists, such as David Neil MacKenzie for the transliteration of the Persian scripture.

The letters of this variation of the Latin alphabet are the basic Latin letters: Aa, Bb, Cc, Dd, Ee, Ff, Gg, Hh, Ii, Jj, Kk, Ll, Mm, Nn, Oo, Pp, Qq, Rr, Ss, Tt, Uu, Vv, Xx, Yy, Zz, plus the additional letters to support the native sounds: Ââ, Čč, Šš, Žž (the latter three from Slavic alphabets, like the Czech one).

Besides being one of the simplest variations proposed for the Latinization of the Persian alphabet, this variation is based on the Alphabetic principle. Based on this principle, each individual speech sound is represented by a single letter and there is a one-to-one correspondence between sounds and the letters that represent them. This principle, besides increasing the clarity of the text and preventing confusion for the reader, is specifically useful for representing the native sounds of the Persian language, for which there are no equivalents in most other languages written in a Latin-based alphabet. For instance, compound letters used in the other variations, such as kh and gh, in addition to sh and zh are respectively represented by x, q, š and ž.

Notes

References

References

  1. Akbari, Mohsen. (2013). "A preliminary linguistic analysis of Romanized Persian SMS messages". Journal of Novel Applied Sciences.
  2. Beam, Christopher. (2009-06-17). "Why do Iranian police uniforms say "police" in English?".
  3. Effendi, Shoghi. (1974). "Baháʼí Administration". Baháʼí Publishing Trust.
  4. Lambert, James. 2018. A multitude of 'lishes': The nomenclature of hybridity. ''English World-wide'', 39(1): 10. DOI: 10.1075/eww.38.3.04lam
  5. (1931). "Утвержденный президиумом научного совета ВЦК НА новый персидский алфавит". Культура и письменность Востока.
  6. R. Axundov. (1932). "Rahe nov. Əʟefвa вәraje kudәkan". Nәşrijjate dovʟәtije Torkmәnestan.
  7. "Tajik".
  8. Perry, John R.. (2005). "A Tajik Persian Reference Grammar". Brill.
  9. Adib-Soltani, Mir Shamsuddin. (1976). "An introduction to the writing of the Persian script". Amirkabir Publications.
  10. "Persian (Farsi)".
  11. "Persian". UNGEGN.
  12. ''[http://unstats.un.org/unsd/geoinfo/UNGEGN/docs/20th-gegn-docs/20th_gegn_WP41.pdf Toponymic Guidelines for map and other editors – Revised edition 1998. Working Paper No. 41]''. Submitted by the Islamic Republic of Iran. UNGEGN, 20th session. New York, 17–28 January 2000.
  13. ''[http://unstats.un.org/unsd/geoinfo/UNGEGN/docs/10th-uncsgn-docs/econf/E_CONF.101_118_Rev.1_Transcription%20symbols%20for%20Persian_Updated.pdf New Persian Romanization System. E/CONF.101/118/Rev.1*]''. Tenth United Nations Conference on the Standardization of Geographical Names. New York, 31 July – 9 August 2012.
  14. "Persian". The Library of Congress.
  15. "Romanization system for Persian (Dari and Farsi). BGN/PCGN 1958 System".
  16. "Transliteration". Encyclopædia Iranica.
  17. (1904). "Altiranisches Wörterbuch".
  18. (1950). "Old Persian".
  19. (1971). "A Concise Pahlavi Dictionary".
  20. (1996). "Avestische Laut- und Flexionslehre".
Info: Wikipedia Source

This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.

Want to explore this topic further?

Ask Mako anything about Romanization of Persian — get instant answers, deeper analysis, and related topics.

Research with Mako

Free with your Surf account

Content sourced from Wikipedia, available under CC BY-SA 4.0.

This content may have been generated or modified by AI. CloudSurf Software LLC is not responsible for the accuracy, completeness, or reliability of AI-generated content. Always verify important information from primary sources.

Report