Showing posts from March, 2016

Arabic Email Addresses

Most human language scripts are written from Left to Right (L➡︎R). Arabic is written Right to Left (L⬅︎R). An email address written in the Latin script is displayed L➡︎R as username@domain-name. An Arabic email address, on the other hand, would normally, without intervention, be displayed L⬅︎R as domain-name@username.

Letʼs take a fictitious Arabic email address: خالد@الدوحة.قطر

خالد is the username, Khalid
الدوحة is the 2nd level domain name, Doha
قطر is the Top Level Domain (TLD), Qatar. This part is not fictitious, as قطر is a valid ccTLD.

Your browser should be displaying the email address خالد@الدوحة.قطر in L⬅︎R order, which is not an order familiar to most L➡︎R readers and so requires some effort to parse.

When text mixes L➡︎R and L⬅︎R characters it is referred to as Bidirectional (bidi) text. There is a complex Unicode algorithm specifically to determine the display order of bidi text: unicode.org/reports/tr9/

If you read this report you will se...
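
To make the stored order versus displayed order concrete, here is a minimal Python sketch. It is an illustration, not the full bidi algorithm from the report. It builds the example address in logical (typed) order and lists the Unicode bidi class of its characters using the standard `unicodedata` module. The variable names and the `bidi_summary` helper are hypothetical, chosen only for this example.

```python
import unicodedata

# Build the address in logical (memory) order: username first, then domain.
username = "خالد"       # Khalid
second_level = "الدوحة"  # Doha
tld = "قطر"             # Qatar (a real ccTLD)
address = f"{username}@{second_level}.{tld}"


def bidi_summary(text):
    """Return the distinct Unicode bidi classes in text, in first-seen order."""
    seen = []
    for ch in text:
        cls = unicodedata.bidirectional(ch)
        if cls not in seen:
            seen.append(cls)
    return seen


# In memory the username still comes before the "@", whatever the display shows.
stored_username, _, stored_domain = address.partition("@")

# Arabic letters are class AL (right-to-left Arabic letter); "@" is ON
# (other neutral) and "." is CS (common separator), so neither has a
# strong direction of its own and both take it from their surroundings.
arabic_classes = bidi_summary(address)
latin_classes = bidi_summary("user@example.com")

print(arabic_classes)
print(latin_classes)
```

Because "@" and "." carry no strong direction, a renderer resolves them from the neighbouring Arabic letters, which is consistent with the whole address being laid out L⬅︎R even though the username is stored first.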