Volapuk encoding

From Free net encyclopedia

List of encodings
Transcription Cyrillic Volapuk
alternative(s)
a a a
b б 6
v в B
g г r, 2
d д D, g
e е e
yo ё e", e~
zh ж }|{, >|<, *
z з 3
i и u, N
y й u~, u'
k к K
l л J|, Jl, /\
m м M
n н H
o о o
p п n, II
r р p
s с c
t т T, m
u у y
f ф (|), dp
kh х x
tz ц u_, U_, u, , U,
ch ч 4
sh ш W, w, LLI
shch щ W, , LLI, , LLI_
" ъ ~b, `b
y ы bl, b!, b1
' ь b
e э ~), E
yu ю |-0, |-O, I0, IO, I-0, I-O
ya я 9, 9|, 91, R, q

Volapuk encoding (Russian: кодировка "воляпюк" or "волапюк", kodirovka volapyuk) is a slang term for rendering the letters of the Cyrillic alphabet by the Latin ones.

It has been in use since the early days of the internet to write e-mail messages and other texts in Russian in cases where the support of Cyrillic fonts was limited: either the sender did not have a keyboard with Cyrillic letters or the receiver was not necessarily expected to have Cyrillic screen fonts. In the early days the situation was aggravated by a number of mutually incompatible computer encodings for the Cyrillic alphabet, so that the sender and receiver were not guaranteed to have the same one. Also, the 7-bit character encoding of the early days was an additional upset.

Some Russian e-mail providers even included this encoding into the list of available options for the e-mails routed abroad, and their menu looked like, e.g.,

MIME/BASE64, MIME/Quoted-Printable, volapuk, uuencode

The name comes from the Volapük constructed language, for two reasons. A Cyrillic text written in this way looks strange and often funny, just as a Volapük text may appear. At the same time, the word "Volapük" itself sounds funny to Russian ears, so the name stuck. It is worth pointing out here that Volapük is based on English vocabulary, but the resulting language is nothing like English.

Volapuk is not exactly a transliteration. There are no "standardized" rules. For example, some would use the "unused" Latin letters X and Y for Cyrillic Х (Kha) and У (U) that look the same. When written in a hurry, one may easily type, e.g., "P" instead of Р (Er) (R is normally expected). As a result, the text becomes even more funny and difficult to read.

In the late 90's the encoding problem has been resolved almost completely, due to constantly increasing number of internet users in Russia and subsequent development of support by software manufacturers and internet service providers. However, the avalanche-like spreading of cellphones, especially among young people, took the problem over. Until 2000—2001, very few cellphones imported into Russia had support for Cyrillic characters in SMS messages. Over the following five years the situation improved drastically, and now most of the mobile devices in Russia have full support for Cyrillic messaging. Nonetheless, Volapuk is still popular, especially among school and college students, because of the price (messages containing even one Cyrillic character cost twice as much as fully Latin messages; the explanation is that the standard message body can contain 160 Latin symbols, but Cyrillic letters are "coded" with two bytes, thus message size is limited to 70 Cyrillic symbols). This price matter made "volapukization" even more profound, because people not only transliterate Russian words to Latin script, but also abbreviate them chaotically, and change Russian words to English equivalents or "equivalents" (English words are shorter than Russian ones). This resulted in a Russian version of leetspeak, see an example SMS message below.

Some consider it a kind of joke to systematically substitute Cyrillic letters with Latin ones that look the same, rather than sound the same. In certain cases it leads to collisions, e.g., in the case of P and R vs. Cyrillic П (Pe) and Р (Er).

The Latin letters that basically match the Cyrillic ones by look and sound are E, T, O, A, K, M, and sometimes C.
The Latin letters that only look the same are Y, P, H, X, B, and sometimes C.

Some tricks include 'b' for 'ь', 'q' for 'я', the digraph 'b!' for 'Ы', and the trigraph '}|{' for 'Ж'.

Volapuk encoding enthusiasts sometimes use digits to convey similar Cyrillic letters, reminiscent of leetspeak. For example, '4' looks similar to Ч (Che), '9' looks similar to Я (Ya), and '3' is almost ideal for З (Ze).

Example

  • COBETCKIJ COIO3 ("advanced" volapuk)
  • СОВЕТСКИЙ СОЮЗ (Cyrillic/Cypnjijlnk)
  • SOVETSKIY SOYUZ (transliteration/tpahcjintepatnoh)
  • Soviet Union (English/Ihrjlnw)

Example of SMS message:

  • Xai Hat!skazu bcem 4to 9 ne npudy. Dabai bctpet cy6 7ve4era.9 lav tebya. ("advanced" volapuk—the goal was to compact the message down to 70 symbols!)
  • Привет, Наташа. Скажи всем, что я не приду. Давай встретимся в субботу в 7 вечера. Я люблю тебя. (Cyrillic)
  • Privet, Natasha. Skazhi vsem, chto ya ne pridu. Davay vstretimsya v subbotu v 7 vechera. Ya lyublyu tebya. (transliteration)
  • Hi Natasha, tell everyone that I'm not going to come. Let's meet on Saturday, 7PM. I love you. (English)

See also

Template:Internet Dialects Template:Unreferencedsl:Volapiško kodiranje