Photo:1 Photo:2 Photo:3 Photo:4 |
| Types | |
| 2>
Among the types of diacritic used in alphabets based on the Latin script are:
accent marks (thus called because the acute, the grave and the circumflex accent were originally used to indicate different types of pitch accents, in the polytonic transcription of Greek)
◌́ – acute accent (Latin apex)
◌̀ – grave accent
◌̂ – circumflex accent
◌̌ – caron, inverted circumflex, (Czech háček)
◌̋ – double acute accent
◌̏ – double grave accent
dots
◌̇ – dot (Indic anusvara)
◌̣ – a dot below is used in Rheinische Dokumenta
◌·◌ – Interpunct
tittle, the dot used by default in the modern lowercase form of the Latin letters "i" and "j"
◌̈ – trema, diaeresis, or umlaut sign
◌ː – colon, used in the International Phonetic Alphabet to mark long vowels.
ring
◌̊ – ring
vertical line
◌̩ – a vertical line below is used in Rheinische Dokumenta as a schwa mark
macron or horizontal line
◌̄ – macron
◌̱ – macron below
overlays
◌⃓ – bar through the basic letter
◌̷ – slash through the basic letter
◌̵ – stroke through the basic letter
curves
◌̆ – breve
◌͗ – sicilicus, a palaeographic diacritic similar to a caron or breve
◌̃ – tilde
◌҃ – titlo
curls above
◌̓ – apostrophe
◌̉ – hook (Vietnamese dấu hỏi)
◌̛ – horn (Vietnamese dấu móc)
curls below
◌̦ – comma
◌̧ – cedilla
◌̡ ◌̢ – hook, sometimes also attached above
◌̨ – ogonek
double marks (over or under two base characters)
◌͝◌ – double breve
◌͡◌ – ligature tie
◌᷍◌ – double circumflex
◌͞◌ – double macron
◌͠◌ – double tilde
The tilde, dot, comma, titlo, apostrophe, bar, and colon are sometimes diacritics, but also have other uses.
Not all diacritics occur adjacent to the letter they modify. In the Wali language of Ghana, for example, an apostrophe indicates a change of vowel quality, but occurs at the beginning of the word, as in the dialects ’Bulengee and ’Dolimi. Because of vowel harmony, all vowels in a word are affected, so the scope of the diacritic is the entire word. In abugida scripts, like those used to write Hindi and Thai, diacritics indicate vowels, and may occur above, below, before, after, or around the consonant letter they modify.
The dot on the letter i of the Latin alphabet originated as a diacritic to clearly distinguish i from the vertical strokes of the adjacent letters. It first appeared in the sequence ii (as in ingeníí) in Latin manuscripts of the 11th century, then spread to i adjacent to m, n, u, and later spread to all lower-case i. The j, which separated from the i later, inherited the "dot". The shape of the diacritic developed from initially resembling today's acute accent to a long flourish by the 15th century. With the advent of Roman type it was reduced to the round dot we have today.[2]
[edit] Tags:Acute,Apex,Double Acute,Grave,Breve,Dot,Macron,Apostrophe,Bar,Colon,Comma,Tilde,Anusvara,Letter,Greek,Latin Alphabet,Diaeresis,Vowel,Cedilla,S,K,Indic,Titlo,Edit,Latin Script,Pitch Accents,Polytonic Transcription,Acute Accent,Grave Accent,Circumflex Accent,Caron,Double Acute Accent,Double Grave Accent,Rheinische Dokumenta,Interpunct,Tittle,Lowercase,Trema,Umlaut,International Phonetic Alphabet,Long Vowels,Ring,Schwa,Macron Below,Slash,Stroke,Sicilicus,Palaeographic,Hook,Horn,Ogonek,Double Breve,Wali Language,Vowel Harmony,Abugida,Hindi,Roman Type,Form,á,í,ó,Circumflex,č,D, | |
| Arabic | |
| 3>
Further information: Arabic diacritics
(ئ ؤ إ أ and stand alone ء) hamza: indicates a glottal stop.
(ــًــٍــٌـ) tanwīn (تنوين) symbols: Serve a grammatical role in Arabic. The sign ـً is most commonly written in combination with alif, e.g. ـًا.
(ــّـ) shadda: Gemination (doubling) of consonants.
(ٱ) waṣla: Comes most commonly at the beginning of a word. Indicates a type of hamza that is pronounced only when the letter is read at the beginning of the talk.
(آ) madda: A written replacement for a hamza that is followed by an alif, i.e. (ءا). Read as a glottal stop followed by a long /aː/, e.g. ءاداب، ءاية، قرءان، مرءاة are written out respectivially as آداب، آية، قرآن، مرآة. This writing rule does not apply when the alif that follows a hamza is not a part of the stem of the word, e.g. نتوءات is not written out as نتوآت as the stem نتوء does not have an alif that follows its hamza.
(ــٰـ) superscript alif (also "short" or "dagger alif": A replacement for an original alif that is dropped in the writing out of some rare words, e.g. لاكن is not written out with the original alif found in the word pronunciation, instead it is written out as لٰكن.
ḥarakāt (In Arabic: حركات also called تشكيل tashkīl):
(ــَـ) fatḥa (a)
(ــِـ) kasra (i)
(ــُـ) ḍamma (u)
(ــْـ) sukūn (no vowel)
The ḥarakāt or vowel points serve two purposes:
They serve as a phonetical guide. They indicate the presence of short vowels (fatḥa, kasra, or ḍamma) or their absence (sukūn).
At the last letter of a word, the vowel point reflects the inflection case or conjugation mood.
For nouns, The ḍamma is for the nominative, fatḥa for the accusative, and kasra for the genitive.
For verbs, the ḍamma is for the imperfective, fatḥa for the perfective, and the sukūn is for verbs in the imperative or jussive moods.
Vowel points or tashkīl should not be confused with consonant points or iʿjam (إعجام) – one, two or three dots written above or below a consonant to distinguish between letters of the same or similar form.
[edit] Tags:/,Noun,Arabic,Hamza,Glottal Stop,Alif,Shadda,Inflection,Jussive,Iʿjam,ḥ, | |
| Greek | |
| 3>
Further information: Greek diacritics
These diacritics are used in addition to the acute, grave, and circumflex accents and the diaeresis:
◌ͺ (ᾳ) – iota subscript
῾◌ – rough breathing (spiritus asper): aspiration
᾿◌ – smooth (or soft) breathing (spiritus lenis): lack of aspiration
[edit] Tags:Iota Subscript,Rough Breathing,Smooth (or Soft) Breathing, | |
| Hebrew | |
| 3>
Further information: Hebrew diacritics
Gen. 1:9 "And God said, Let the waters be collected".
Letters in black, niqqud in red, cantillation in blue
Niqqud
( ּ ) Dagesh
( ּ ) Mappiq
( ֿ ) Rafe
( ׁ ) Shin dot (at top right corner)
( ׂ ) Sin dot (at top left corner)
( ְ ) Shva
( ֻ ) Kubutz
( ֹ ) Holam
( ָ ) Kamatz
( ַ ) Patakh
( ֶ ) Segol
( ֵ ) Tzeire
( ִ ) Hiriq
Cantillation marks do not generally render correctly; refer to Cantillation#Names and shapes of the ta'amim for a complete table together with instructions for how to maximize the possibility of viewing them in a web browser
Other
( ׳ ) Geresh
( ״ ) Gershayim
[edit] Tags:Hebrew,Niqqud,Cantillation Marks,Gershayim,Cantillation,Dagesh,Mappiq,Rafe,Shin Dot,Sin Dot,Shva,Kubutz,Holam,Kamatz,Patakh,Segol,Tzeire,Hiriq,Geresh, | |
| Korean | |
| 3>
Hunminjeongeum, the Korean alphabet
These diacritics, known as Bangjeom (방점;傍點), were used to mark pitch accents in Hangul for Middle Korean. It is written to the left of a character in vertical writing and above a character in horizontal writing.
〮, 〯
[edit] Tags:Hangul, | |
| Non-alphabetic scripts | |
| 2>
Some non-alphabetic scripts also employ symbols that function essentially as diacritics.
Non-pure abjads (such as Hebrew and Arabic script) and abugidas use diacritics for denoting vowels. Hebrew and Arabic also indicate consonant doubling and change with diacritics; Hebrew and Devanagari use them for foreign sounds. Devanagari and related abugidas also use a diacritical mark called a virama to mark the absence of a vowel. In addition, Devanagari uses the moon-dot chandrabindu ( ँ ).
The Japanese hiragana and katakana syllabaries use the dakuten (◌゛) and handakuten (◌゜) (in Japanese: 濁点 and 半濁点) symbols, also known as nigori (濁) or ten-ten (点々) and maru (丸), to indicate voiced consonants or other phonetical changes.
Emoticons are commonly created with diacritic symbols, especially Japanese emoticons on popular imageboards.
[edit] Tags:Chandrabindu,Virama,Dakuten,Abjads,Devanagari,Hiragana,Katakana,Syllabaries,Voiced Consonants,Emoticons,Japanese,Imageboards, | |
| Alphabetization or collation | |
| 2>
Main article: Collation
Different languages use different rules to put diacritic characters in alphabetical order. French treats letters with diacritical marks the same as the underlying letter for purposes of ordering and dictionaries.
The Scandinavian languages, by contrast, treat the characters with diacritics ä, ö and å as new and separate letters of the alphabet, and sort them after z. Usually ä is sorted as equal to æ (ash) and ö is sorted as equal to ø (o-slash). Also, aa, when used as an alternative spelling to å, is sorted as such. Other letters modified by diacritics are treated as variants of the underlying letter, with the exception that ü is frequently sorted as y.
Languages that treat accented letters as variants of the underlying letter usually alphabetize words with such symbols immediately after similar unmarked words. For instance, in German where two words differ only by an umlaut, the word without it is sorted first in German dictionaries (e.g. schon and then schön, or fallen and then fällen). However, when names are concerned (e.g. in phone books or in author catalogues in libraries), umlauts are often treated as combinations of the vowel with a suffixed e; Austrian phone books now treat characters with umlauts as separate letters (immediately following the underlying vowel).
In Spanish, the grapheme ñ is considered a new letter different from n and collated between n and o, as it denotes a different sound from that of a plain n. But the accented vowels á, é, í, ó, ú are not separated from the unaccented vowels a, e, i, o, u, as the acute accent in Spanish only modifies stress within the word or denotes a distinction between homonyms, and does not modify the sound of a letter.
For a comprehensive list of the collating orders in various languages, see Collating sequence.
[edit] Tags:Homonyms,French,Collation,Alphabetical,Stress,Collating Sequence,ú,é,ö,ø,å,æ,ä, | |
| Generation with computers | |
| 2>
Modern computer technology was developed mostly in the English-speaking countries, so data formats, keyboard layouts, etc. were developed with a bias favoring English, a language with an alphabet without diacritical marks. This has led to fears internationally that the marks and accents may be made obsolete to facilitate the worldwide exchange of data.[citation needed] Efforts have been made to create internationalized domain names that further extend the English alphabet (e.g., "pokémon.com").
Depending on the keyboard layout, which differs amongst countries, it is more or less easy to enter letters with diacritics on computers and typewriters. Some have their own keys; some are created by first pressing the key with the diacritic mark followed by the letter to place it on. Such a key is sometimes referred to as a dead key, as it produces no output of its own, but modifies the output of the key pressed after it.
In modern Microsoft Windows and Linux operating systems, the keyboard layouts US International and UK International feature dead keys that allow one to type Latin letters with the acute, grave, circumflex, diæresis, tilde, and cedilla found in Western European languages (specifically, those combinations found in the ISO Latin-1 character set) directly: "+e gives ë, ~+o gives õ, etc. On Apple Macintosh computers, there are keyboard shortcuts for the most common diacritics; Option-e followed by a vowel places an acute accent, Option-u followed by a vowel gives an umlaut, option-c gives a cedilla, etc. Diacritics can be composed in most X Window System keyboard layouts, as well as other operating systems, such as Microsoft Windows, using additional software.
On computers, the availability of code pages determines whether one can use certain diacritics. Unicode solves this problem by assigning every known character its own code; if this code is known, most modern computer systems provide a method to input it. With Unicode, it is also possible to combine diacritical marks with most characters.
[edit] Tags:English,Internationalized Domain Names,Dead Key,Dead Keys,Iso Latin-1,Composed,X Window System,Unicode,Combine Diacritical Marks, | |
| Languages with letters containing diacritics | |
| 2>
The following languages have letters that contain diacritics that are considered independent letters distinct from those without diacritics.
Germanic
Faroese uses acute accents and other special letters. All are considered separate letters and have their own place in the alphabet: á, í, ó, ú, ý, and ø.
Icelandic uses acute accents and other special letters. All are considered separate letters, and have their own place in the alphabet: á, é, í, ó, ú, ý, and ö.
Danish and Norwegian uses additional characters like the o-slash ø and the a-circle å. These letters are collated after z and æ, in the order ø, å. Historically the å has developed from a ligature by writing a small a on top of the letter a; if an å character is unavailable, some Scandinavian languages allow the substitution of a doubled a. The Scandinavian languages collate these letters after z, but have different collation standards.
Swedish uses characters identical to a-umlaut (ä) and o-umlaut (ö) in the place of ash and o-slash in addition to the a-circle (å). Historically the umlaut for the Swedish letters ä and ö, like the German umlaut, has developed from a small gothic e written on top of the letters. These letters are collated after z, in the order å, ä, ö.
Celtic
Irish uses acute accents, called fadas. Fadas are used to indicate vowel length. These are the following vowels á, é, í, ó, ú.
Welsh uses the circumflex, diaeresis, acute and grave accents on its seven vowels a, e, i, o, u, w, y.
Following orthographic reforms since the 1970s, Scottish Gaelic uses grave accents only, which can be used on any vowel (à, è, ì, ò, ù). Formerly acute accents could be used on á, ó and é, which were used to indicate a specific vowel quality. With the elimination of these accents, the new orthography relies on the reader having prior knowledge of pronunciation of a given word.
Manx uses the single diacritic ç combined with h to give the digraph <çh> (pronounced /tʃ/) to mark the distinction between it and the digraph <ch> (pronounced /h/ or /x/). Other diacritics used in Manx included â, ê, ï, etc. to mark the distinction between two similarly spelled words but with slightly differing pronunciation.
Some orthographies of Cornish such as Kernowek Standard and Unified Cornish use diacritics, while others such as Kernewek Kemmyn and the Standard Written Form do not.
Romance
Asturian, Galician and Spanish, the character ñ is a letter and collated between n and o
Asturian uses Ḷ (lower case ḷ), and Ḥ (lower case ḥ)[3]
Leonese: could use ñ or nn.
Romanian uses a breve on the letter a (ă) to indicate the sound schwa /ə/, as well as a circumflex over the letters a (â) and i (î) for the sound /ɨ/. Romanian also writes a comma below the letters s (ș) and t (ț) to represent the sounds /ʃ/ and /t͡s/, respectively. These characters are collated after their non-diacritic equivalent.
Slavic
Bosnian, Croatian, and Serbian latin alphabet have the symbols č, ć, đ, š and ž, which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order. Bosnian and Croatian also have one digraph including a diacritic, dž, which is also alphabetised independently, and follows d and precedes đ in the alphabetical order. The Serbian Cyrillic alphabet has no diacritics.
The Czech alphabet contains 27 graphemes (letters) when written without diacritics and 42 graphemes when written including them. Czech uses the acute (á é í ó ú ý), caron (č ď ě ň ř š ť ž), and for one letter (ů) the ring.
Polish has the following letters: ą ć ę ł ń ó ś ź ż. These are considered to be separate letters, each of them is placed in alphabet right after its Latin counterpart (i.e. ą between a and b), ź and ż are placed after z in this order.
The Slovak alphabet uses the acute (á é í ó ú ý ĺ ŕ), caron (č ď ľ ň š ť ž), umlaut (ä) and circumflex accent (ô).
The Slovene alphabet has the symbols č, š and ž, which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order.
Baltic
Latvian has the following letters: ā ē ī ū ŗ ļ ķ ņ ģ š ž č.
Lithuanian. In general usage, where letters appear with the caron (č, š and ž) they are considered as separate letters from c, s or z and collated separately; letters with the ogonek (ą, ę, į and ų), the macron (ū) and the superdot (ė) are considered as separate letters as well, but not given a unique collation order.
Finno-Ugric
Estonian has a distinct letter õ, which contains a tilde. Estonian "dotted vowels" ä, ö, ü are similar to German, but these are also distinct letters, not like German umlauted letters. All four have their own place in the alphabet, between w and x. Carons in š or ž appear only in foreign proper names and loanwords. Also these are distinct letters, placed in the alphabet between s and t.
Finnish uses dotted vowels (ä and ö). As in Swedish and Estonian, these are regarded as individual letters, rather than vowel + umlaut combinations (as happens in German). It also uses the characters å, š and ž in foreign names and loanwords. In the Finnish and Swedish alphabets, å, ä and ö collate as separate letters after z, the others as variants of their base letter.
Hungarian uses the umlaut, the acute and double acute accent (unique to Hungarian): ö ü, á é í ó ú and ő ű. The acute accent indicates the long form of a vowel (in case of i/í, o/ó, u/ú) while the double acute performs the same function for ö and ü. The acute accent can also indicate a different sound (more open, like in case of a/á, e/é). Both long and short forms of the vowels are listed separately in the Hungarian alphabet but members of the pairs a/á, e/é, i/í, o/ó, ö/ő, u/ú and ü/ű are collated in dictionaries as the same letter.
Livonian has the following letters: ā, ä, ǟ, ḑ, ē, ī, ļ, ņ, ō, ȯ, ȱ, õ, ȭ, ŗ, š, ț, ū, ž.
Turkic
Azerbaijani includes the distinct Turkish alphabet letters Ç, Ğ, I, İ, Ö, Ş and Ü.
Crimean Tatar includes the distinct Turkish alphabet letters Ç, Ğ, I, İ, Ö, Ş and Ü. Unlike Standard Turkish (but like Cypriot Turkish), Crimean Tatar also has the letter Ñ.
Gagauz includes the distinct Turkish alphabet letters Ç, Ğ, I, İ, Ö and Ü. Unlike Turkish, Gagauz also has the letters Ä, Ê Ș and Ț. Ș and Ț are derived from the Romanian alphabet for the same sounds. Sometime the turkish Ş may be used instead.
Turkish uses a G with a breve (Ğ), two letters with an umlaut (Ö and Ü, representing two rounded front vowels), two letters with a cedilla (Ç and Ş, representing the affricate /tʃ/ and the fricative /ʃ/), and also possesses a dotted capital İ (and a dotless lowercase ı representing a high unrounded back vowel). In Turkish each of these are separate letters, rather than versions of other letters, where dotted capital İ and lower case i are the same letter, as are dotless capital I and lowercase ı. Typographically, Ç and Ş are often rendered with a subdot, as in Ṣ; when a hook is used, it tends to have more a comma shape than the usual cedilla. The new Azerbaijani, Crimean Tatar, and Gagauz alphabets are based on the Turkish alphabet and its same diacriticized letters, with some additions.
Other
Albanian has two special letters Ç and Ё upper and lowercase. They are placed next to the most similar letters in the alphabet, c and e correspondingly.
Esperanto has the symbols ŭ, ĉ, ĝ, ĥ, ĵ and ŝ, which are included in the alphabet, and considered separate letters.
Hawaiian uses the kahakô (macron) over vowels, although there is some disagreement over considering them as individual letters. The kahakô over a vowel can completely change the meaning of a word that is spelled the same but without the kahakô.
Kurdish uses the symbols Ç, Ê, Î, Ş and Û with other 26 standard Latin alphabet symbols.
Tags:Orthography,Faroese,Acute Accents, | |
zote monety |