'C' comes from the same letter as 'G'. The Semites named it gimel. The sign is possibly adapted from an Egyptian hieroglyph for a staff sling, which may have been the meaning of the name gimel. Another possibility is that it depicted a camel, the Semitic name for which was gamal.
In the Etruscan language, plosive consonants had no contrastive voicing, so the Greek 'Γ' (Gamma) was adopted into the Etruscan alphabet to represent /k/. Already in the Western Greek alphabet, Gamma first took a '' form in Early Etruscan, then '' in Classical Etruscan. In Latin it eventually took the 'c' form in Classical Latin. In the earliest Latin inscriptions, the letters 'c k q' were used to represent the sounds /k/ and /ɡ/ (which were not differentiated in writing). Of these, 'q' was used to represent /k/ or /ɡ/ before a rounded vowel, 'k' before 'a', and 'c' elsewhere. During the 3rd century BC, a modified character was introduced for /ɡ/, and 'c' itself was retained for /k/. The use of 'c' (and its variant 'g') replaced most usages of 'k' and 'q'. Hence, in the classical period and after, 'g' was treated as the equivalent of Greek gamma, and 'c' as the equivalent of kappa; this shows in the romanization of Greek words, as in 'KAΔMOΣ', 'KYPOΣ', and 'ΦΩKIΣ' came into Latin as 'cadmvs', 'cyrvs' and 'phocis', respectively.
Other alphabets have letters homoglyphic to 'c' but not in use and derivation, like the Cyrillic letter Es (С, с) which derives from the lunate sigma, named due to its resemblance to the crescent moon.
When the Roman alphabet was introduced into Britain, 'c' represented only /k/ and this value of the letter has been retained in loanwords to all the insular Celtic languages: in Welsh, Irish, Gaelic, 'c' represents only /k/. The Old English or "Anglo-Saxon" writing was learned from the Celts, apparently of Ireland; hence 'c' in Old English also originally represented /k/; the Modern English words kin, break, broken, thick, and seek, all come from Old English words written with 'c': cyn, brecan, brocen, þicc, and séoc. But during the course of the Old English period, /k/ before front vowels (/e/ and /i/) were palatalized, having changed by the tenth century to [tʃ], though 'c' was still used, as in cir(i)ce, wrecc(e)a. On the continent, meanwhile, a similar phonetic change had also been going on (for example, in Italian).
In Vulgar Latin, /k/ became palatalized to [tʃ] in Italy and Dalmatia; in France and the Iberian peninsula, it became [ts]. Yet for these new sounds 〈c〉 was still used before front vowels ⟨e⟩,⟨ i⟩. The letter thus represented two distinct values. Subsequently, the Latin phoneme /kʷ/ (spelled 〈qv〉) de-labialized to /k/ meaning that the various Romance languages had /k/ before front vowels. In addition, Norman used the Greek letter 'k' so that the sound /k/ could be represented by either 'k' or 'c' the latter of which could represent either /k/ or /ts/ depending on whether it preceded a front vowel or not. The convention of using both 'c' and 'k' was applied to the writing of English after the Norman Conquest, causing a considerable re-spelling of the Old English words. Thus while Old English candel, clif, corn, crop, cú, remained unchanged, Cent, cæ´ᵹ (cé´ᵹ), cyng, brece, séoce, were now (without any change of sound) spelled 'Kent', 'keȝ', 'kyng', 'breke', and 'seoke'; even cniht ('knight') was subsequently changed to 'kniht' and þic ('thick') changed to 'thik' or 'thikk'. The Old English 'cw' was also at length displaced by the French 'qu' so that the Old English cwén ('queen') and cwic ('quick') became Middle English 'quen' 'quik', respectively. [tʃ] to which Old English palatalized /k/ had advanced, also occurred in French, chiefly from Latin /k/ before 'a'. In French it was represented by 'ch', as in champ (from Latin camp-um) and this spelling was introduced into English: the Hatton Gospels, written about 1160, have in Matt. i-iii, child, chyld, riche, mychel, for the cild, rice, mycel, of the Old English version whence they were copied. In these cases, the Old English 'c' gave place to 'k qu ch' but, on the other hand, 'c' in its new value of /ts/ came in largely in French words like processiun, emperice, grace, and was also substituted for 'ts' in a few Old English words, as miltse, bletsien, in early Middle English milce, blecien. By the end of the thirteenth century both in France and England, this sound /ts/ de-affricated to /s/; and from that time 'c' has represented /s/ before front vowels either for etymological reasons, as in lance, cent, or (in defiance of etymology) to avoid the ambiguity due to the "etymological" use of 's' for /z/, as in ace, mice, once, pence, defence.
Thus, to show the etymology, English spelling has advise, devise, instead of advize, devize, which while advice, device, dice, ice, mice, twice, etc., do not reflect etymology; example has extended this to hence, pence, defence, etc., where there is no etymological necessity for 'c'. Former generations also wrote sence for sense. Hence, today the Romance languages and English have a common feature inherited from Vulgar Latin where 'c' takes on either a "hard" or "soft" value depending on the following vowel.
Use in orthographies
In English orthography, 'c' generally represents a "soft" value of // before the vowel letters 'e' (including the Latin-derived digraphs ae and oe), 'i' and 'y' and a "hard" value of // before the vowel letters 'a', 'o' and 'u'. However, there are a number of exceptions in English: "soccer" and "Celt" are words that have // where // would be expected.
The digraph 'ch' most commonly represents //, but can take the value // (mainly in words of Greek origin) or // (mainly in words of French origin); some dialects of English also have // in words like loch where other speakers pronounce the final sound as //. The trigraph 'tch' always represents //.
In the Romance languages French, Spanish, Italian and Portuguese, 'c' generally has a "hard" value of /k/ and a "soft" value, the pronunciation of which varies by language. In French, Portuguese, and Spanish from Latin America and southern Spain, the soft 'c' value is /s/ as it is in English. In the Spanish spoken in northern and central Spain, the soft 'c' is a voiceless dental fricative /θ/. In Italian and Romanian, the soft 'c' is [t͡ʃ].
All Balto-Slavic languages that use the Latin alphabet, as well as Albanian, Hungarian, Pashto, several Sami languages, Esperanto, Ido, Interlingua, and Americanist phonetic notation (and those aboriginal languages of North America whose practical orthography derives from it) use 'c' to represent /t͡s/, the voiceless alveolar or voiceless dental sibilant affricate. In romanized Chinese, the letter represents an aspirated version of this sound, /t͡sʰ/.
Among non-European languages that have adopted the Latin alphabet, 'c' represents a variety of sounds. Yup'ik, Indonesian, Malay, and a number of African languages such as Hausa, Fula, and Manding share the soft Italian value of /t͡ʃ/. In Azeri, Kurdish, Tatar, and Turkish 'c' stands for the voiced counterpart of this sound, the voiced postalveolar affricate /d͡ʒ/. In Yabem and similar languages, such as Bukawa, 'c' stands for a glottal stop /ʔ/. Xhosa and Zulu use this letter to represent the click /ǀ/. in some other African languages, such as Beninese Yoruba, 'c' is used for /ʃ/. In Fijian, 'c' stands for a voiced dental fricative /ð/, while in Somali it has the value of /ʕ/.
There are several common digraphs with 'c', the most common being 'ch', which in some languages such as German is far more common than 'c' alone. 'Ch' takes various values in other languages, such as:
- /t͡ʃ/ in Spanish
- /ʃ/ in French and Portuguese
- /k/ in Interlingua and Italian
- /x/ in the West Slavic languages (e.g. Polish, Czech and Slovak)
- /x/ (comprising the mostly allophonic sounds [x] and [ç]) or sometimes /k/ in German
- /x/ or /χ/ in Dutch
- /tʂʰ/ in Romanized Standard Chinese
Other digraphs and trigraphs
As in English, 'Ck', with the value /k/, is often used after short vowels in other Germanic languages such as German and Swedish (but some other Germanic languages use 'kk' instead, such as Dutch and Norwegian). The digraph 'cz' is found in Polish and 'cs' in Hungarian, both representing /t͡ʃ/. The digraph 'sc' represents /ʃ/ in Old English, Italian, and a few languages related to Italian, (however in Italian and related languages this only happens before front vowels, otherwise it represents /sk/). The trigraph 'sch' represents /ʃ/ in German.
As a phonetic symbol, lowercase 'c' is the International Phonetic Alphabet (IPA) and X-SAMPA symbol for the voiceless palatal plosive, and capital 'C' is the X-SAMPA symbol for the voiceless palatal fricative.
Related letters and other similar characters
|Unicode name||LATIN CAPITAL LETTER C||LATIN SMALL LETTER C|
|Numeric character reference||C||C||c||c|
- 1 Also for encodings based on ASCII, including the DOS, Windows, ISO-8859 and Macintosh families of encodings.
- "C" Oxford English Dictionary, 2nd edition (1989); Merriam-Webster's Third New International Dictionary of the English Language, Unabridged (1993); "cee", op. cit.
- Sihler, Andrew L. (1995). New Comparative Grammar of Greek and Latin (illustrated ed.). New York: Oxford University Press. p. 21. ISBN 0-19-508345-8.
|Wikisource has the text of the 1911 Encyclopædia Britannica article C.|
- Media related to C at Wikimedia Commons
- The dictionary definition of C at Wiktionary
- The dictionary definition of c at Wiktionary
Letter C with diacritics