Conlang X-SAMPA Transcription Scheme


The Conlang X-SAMPA (CXS) transcription scheme is documented by Henrik Theiling. It is used on the conlang mailing list to transcribe the pronunciation of conlangs, but can be used outside that domain. CXS derives from X-SAMPA but makes several incompatible changes, choosing representations that are closer to the IPA symbols so that transcriptions are easier to read.

This document uses a set of phoneme features based on the features originally developed by Evan Kirshenbaum. These features are described in the phonemes document.

The _ character is used as a tie bar to join two phonemes, and is also the first character of several diacritics. As a tie bar, it can be used for affricates and double articulations. The _^ non-syllabic marker can be used for the second vowel in diphthongs.

The ) character is used by CXS as an alternative tie bar that is placed after the second phoneme. CXS prefers it because it does not clash with any diacritics. For example, ts) is an affricate, whereas ts is a consonant cluster of t followed by s.

CXS does not support the X-SAMPA convention of using the - character to separate consonant clusters such as t-s from affricates, double articulations, and diphthongs such as ts.
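
The ) tie bar rule can be illustrated with a minimal Python sketch. The function name group_tied and the simplified BASE pattern are assumptions made for this example; they cover only plain base symbols (optionally followed by \ and/or `) and are not part of CXS or of any existing library.

```python
import re

# Hypothetical, simplified base-symbol pattern: one character optionally
# followed by a backslash and/or a backtick (e.g. t, J\, s`, r\`).
# Diacritics, stress and length marks are deliberately not handled here.
BASE = re.compile(r"[A-Za-z0-9@&?]\\?`?")


def group_tied(text):
    """Split a transcription into units, joining phonemes tied with ')'."""
    units = []
    pos = 0
    while pos < len(text):
        if text[pos] == ")":
            # The tie bar follows the second phoneme, so merge the last two units.
            if len(units) < 2:
                raise ValueError("tie bar needs two preceding phonemes")
            second = units.pop()
            first = units.pop()
            units.append(first + second)
            pos += 1
            continue
        match = BASE.match(text, pos)
        if match is None:
            raise ValueError(f"unsupported symbol at offset {pos}")
        units.append((match.group(),))
        pos = match.end()
    return units
```

With this sketch, group_tied("ts)") yields a single two-phoneme unit (the affricate), while group_tied("ts") yields two separate units (the cluster).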

Phoneme Transcription Schemes

BCP47 Subtag  Abbreviation  Transcription Scheme                                  Encoding
fonipa        IPA           International Phonetic Alphabet                       Unicode
fonxsamp      X-SAMPA       Extended Speech Assessment Methods Phonetic Alphabet  ASCII
x-foncxs      CXS           Conlang X-SAMPA                                       ASCII
x-fonkirsh                  Kirshenbaum (ASCII-IPA)                               ASCII
  1. foncxs and fonkirsh are private use extensions defined in the bcp47-extensions file, so they take the x- private use prefix before their subtag names.

Consonants

             blb       lbd       dnt       alv       pla       rfx       alp       pal       vel       uvl       phr       glt
             vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd
nas               m         F                   n                   n`                  J         N         N\
stp          p    b                        t    d              t`   d`             c    J\   k    g    q    G\   >\        ?
sib afr
afr
lat afr
sib frc                                    s    z    S    Z    s`   z`   s\   z\
frc          p\   B    f    v    T    D                                            C    j\   x    G    X    R              h    h\
lat frc                                    K    K\
apr                         P                   r\                  r\`                 j         M\
lat apr                                         l                   l`                  L         L\
flp                                             4                   r`
lat flp                                         l\
trl               B\                            r                                                           R\   H\   <\
clk          O\                  |\        !\                  !\`                 =\
lat clk                                    |\|\
imp               b_<                           d_<                                     J\_<      g_<       G\_<
ejc          p_>                           t_>                 t`_>                c_>       k_>       q_>       >\_>
ejc frc                f_>       T_>       s_>       S_>       s`_>                          x_>       X_>
lat ejc frc                                K_>
  1. The X-SAMPA X\ and ?\ are not specified by CXS. Instead, the H\ and <\ consonants are transcribed as fricatives and are included in their places.

  2. v\ can also be used for the labio-dental approximant P, as in X-SAMPA.

  3. !\` is specific to CXS. It is not listed in the IPA chart either.

Other Symbols

bld  alv  pla  pal  lbv  vel
vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd  vls  vcd
nas
stp
afr
vzd frc      x\
ptr apr      H  W  w
fzd lat apr  5
  1. 5 is supported as an alternative to l_e due to its use in some language-specific SAMPA phoneme sets.

Manner of Articulation

Feature  Symbol      Name
ejc      _>          ejective
imp      _<          implosive

Vowels

     fnt       cnt       bck
     unr  rnd  unr  rnd  unr  rnd
hgh  i    y    i\   u\   M    u
smh  I    Y    I\   U\        U
umd  e    2    @\   8    7    o
mid            @
lmd  E    9    3    3\   V    O
sml  &         6
low  a    &\             A    Q
  1. i\ is in common usage in the conlang mailing list, but the X-SAMPA 1 transcription is also supported.

  2. In CXS, the following vowels are different to the X-SAMPA transcriptions: } is replaced by u\, { by &, and & by &\. This makes CXS incompatible with X-SAMPA.

  3. I\ and U\ are not listed in the IPA chart.
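
The vowel differences in note 2 can be sketched as a small X-SAMPA to CXS converter. This is a hedged illustration only: the names XSAMPA_TO_CXS and xsampa_vowels_to_cxs are hypothetical, and the sketch assumes the input uses } only as a vowel or inside the _} release diacritic.

```python
import re

# Vowel substitutions from note 2: } becomes u\, { becomes &, & becomes &\.
XSAMPA_TO_CXS = {"}": "u\\", "{": "&", "&": "&\\"}

# Match the _} diacritic first so that its } is left untouched, then the
# three vowel symbols. A single pass avoids chaining { -> & -> &\.
_PATTERN = re.compile(r"_\}|[}{&]")


def xsampa_vowels_to_cxs(text):
    """Rewrite the X-SAMPA vowels that CXS transcribes differently."""
    return _PATTERN.sub(
        lambda m: XSAMPA_TO_CXS.get(m.group(), m.group()), text
    )
```

Doing all replacements in one pass matters here: applying them one after another would turn { into & and then into &\.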

Other Symbols

Symbol  Features
@`      unr mid cnt rzd vwl
3`      unr lmd cnt rzd vwl
  1. @` and 3` are not explicitly listed in CXS. The rhoticized diacritic is specified instead.

Diacritics

Articulation

Feature  Symbol      Name
lgl      ◌_N         linguolabial
idt                  interdental
         ◌_d         dental
apc      ◌_a         apical
lmn      ◌_m         laminal
         ◌_+         advanced
         ◌_-         retracted
         ◌_"         centralized
                     mid-centralized
         ◌_r         raised
         ◌_o         lowered

The articulations that do not have a corresponding feature name are recorded using the features of their new location in the consonant or vowel charts, not using the features of the base phoneme.

Phonation

Feature  Symbol      Name
brv      ◌_t         breathy voice
slv      ◌_0         slack voice
stv      ◌_v         stiff voice
crv      ◌_k         creaky voice
glc      ?_◌         glottal closure

The IPA _0 diacritic is also used to fill the vls spaces in the IPA consonant charts. Thus, when _0 is used with a vcd consonant that does not have an equivalent vls consonant, the resulting consonant is vls, not slv.
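
The _0 rule above can be sketched as a lookup. The table VOICELESS_COUNTERPART is a small hypothetical subset of the voiced/voiceless pairs from the consonant chart, and the function name is an assumption made for this example.

```python
# Hypothetical subset of voiced consonants that have their own voiceless
# symbol in the consonant chart (vcd -> vls).
VOICELESS_COUNTERPART = {"b": "p", "d": "t", "g": "k", "z": "s", "v": "f"}


def voicing_with_ring(consonant):
    """Return the voicing feature of a voiced consonant followed by _0.

    If the consonant has a dedicated voiceless symbol, _0 marks slack
    voice (slv); otherwise _0 fills the empty vls slot in the chart.
    """
    return "slv" if consonant in VOICELESS_COUNTERPART else "vls"
```

For example, m has no voiceless symbol in the chart, so m_0 is read as vls, while b_0 is read as slv because p already fills the vls slot.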

Rounding and Labialization

Feature  Symbol      Name
ptr      ◌_w         protruded
cmp                  compressed

The degree of rounding/labialization can be specified using the following symbols:

Feature  Symbol      Name
mrd      ◌_O         more rounded
lrd      ◌_c         less rounded

Syllabicity

Feature  Symbol      Name
syl      ◌=          syllabic
nsy      ◌_^         non-syllabic

Consonant Release

Feature  Symbol      Name
asp      ◌_h         aspirated
nrs      ◌_n         nasal release
lrs      ◌_l         lateral release
unx      ◌_}         no audible release (unexploded)

Co-articulation

Feature  Symbol      Name
pzd      ◌_j, ◌;     palatalized
vzd      ◌_G, ◌_e    velarized
fzd      ◌_?\, ◌_e   pharyngealized
nzd      ◌~, ◌_~     nasalized
rzd      ◌`          rhoticized
  1. CXS does not support using ' for palatalisation as it conflicts with the primary stress symbol. The ; symbol is CXS specific.

Tongue Root

The tongue root position can be specified using the following features:

Feature  Symbol      Name
atr      ◌_A         advanced tongue root
rtr      ◌_q         retracted tongue root

Suprasegmentals

Stress

Symbol       Name
'◌, "◌       primary stress
"◌, ,◌, %◌   secondary stress
  1. The " primary and % secondary stress markers are from X-SAMPA. The other symbols are CXS specific.

Length

Feature  Symbol      Name
est      ◌_X         extra short
hlg      ◌:\         half-long
lng      ◌:          long
elg      ◌::         extra long
  1. The :: symbol for elg is not listed in X-SAMPA, but is derived from the transcription for lng.
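
Because :: and :\ both begin with the lng marker :, a reader of these marks has to try the longer symbols first. A minimal sketch, assuming the length mark is the last thing in the phoneme string (LENGTH_MARKS and split_length are hypothetical names):

```python
# Length marks ordered so that longer symbols are tested before the
# single ':' they start with.
LENGTH_MARKS = [("::", "elg"), (":\\", "hlg"), (":", "lng"), ("_X", "est")]


def split_length(phoneme):
    """Split a trailing length mark off a phoneme.

    Returns (base, feature), where feature is None if no mark is present.
    """
    for mark, feature in LENGTH_MARKS:
        if phoneme.endswith(mark):
            return phoneme[: -len(mark)], feature
    return phoneme, None
```

Here split_length("a::") gives ("a", "elg") rather than treating the final : as lng on top of "a:".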

Rhythm

Symbol  Name
◌-\◌    linking (no break)

Tones

Symbol  Name
◌_T     extra high tone
◌_H     high tone
◌_M     mid tone
◌_L     low tone
◌_B     extra low tone
!◌      downstep
^◌      upstep

X-SAMPA additionally defines symbols for contour tones, each of which can also be written as a composite of the level tone marks.

Symbol  Composite  Name
◌_R     ◌_B_T      rising contour
◌_F     ◌_T_B      falling contour
◌_R_F   ◌_M_H_M    rising-falling contour
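
The contour table can be read as a rewrite rule from contour symbols to composite tone marks. The following is a sketch under that reading; CONTOUR_TO_COMPOSITE and expand_contours are hypothetical names, not part of CXS or X-SAMPA tooling.

```python
import re

# Contour symbols and their composite forms, taken from the table above.
CONTOUR_TO_COMPOSITE = {"_R_F": "_M_H_M", "_R": "_B_T", "_F": "_T_B"}

# _R_F is listed first so it is matched as a whole rather than as _R
# followed by _F.
_CONTOUR = re.compile(r"_R_F|_R|_F")


def expand_contours(text):
    """Replace contour tone symbols with their composite tone marks."""
    return _CONTOUR.sub(lambda m: CONTOUR_TO_COMPOSITE[m.group()], text)
```

For instance, expand_contours("ma_R") produces "ma_B_T", and "ma_R_F" becomes "ma_M_H_M".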

Intonation

Symbol  Name
<R>     global rise
<F>     global fall

References

  1. Theiling, Henrik, Conlang X-Sampa: modified X-Sampa used on the Conlang Mailing List. Revised 2017.