ARPABET

Quality 0.50 · 1 view · Updated 2 months ago

Phonetic notation

title: "ARPABET" type: doc version: 1 created: 2026-02-28 author: "Wikipedia contributors" status: active scope: public tags: ["phonetic-alphabets", "advanced-research-projects-agency", "ascii", "american-english", "computer-related-introductions-in-the-1970s", "1970s-in-the-united-states", "1970s-in-science"] description: "Phonetic notation" topic_path: "geography/united-states" source: "https://en.wikipedia.org/wiki/ARPABET" license: "CC BY-SA 4.0" wikipedia_page_id: 0 wikipedia_revision_id: 0

::summary Phonetic notation ::

ARPABET (also spelled ARPAbet) is a set of phonetic transcription codes developed by Advanced Research Projects Agency (ARPA) as a part of their Speech Understanding Research project in the 1970s. It represents phonemes and allophones of General American English with distinct sequences of ASCII characters. Two systems, one representing each segment with one character (alternating upper- and lower-case letters) and the other with one or two (case-insensitive), were devised, the latter being far more widely adopted.

ARPABET has been used in several speech synthesizers, including Computalker for the S-100 system, SAM (Software Automatic Mouth) for the Atari 8-bit computers and Commodore 64, the Say utility shipped with the Amiga, TextAssist for the PC and Speakeasy from Intelligent Artefacts which used the Votrax SC-01 speech synthesiser IC. It is also used in the CMU Pronouncing Dictionary. A revised version of ARPABET is used in the TIMIT corpus.

Symbols

Stress is indicated by a digit immediately following a vowel. Auxiliary symbols are identical in 1- and 2-letter codes. In 2-letter notation, segments are separated by a space.

::data[format=table title="Vowels{{cite journal|last=Rice|first=Lloyd|date=April 1976|title=Hardware & software for speech synthesis|journal=[[Dr. Dobb's Journal of Computer Calisthenics & Orthodontia]]|volume=1|issue=4|pages=6–8|url=https://archive.org/stream/dr_dobbs_journal_vol_01#page/n93}}"]

ARPABET	IPA	Example(s)	1-letter	2-letter
a	AA	~	balm, bot (with father–bother merger)
@	AE		bat
A	AH		buck
c	AO		caught, story
W	AW		bout
x	AX		comma
	last1=Jurafsky	first1=Daniel	authorlink=Daniel Jurafsky	last2=Martin
Y	AY		bite
E	EH		bet
R	ER		bird, foreword
e	EY		bait
I	IH		bit
X	IX		roses, rabbit
i	IY		beat
o	OW		boat
O	OY		boy
U	UH		book
u	UW		boot
	UX		dude
::

::data[format=table title="Consonants"]

ARPABET	IPA	1-letter
b	B	buy
C	CH	China
d	D	die
D	DH	thy
F	DX	butter
L	EL	bottle
M	EM	rhythm
N	EN	button
f	F	fight
g	G	guy
h	HH or H	high
J	JH	jive
k	K	kite
l	L	lie
m	M	my
n	N	nigh
G	NX or NG	sing
	NX	winter
p	P	pie
Q	Q	uh-oh
r	R	rye
s	S	sigh
S	SH	shy
t	T	tie
T	TH	thigh
v	V	vie
w	W	wise
H	WH	why (without wine–whine merger)
y	Y	yacht
z	Z	zoo
Z	ZH	pleasure
::

::data[format=table title="Stress and auxiliary symbols"]

AB	Description
0	No stress
1	Primary stress
2	Secondary stress
3...	Tertiary and further stress
-	Silence
!	Non-speech segment
+	Morpheme boundary
/	Word boundary
#	Utterance boundary
:	Tone group boundary
:1 or .	Falling or declining juncture
:2 or ?	Rising or internal juncture
:3 or .	Fall-rise or non-terminal juncture
::

TIMIT

In TIMIT, the following symbols are used in addition to the ones listed above:

::data[format=table]

Symbol	Example	Description
AX-H	suspect	Devoiced
BCL	obtain	closure
DCL	width	closure
ENG	Washington	Syllabic
GCL	dogtooth	closure
HV	ahead	Voiced
KCL	doctor	closure
PCL	accept	closure
TCL	catnip	closure
PAU		Pause
EPI		Epenthetic silence
H#		Begin/end marker
::

References

Klautau, Aldebaro. (2001). "ARPABET and the TIMIT alphabet".
Rice, Lloyd. (April 1976). "Hardware & software for speech synthesis". [[Dr. Dobb's Journal of Computer Calisthenics & Orthodontia]].
(2000). "Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition". Prentice Hall.
(October 12, 1990). "Table of all the phonemic and phonetic symbols used in the TIMIT lexicon". Linguistic Data Consortium.

::callout[type=info title="Wikipedia Source"] This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page. ::