Language Codes

Background
ISO 639 Language codes are the most commonly used standard for classifying Languages. There are two standards published under ISO 639: ISO 639-1 consists of 136 two-character codes used to identify the world's major languages. ISO 639-2 consists of three-character language codes, initiated due to the fact that the two-letter codes in the ISO 639-1 standard is not able to accommodate a sufficient number of languages. The ISO 639-2 standard was first released in 1998.

No new codes have been added to ISO 639-1 since 2003.

Note: Viewing/Filtering content is free for all datasets below.

CDH Language Codes Datasets
The following are the various datasets available on CDH for the different coding schemes around languages of the world. Different views of the information have been made available for ease of use. Please note that some of the datasets require a paid subscription (indicated by the symbol), primarily due to the value-add of consolidating various sources, additional attributes, and incremental updates. All reasonable commercial attempt is made to keep the information up-to-date.
GLO1 ISO 639-1 Language Codes Languages of the world as specified by the ISO 639-1 standard with 2-letter codes.
GLO2 ISO 639-2 Language Codes Languages of the world as specified by the ISO 639-2 standard with 3-letter codes.
GLO2S ISO 639 Language Codes Languages of the world as specified by the ISO 639-2 standard, along with ISO 639-1 2-letter codes.
Dataset Features and Price List
Feature GLO1
FREE
GLO2
FREE
GLO2S
$50
ISO 639-1 Language Name: Language name as defined by the ISO 639-1 Standard. X
ISO 639-1 Language Char 2 Code: Two-character Language code as defined by the ISO 639-1 Standard. X X
ISO 639-2 Language Name (French): Language name in French as defined by the ISO 639-2 Standard. X X X
ISO 639-1 Language Char 2 Code (Upper Case): Two-character Language code in Upper Case as defined by the ISO 639-1 Standard. X
ISO 639-1 Language Name (Upper Case): Language name in Upper Case as defined by the ISO 639-1 Standard. X
ISO 639-2 Language Name: Language name as defined by the ISO 639-2 Standard. X X
ISO 639-2 Char 3 Code (Bibliographic): Three-character Language Bibliographic Code assigned to every language, as defined by the ISO 639-2 Standard. X X
ISO 639-2 Char 3 Code (Terminological): Three-character Language Terminological alternate Code that is derived from the language name. Defined by the ISO 639-2 Standard. X X
Language Alternate Names: Alternate names given to the language, as defined by Various sources. X X
Comments: Comments/exceptions are noted here. Sources are various. X X
Automated Notification: Automated Notification of Changes. X
Free Email Support. X X X
Free Telephone Support. X
Attribute Sources
A list of all attributes of the datasets, along with relevant sources. The field Portal Release Date is the date this information was published on CommonDataHub.
Attribute Source Portal Release Date
ISO 639-1 Language Name ISO 639-1 28-Feb-2008
ISO 639-1 Language Char 2 Code ISO 639-1 28-Feb-2008
ISO 639-2 Language Name (French) ISO 639-2 28-Feb-2008
ISO 639-1 Language Char 2 Code (Upper Case) ISO 639-1 15-Apr-2008
ISO 639-1 Language Name (Upper Case) ISO 639-1 28-Mar-2008
ISO 639-2 Language Name ISO 639-2 28-Feb-2008
ISO 639-2 Language Char 3 Code (Bibliographic) ISO 639-2 28-Feb-2008
ISO 639-2 Language Char 3 Code (Terminological) ISO 639-2 28-Feb-2008
Language Alternate Names Various 28-Feb-2008
Comments CommonDataHub 28-Feb-2008
Details of the sources used:

ISO 639:
http://www.iso.org

Library of Congress:
http://www.loc.gov/standards/

Acronyms
A listing of the acronyms and abbreviations used.

CDH: CommonDataHub
ISO: International Organization for Standardization
 
Home | About Us | Help | Contact Us | FAQ | Data Provider Account | Legal | Privacy Policy | Terms of Use v. 20080223.001