Title: | Internationalization Data from the 'Unicode CLDR' in Tabular Form |
---|---|
Description: | Up-to-date data from the 'Unicode CLDR Project' (where 'CLDR' stands for 'Common Locale Data Repository') are available here as a series of easy-to-parse datasets. Several functions are provided for extracting key elements from the tabular datasets. |
Authors: | Richard Iannone [aut, cre] |
Maintainer: | Richard Iannone <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.2.0.9000 |
Built: | 2024-11-12 04:10:32 UTC |
Source: | https://github.com/rich-iannone/i18n |
This is a vector of the 305 currency codes that are used in the currencies dataset within the i18n package.
all_currency_codes
all_currency_codes
An object of class character
of length 305.
This is a vector of the 574 locale names that are used throughout the tabular datasets within the i18n package.
all_locales
all_locales
An object of class character
of length 574.
The character_labels
table contains localized data for character labels
across 574 locales. There are 574
rows and the following 3 columns:
locale
(character
)
character_label_patterns
(named list [variable length]
)
character_labels
(named list [variable length]
)
character_labels
character_labels
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 3 columns.
The characters
table contains localized character data across
574 locales. There are 574 rows and
the following 12 columns:
locale
(character
)
exemplar_characters
(character
)
auxiliary
(character
)
index
(character
)
numbers
(character
)
punctuation
(character
)
more_info
(character
)
ellipsis
(named list [length of 6]
)
leninent_scope_general
(named list [length of 9]
)
leninent_scope_date
(named list [length of 2]
)
leninent_scope_number
(named list [length of 3]
)
stricter_scope_number
(named list [length of 2]
)
characters
characters
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 12 columns.
The character_labels table contains information on character patterns
and character labels across 574 locales. The
cldr_character_labels()
function allows one to extract element values from
the table by supplying the locale
and one of the following element names:
"patterns"
: corresponds to the character_label_patterns
column in
character_labels
"labels"
: is the character_labels
column in character_labels
cldr_character_labels(locale = "en", element = c("patterns", "labels"))
cldr_character_labels(locale = "en", element = c("patterns", "labels"))
locale |
The locale ID for which to obtain the data from the
|
element |
The element from which information will be obtained for the
specified |
A named list.
The characters table contains information on the usage of characters
and exemplar character sets across 574 locales. The
cldr_characters()
function allows one to extract element values from the
table by supplying the locale
and one of the following element names:
"exemplar_characters"
"auxiliary"
"index"
"numbers"
"punctuation"
"more_info"
"ellipsis"
"leninent_scope_general"
"leninent_scope_date"
"leninent_scope_number"
"stricter_scope_number"
cldr_characters( locale = "en", element = characters_elements$exemplar_characters )
cldr_characters( locale = "en", element = characters_elements$exemplar_characters )
locale |
The locale ID for which to obtain the data from the characters table. |
element |
The element from which information will be obtained for the
specified |
Either a named list or a length one character vector, depending on
the element
value.
The currencies table contains information of currency codes and localized
display names and symbols across 574 locales. The
cldr_currencies()
function allows one to extract a single element value
from the table by supplying the locale
, the currency code (currency
), and
one of the following element
names:
"currency_symbol"
"currency_symbol_narrow"
"currency_display_name"
"currency_display_name_count_1"
"currency_display_name_count_other"
cldr_currencies( locale = "en", currency = currency_code_list$USD, element = currencies_elements$currency_symbol )
cldr_currencies( locale = "en", currency = currency_code_list$USD, element = currencies_elements$currency_symbol )
locale |
The locale ID for which to obtain the data from the currencies table. |
currency |
The currency code (e.g., |
element |
The element from which information will be obtained for the
specified |
A length one character vector.
If you would like to get the currency display name for the British Pound
("GBP"
) currency while in the "de"
locale, the following invocation of
cldr_currencies()
can be used.
cldr_currencies( locale = "de", currency = currency_code_list$GBP, element = currencies_elements$currency_display_name )
#> [1] "Britisches Pfund"
The dates table contains information on how to express dates and this data
is localized across 574 locales. The cldr_dates()
function allows one to extract a named list using a locale
and a specific
element
. The element values are:
"months_format_abbrev"
"months_format_narrow"
"months_format_wide"
"days_standalone_narrow"
"days_standalone_short"
"days_standalone_wide"
"quarters_format_abbrev"
"quarters_format_narrow"
"quarters_format_wide"
"quarters_standalone_abbrev"
"quarters_standalone_narrow"
"quarters_standalone_wide"
"dayperiods_format_abbrev"
"dayperiods_format_narrow"
"dayperiods_format_wide"
"dayperiods_standalone_abbrev"
"dayperiods_standalone_narrow"
"dayperiods_standalone_wide"
"eras_abbrev"
"eras_names"
"eras_narrow"
"date_formats"
"date_skeletons"
"time_formats"
"time_skeletons"
"date_time_available_formats"
"date_time_append_items"
"date_time_interval_formats"
cldr_dates(locale = "en", element = dates_elements$months_format_abbrev)
cldr_dates(locale = "en", element = dates_elements$months_format_abbrev)
locale |
The locale ID for which to obtain the data from the dates table. |
element |
The element from which information will be obtained for the
specified |
A named list.
The locale_names table contains information on how to express components of
locale codes and this is localized across 574 locales.
The cldr_locale_names()
function allows one to extract a named list using
a locale
and one of the following element names:
"langs"
: corresponds to the lang_names
column in locale_names
"scripts"
: is the script_names
column in locale_names
"territories"
: is territory_names
"variants"
: is variant_names
cldr_locale_names(locale = "en", element = locale_names_elements$lang_names)
cldr_locale_names(locale = "en", element = locale_names_elements$lang_names)
locale |
The locale ID for which to obtain the data from the locale_names table. |
element |
The element from which information will be obtained for the
specified |
A named list.
The numbers table contains localization data for number usage and this data
is available for 574 locales. The cldr_numbers()
function allows one to extract a named list using a locale
and a specific
element
. The element
values are:
"default_numbering_system"
"other_numbering_systems"
"minimum_grouping_digits"
"decimal"
"group"
"list"
"percent_sign"
"plus_sign"
"minus_sign"
"approx_sign"
"exp_sign"
"sup_exp"
"per_mille"
"infinity"
"nan"
"time_sep"
"approx_pattern"
"at_least_pattern"
"at_most_pattern"
"range_pattern"
"decimal_format"
"sci_format"
"percent_format"
"currency_format"
"accounting_format"
cldr_numbers( locale = "en", element = numbers_elements$default_numbering_system )
cldr_numbers( locale = "en", element = numbers_elements$default_numbering_system )
locale |
The locale ID for which to obtain the data from the numbers table. |
element |
The element from which information will be obtained for the
specified |
Either a named list or a length one character vector, depending on
the element
value.
The currencies
table contains localized data for number-related entities
across 574 locales. This table has 173,013 rows, one per
distinct combination of locale and currency (currency_code
), and the
following 7 columns:
locale
(character
)
currency_code
(character
)
currency_display_name
(character
)
currency_symbol
(character
)
currency_symbol_narrow
(character
)
currency_display_name_count_1
(character
)
currency_display_name_count_other
(character
)
currencies
currencies
An object of class tbl_df
(inherits from tbl
, data.frame
) with 175070 rows and 7 columns.
The dates
table contains localized data for constructing dates and times
across 574 locales. There are 574
rows and the following 38 columns:
locale
(character
)
months_format_abbrev
(named list [length of 12]
)
months_format_narrow
(named list [length of 12]
)
months_format_wide
(named list [length of 12]
)
months_standalone_abbrev
(named list [length of 12]
)
months_standalone_narrow
(named list [length of 12]
)
months_standalone_wide
(named list [length of 12]
)
days_format_abbrev
(named list [length of 7]
)
days_format_narrow
(named list [length of 7]
)
days_format_short
(named list [length of 7]
)
days_format_wide
(named list [length of 7]
)
days_standalone_abbrev
(named list [length of 7]
)
days_standalone_narrow
(named list [length of 7]
)
days_standalone_short
(named list [length of 7]
)
days_standalone_wide
(named list [length of 7]
)
quarters_format_abbrev
(named list [length of 4]
)
quarters_format_narrow
(named list [length of 4]
)
quarters_format_wide
(named list [length of 4]
)
quarters_standalone_abbrev
(named list [length of 4]
)
quarters_standalone_narrow
(named list [length of 4]
)
quarters_standalone_wide
(named list [length of 4]
)
dayperiods_format_abbrev
(named list [variable length]
)
dayperiods_format_narrow
(named list [variable length]
)
dayperiods_format_wide
(named list [variable length]
)
dayperiods_standalone_abbrev
(named list [variable length]
)
dayperiods_standalone_narrow
(named list [variable length]
)
dayperiods_standalone_wide
(named list [variable length]
)
eras_abbrev
(named list [length of 4]
)
eras_names
(named list [length of 4]
)
eras_narrow
(named list [length of 4]
)
date_formats
(named list [variable length]
)
date_skeletons
(named list [length of 4]
)
time_formats
(named list [variable length]
)
time_skeletons
(named list [variable length]
)
date_time_patterns
(named list [length of 4]
)
date_time_available_formats
(named list [variable length]
)
date_time_append_items
(named list [length of 11]
)
date_time_interval_formats
(named list [variable length]
)
dates
dates
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 38 columns.
The dates_generic
table contains localized data for constructing dates and
times across 574 locales. There are
574 rows and the following 38
columns:
locale
(character
)
months_format_abbrev
(named list [length of 12]
)
months_format_narrow
(named list [length of 12]
)
months_format_wide
(named list [length of 12]
)
months_standalone_abbrev
(named list [length of 12]
)
months_standalone_narrow
(named list [length of 12]
)
months_standalone_wide
(named list [length of 12]
)
days_format_abbrev
(named list [length of 7]
)
days_format_narrow
(named list [length of 7]
)
days_format_short
(named list [length of 7]
)
days_format_wide
(named list [length of 7]
)
days_standalone_abbrev
(named list [length of 7]
)
days_standalone_narrow
(named list [length of 7]
)
days_standalone_short
(named list [length of 7]
)
days_standalone_wide
(named list [length of 7]
)
quarters_format_abbrev
(named list [length of 4]
)
quarters_format_narrow
(named list [length of 4]
)
quarters_format_wide
(named list [length of 4]
)
quarters_standalone_abbrev
(named list [length of 4]
)
quarters_standalone_narrow
(named list [length of 4]
)
quarters_standalone_wide
(named list [length of 4]
)
dayperiods_format_abbrev
(named list [variable length]
)
dayperiods_format_narrow
(named list [variable length]
)
dayperiods_format_wide
(named list [variable length]
)
dayperiods_standalone_abbrev
(named list [variable length]
)
dayperiods_standalone_narrow
(named list [variable length]
)
dayperiods_standalone_wide
(named list [variable length]
)
eras_abbrev
(named list [length of 4]
)
eras_names
(named list [length of 4]
)
eras_narrow
(named list [length of 4]
)
date_formats
(named list [variable length]
)
date_skeletons
(named list [length of 4]
)
time_formats
(named list [variable length]
)
time_skeletons
(named list [variable length]
)
date_time_patterns
(named list [length of 4]
)
date_time_available_formats
(named list [variable length]
)
date_time_append_items
(named list [length of 11]
)
date_time_interval_formats
(named list [variable length]
)
dates_generic
dates_generic
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 38 columns.
The day_periods
table contains rules for naming periods of time throughout
a day. There are 519 rows that comprise a day period name
and rule for a locale. There may be only two rows associated with a locale
but many more if a locale has many names for periods of a day. The following
columns are included:
locale
(character
)
period
(character
)
from
(character
)
to
(character
)
at
(character
)
The period
value provides an identifier for the period of time. For a given
locale there may typically be "afternoon1"
and "evening1"
period
identifiers. Some may have quite a few periods defined (perhaps with
"morning1"
and "morning2"
rules). A period is either a block of time
defined by the from
and to
columns, or, a set time (like "noon"
and
"midnight"
) found in the at
column. The period
values are typically
obtained from this dataset in order to obtain localized text from
the dates and dates_generic datasets (within the dayperiods_*
columns).
day_periods
day_periods
An object of class tbl_df
(inherits from tbl
, data.frame
) with 519 rows and 5 columns.
This is a table that contains base locale names (e.g., "en"
, "de"
)
alongside their default locale names. This indicates that "en"
maps to
"en-US"
and "de"
should map to "de-DE"
. Throughout the i18n
datasets, base names are used instead of their expanded equivalents.
There are 228 rows and the following 2 columns:
default_locale
(character
)
base_locale
(character
)
The default_locale
column contains the expanded locale names (e.g.,
"en-US"
) that do not normally appear within the CLDR datasets but are
valid aliases for the base locale names (e.g., "en"
) found in the
base_locale
column.
default_locales
default_locales
An object of class tbl_df
(inherits from tbl
, data.frame
) with 228 rows and 2 columns.
The delimiters
table contains localized information on the preferred and
alternate sets of quotation marks across 574 locales.
There are 574 rows and the following 5 columns:
locale
(character
)
quotation_start
(character
)
quotation_end
(character
)
alt_quotation_start
(character
)
alt_quotation_end
(character
)
delimiters
delimiters
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 5 columns.
Several element lists are available for use in the various cldr_*()
functions. The list object names, the number of elements they hold, and the
functions they nicely pair with are:
locale_list currency_code_list locale_names_elements currencies_elements dates_elements numbers_elements characters_elements
locale_list currency_code_list locale_names_elements currencies_elements dates_elements numbers_elements characters_elements
An object of class list
of length 574.
An object of class list
of length 305.
An object of class list
of length 4.
An object of class list
of length 5.
An object of class list
of length 37.
An object of class list
of length 26.
An object of class list
of length 12.
locale_list
(574) -> several cldr_*()
functions
currency_code_list
(305) -> cldr_currency()
currencies_elements
(5) -> cldr_currency()
locale_names_elements
(4) -> cldr_locale_names()
dates_elements
(28) -> cldr_dates()
numbers_elements
(26) -> cldr_numbers()
characters_elements
(12) -> cldr_characters()
The layout
table contains data on text layout across
574 locales. There are 574 rows and
the following 3 columns:
locale
(character
)
character_order
(character
)
line_order
(character
)
layout
layout
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 3 columns.
The locale_names
table contains localized names for languages, script
names, names for territories, and names for variants. There are
574 rows and the following 5 columns:
locale
(character
)
lang_names
(named list [variable length]
)
script_names
(named list [variable length]
)
territory_names
(named list [variable length]
)
variant_names
(named list [variable length]
)
The lang_names
column contains named lists for all localized language
names. The script_names
column holds named lists for all localized script
names, and territory_names
has all of the localized territory names per
locale. The variant_names
list column containing named lists for all
localized variant names.
locale_names
locale_names
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 5 columns.
The num_system_digits
table contains vectors of digits for different
numbering systems. Each vector contains 10 elements (comprising the numbers
0
to 9
) and there are are 53 rows in total. The
following columns are included:
script
(character
)
digits
(list [length of 10]
)
num_system_digits
num_system_digits
An object of class tbl_df
(inherits from tbl
, data.frame
) with 53 rows and 2 columns.
The numbers
table contains localized data for number-related entities
across 574 locales. This table has
574 rows (one per locale) and the following 26 columns:
locale
(character
)
default_numbering_system
(character
)
other_numbering_systems
(named list [variable length]
)
minimum_grouping_digits
(integer
)
decimal
(character
)
group
(character
)
list
(character
)
percent_sign
(character
)
plus_sign"
(character
)
minus_sign
(character
)
approx_sign
(character
)
exp_sign
(character
)
sup_exp
(character
)
per_mille
(character
)
infinity
(character
)
nan
(character
)
time_sep
(character
)
approx_pattern
(character
)
at_least_pattern
(character
)
at_most_pattern
(character
)
range_pattern
(character
)
decimal_format
(character
)
sci_format
(character
)
percent_format
(character
)
currency_format
(character
)
accounting_format
(character
)
The first column, locale
, is the locale name (e.g., "en"
, "de-AT"
,
etc.). The remaining 25 columns will be explained in separate sections.
numbers
numbers
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 26 columns.
The default_numbering_system
(CLDR: 'defaultNumberingSystem'
) column
provides an element that indicates which numbering system should be used for
presentation of numeric quantities in the given locale.
The other_numbering_systems
(CLDR: 'otherNumberingSystems'
) column
provides an element that defines general categories of numbering systems that
are sometimes used in the given locale for formatting numeric quantities.
These additional numbering systems are often used in very specific contexts,
such as in calendars or for financial purposes. There are currently three
defined categories, as follows:
native
Defines the numbering system used for the native digits, usually defined as a
part of the script used to write the language. The native numbering system
can only be a numeric positional decimal-digit numbering system, using digits
with General_Category=Decimal_Number. In locales where the native numbering
system is the default, it is assumed that the numbering system "latn"
(
Western Digits 0
-9
) is always acceptable, and can be selected using the
"-nu"
keyword as part of a Unicode locale identifier.
traditional
Defines the traditional numerals for a locale. This numbering system may be numeric or algorithmic. If the traditional numbering system is not defined, applications should use the native numbering system as a fallback.
finance
Defines the numbering system used for financial quantities. This numbering system may be numeric or algorithmic. This is often used for ideographic languages such as Chinese, where it would be easy to alter an amount represented in the default numbering system simply by adding additional strokes. If the financial numbering system is not specified, applications should use the default numbering system as a fallback.
The categories defined for other numbering systems can be used in a Unicode
locale identifier to select the proper numbering system without having to
know the specific numbering system by name. To select the Hindi language
using the native digits for numeric formatting, use locale ID
"hi-IN-u-nu-native"
. To select the Chinese language using the appropriate
financial numerals, use locale ID: "zh-u-nu-finance"
. With the Tamil
language using the traditional Tamil numerals, use locale ID
"ta-u-nu-traditio"
. As a last example, to select the Arabic language using
western digits 0
-9
, use locale ID "ar-u-nu-latn"
.
The minimum_grouping_digits
(CLDR: 'minimumGroupingDigits'
) value can
be used to suppress groupings below a certain value. This is used for
languages such as Polish, where one would only write the grouping separator
for values above 9999
. The minimum_grouping_digits
value contains the
default for the locale.
Number symbols define the localized symbols that are commonly used when formatting numbers in a given locale. These symbols can be referenced using a number formatting pattern.
The decimal
(CLDR: 'decimal'
) symbol separates the integer and
fractional part of the number. The group
(CLDR: 'group'
) symbol
separates clusters of integer digits to make large numbers more legible;
commonly used for thousands (grouping size 3, e.g. "100,000,000"
) or in
some locales, ten-thousands (grouping size 4, e.g. "1,0000,0000"
). There
may be two different grouping sizes: The primary grouping size used for the
least significant integer group, and the secondary grouping size used for
more significant groups; these are not the same in all locales (e.g.
"12,34,56,789"
). If a pattern contains multiple grouping separators, the
interval between the last one and the end of the integer defines the primary
grouping size, and the interval between the last two defines the secondary
grouping size. All others are ignored, so "#,##,###,####"
==
"###,###,####"
== "##,#,###,####"
.
The list
(CLDR: 'list'
) symbol is used to separate numbers in a list
intended to represent structured data such as an array. It must be different
from the decimal value. This list separator is for non-linguistic usage as
opposed to the list patterns for linguistic lists (e.g. "Bob, Carol, and Ted"
).
The plus_sign
(CLDR: 'plusSign'
) is the preferred symbol for expressing
a positive value and the minus_sign
(CLDR: 'minusSign'
) is for negative
values. It can be used to produce modified patterns, so that "3.12"
is
formatted as "+3.12"
, for example. The standard number patterns (except for
accounting notation) will contain the minus_sign
, explicitly or implicitly.
In the explicit pattern, the value of the plus_sign
can be substituted for
the value of the minus_sign
to produce a pattern that has an explicit plus
sign.
The approx_sign
(CLDR: 'approximatelySign'
) element contains a symbol
used to denote an approximate value. The symbol is substituted in place of
the minus_sign
using the same semantics as plus_sign
substitution.
The exp_sign
(CLDR: 'exponential'
) provides a symbol used for
separating the mantissa and exponent values. The exponential notation in
sup_exp
(CLDR: 'superscriptingExponent'
) could alternatively be used to
show a format like "1.23 × 104"
. The superscripting can use markup, such as
<sup>4</sup>
in HTML, or for the special case of Latin digits, use
superscripted numeral characters.
The percent_sign
(CLDR: 'percentSign'
) is a symbol used to indicate a
percentage (1/100th) amount. If present, the value might require
multiplication by 100
before formatting. The per_mille
(CLDR:
'perMille'
) symbol used to indicate a per mille (1/1000th) amount. If
present, the value might need to be multiplied by 1000
before formatting.
The infinity sign is provided in the infinity
(CLDR: 'infinity'
)
element. The nan
element (CLDR: 'nan'
) has the NaN (not a number) sign.
These elements both correspond to the IEEE bit patterns for infinity and NaN.
The time_sep
(CLDR: 'timeSeparator'
) pattern allows the same time
format to be used for multiple number systems when the time separator depends
on the number system. For example, the time format for Arabic should be a
colon when using the Latin numbering system, but when the Arabic numbering
system is used, the traditional time separator in older print styles was
often Arabic comma.
There are several miscellaneous patterns for special purposes. The
approx_pattern
(CLDR: 'approximately'
) indicates an approximate number,
such as: "~99"
. With the pattern called at_most_pattern
(CLDR:
'atMost'
) we can describe an upper limit. This indicates that, for example,
there are 99
items or fewer. The at_least_pattern
(CLDR: 'atLeast'
)
describes a lower limit. This might be "99+"
to indicate that there are
99
items or more. With the range_pattern
(CLDR: 'range'
), a range of
numbers, such as "99–103"
, can be used to indicate that there are from 99
to 103
items.
Number formats are used to define the rules for formatting numeric
quantities. Different formats are provided for different contexts. The
decimal_format
(CLDR: 'decimalFormats'
) is the prescribed
locale-specific way to write a base 10 number. Variations of the
decimal_format
pattern are provided that allow compact number formatting.
The percent_format
(CLDR: 'percentFormats'
) is the pattern to use for
percentage formatting. The pattern for use with scientific (exponent)
formatting is provided as sci_format
(CLDR: 'scientificFormats'
). The
pattern for use with currency formatting is found in currency_format
(CLDR: 'currencyFormats'
). This format contains a few additional
structural options that allow proper placement of the currency symbol
relative to the numeric quantity. The accounting_format
(CLDR:
'accountingFormats'
) pattern is to be used to generate accounting-style
formatting.
The script_metadata
table contains metadata for various script types.
There are 170 rows and the following 11 columns:
script
(character
)
sample_char
(character
)
rank
(integer
)
script
(character
)
rtl
(character
)
lb_letters
(character
)
has_case
(character
)
shaping_req
(character
)
ime
(character
)
density
(integer
)
origin_country
(character
)
likely_lang
(character
)
script_metadata
script_metadata
An object of class tbl_df
(inherits from tbl
, data.frame
) with 170 rows and 11 columns.
The start_of_week
table contains the day names for the start of the week
(e.g., "sun"
, "mon"
, etc.) for a given territory (which is typically a
2-letter country code). The following columns are included:
territory
(character
)
day_of_week
(character
)
start_of_week
start_of_week
An object of class tbl_df
(inherits from tbl
, data.frame
) with 151 rows and 2 columns.
The tz_bcp_id
table provides a lookup for converting between BCP47
Olson/IANA-style time zone IDs and the canonical forms (according to BCP47).
There are 593 rows and the following 3 columns:
tz_bcp_id
(character
)
tz_canonical
(character
)
description
(character
)
tz_bcp_id
tz_bcp_id
An object of class tbl_df
(inherits from tbl
, data.frame
) with 593 rows and 3 columns.
The tz_exemplar
table contains localized names for all exemplar cities used
in time zone names. There are 574 rows and a column for
each exemplar city name (comprising 442 columns; the
locale
column is first). To have syntactical column names, all slashes in
exemplar city names are instead represented with period characters (e.g.,
Indiana/Vincennes
is Indiana.Vincennes
). Some exemplar cities are not
actually cities and these are: UTC.long.standard
(en: "Coordinated Universal Time"
), UTC.short.standard
(en: "UTC"
),
and Unknown
(en: "Unknown City"
).
tz_exemplar
tz_exemplar
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 443 columns.
The tz_formats
table contains localized formatting information across all
locales. There are 574 rows and the following 7 columns:
locale
(character
)
hour_format
(character
)
gmt_format
(character
)
gmt_zero_format
(character
)
region_format
(character
)
region_format_daylight
(character
)
region_format_standard
(character
)
region_format_fallback
(character
)
tz_formats
tz_formats
An object of class tbl_df
(inherits from tbl
, data.frame
) with 574 rows and 8 columns.
The tz_map
table contains names for all map-based time zone names. There
are 598 rows and the following 4 columns:
canonical_tz_name
(character
)
territory
(character
)
tz_name
(character
)
tz_map
tz_map
An object of class tbl_df
(inherits from tbl
, data.frame
) with 598 rows and 3 columns.
The tz_metazone_names
table contains localized time zone names for all
metazones (e.g., America/Eastern
). There can be a variety of time zone
names, comprising long and short forms (e.g., Eastern Time
and ET
) and
this is further segmented by generic, standard, and daylight forms (an
example, using short forms, is ET
, EST
, and EDT
). There are
465 rows and a column for
each metazone (comprising 159 columns; the
locale
column is first).
tz_metazone_names
tz_metazone_names
An object of class tbl_df
(inherits from tbl
, data.frame
) with 465 rows and 160 columns.
The tz_metazone_users
table allows for a lookup of canonical time zone
name to which metazone each uses. As an example, the canonical time zone
America/Vancouver
corresponds to the America_Pacific
metazone (this is
the long ID, but there is often a short ID available as well). The
metazone_long_id
can be used to get a localized metazone name by use of
the tz_metazone_names table.
There are 293 rows and the following 4 columns:
canonical_tz_name
(character
)
territory
(character
)
metazone_long_id
(character
)
metazone_short_id
(character
)
tz_metazone_users
tz_metazone_users
An object of class tbl_df
(inherits from tbl
, data.frame
) with 293 rows and 4 columns.
The units
table contains localized character data across
574 locales. There are 1722 rows and
1281 columns. Each row represents a display type ("long"
,
"short"
, or "narrow"
) for each of the locales.
Following the locale
and type
columns, each unit and its subelements
are provided as a cluster of columns in the form
"<<category>-unit name>.<subelement name>"
. The subelement names are:
"displayName"
"unitPattern-count-one"
"unitPattern-count-other"
"unitPattern-count-zero"
"unitPattern-count-two"
"unitPattern-count-few"
"unitPattern-count-many"
The "displayName"
is the localized name for a unit when displayed outside
of a pattern. The "unitPattern-count-*"
subelements provide the localized
forms of the unit when the value is exactly 0
("unitPattern-count-one"
),
1
("unitPattern-count-one"
), 2
("unitPattern-count-two"
), and, when
the value constitutes a few ("unitPattern-count-few"
) or many
("unitPattern-count-many"
) units. Every other case is handled by
"unitPattern-count-other"
.
units
units
An object of class tbl_df
(inherits from tbl
, data.frame
) with 1722 rows and 1281 columns.