Package org.apache.jena.riot.web
Class LangTag
java.lang.Object
    org.apache.jena.riot.web.LangTag
public class LangTag extends java.lang.Object
Language tags: support for parsing and canonicalization of case. Grandfathered forms ("i-") are left untouched. Canonicalization does nothing to unsupported or syntactically illegal forms.
-
-
Field Summary
Fields
Modifier and Type    Field           Description
static int           idxExtension    Index of all extensions
static int           idxLanguage     Index of the language part
static int           idxRegion       Index of the region part
static int           idxScript       Index of the script part
static int           idxVariant      Index of the variant part
-
Method Summary
Modifier and Type            Method
    Description
static java.lang.String      canonical(java.lang.String str)
    Canonicalize with the rules of RFC 4646, or RFC 5646 without replacement of preferred form.
static java.lang.String      canonical(java.lang.String[] parts)
    Canonicalize with the rules of RFC 4646: "In this format, all non-initial two-letter subtags are uppercase, all non-initial four-letter subtags are titlecase, and all other subtags are lowercase." In addition, leave extensions unchanged.
static boolean               check(java.lang.String languageTag)
    Validate a language tag with a basic syntax check: [a-zA-Z]+ ('-'[a-zA-Z0-9]+)*
static java.lang.String[]    parse(java.lang.String languageTag)
    Parse a langtag string and return its parts in canonical case.
-
-
-
Field Detail
-
idxLanguage
public static final int idxLanguage
Index of the language part.
See Also: Constant Field Values
-
idxScript
public static final int idxScript
Index of the script part.
See Also: Constant Field Values
-
idxRegion
public static final int idxRegion
Index of the region part.
See Also: Constant Field Values
-
idxVariant
public static final int idxVariant
Index of the variant part.
See Also: Constant Field Values
-
idxExtension
public static final int idxExtension
Index of all extensions.
See Also: Constant Field Values
-
-
Method Detail
-
check
public static boolean check(java.lang.String languageTag)
Validate a language tag with a basic syntax check: [a-zA-Z]+ ('-'[a-zA-Z0-9]+)*
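The grammar quoted for check can be written as a single regular expression. The following is a minimal, standalone sketch of that pattern using only the JDK; the class LangTagCheckSketch and the method looksLikeLangTag are hypothetical names for illustration and are not part of Jena, and the null handling is an assumption.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; not the Jena implementation.
final class LangTagCheckSketch {
    // The grammar from the description: [a-zA-Z]+ ('-'[a-zA-Z0-9]+)*
    private static final Pattern BASIC =
            Pattern.compile("[a-zA-Z]+(-[a-zA-Z0-9]+)*");

    // True if the whole string matches the basic grammar.
    // Treating null as "not a tag" is an assumption of this sketch.
    static boolean looksLikeLangTag(String s) {
        return s != null && BASIC.matcher(s).matches();
    }

    public static void main(String[] args) {
        System.out.println(looksLikeLangTag("en-GB"));   // true
        System.out.println(looksLikeLangTag("en--GB"));  // false: empty subtag
    }
}
```

This is a purely syntactic test: it accepts any hyphen-separated run of alphanumeric subtags that starts with letters, so it does not check subtag lengths or the language subtag registry.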
-
parse
public static java.lang.String[] parse(java.lang.String languageTag)
Parse a langtag string and return its parts in canonical case. See the idx constants for the array positions. A part that is not present is null in the returned array.
Returns: Langtag parts, or null if the input string does not parse as a lang tag.
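To illustrate the shape of the returned array (one slot per part, null where a part is absent), here is a simplified standalone sketch. It is not Jena's parser: the slot numbers 0 to 4 are assumed for illustration (the real positions are whatever the idx constants hold), it does not canonicalize case, and it returns null for shapes it does not cover, such as extended language subtags.

```java
import java.util.Arrays;

// Hypothetical illustration only; not the Jena implementation.
final class LangTagPartsSketch {
    // Assumed slot numbers, standing in for the idx constants.
    static final int LANGUAGE = 0, SCRIPT = 1, REGION = 2, VARIANT = 3, EXTENSION = 4;

    private static final String VARIANT_RE = "[a-zA-Z0-9]{5,8}|[0-9][a-zA-Z0-9]{3}";

    static String[] split(String tag) {
        if (tag == null || !tag.matches("[a-zA-Z]+(-[a-zA-Z0-9]+)*")) return null;
        String[] sub = tag.split("-");
        String[] parts = new String[5];          // all slots start as null
        parts[LANGUAGE] = sub[0];
        int i = 1;
        // Script: four letters.
        if (i < sub.length && sub[i].matches("[a-zA-Z]{4}")) parts[SCRIPT] = sub[i++];
        // Region: two letters or three digits.
        if (i < sub.length && sub[i].matches("[a-zA-Z]{2}|[0-9]{3}")) parts[REGION] = sub[i++];
        // Variants: collected into one hyphen-joined slot.
        StringBuilder variants = new StringBuilder();
        while (i < sub.length && sub[i].matches(VARIANT_RE)) {
            if (variants.length() > 0) variants.append('-');
            variants.append(sub[i++]);
        }
        if (variants.length() > 0) parts[VARIANT] = variants.toString();
        // Everything from the first single-character subtag on is the extension slot.
        if (i < sub.length) {
            if (sub[i].length() != 1) return null;   // shape not covered by this sketch
            parts[EXTENSION] = String.join("-", Arrays.copyOfRange(sub, i, sub.length));
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(split("en-Latn-GB"))); // [en, Latn, GB, null, null]
    }
}
```

A caller would then read a part by slot, for example split("en-GB")[REGION], and test for null before using it.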
-
canonical
public static java.lang.String canonical(java.lang.String str)
Canonicalize with the rules of RFC 4646, or RFC 5646 without replacement of preferred form.
-
canonical
public static java.lang.String canonical(java.lang.String[] parts)
Canonicalize with the rules of RFC 4646: "In this format, all non-initial two-letter subtags are uppercase, all non-initial four-letter subtags are titlecase, and all other subtags are lowercase." In addition, leave extensions unchanged. This is the same as RFC 5646 without replacement of preferred form or consulting the registry.
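The quoted case rule can be sketched as a loop over the hyphen-separated subtags. The following standalone example is an illustration of that rule under stated assumptions, not Jena's code: LangTagCaseSketch and canonicalCase are hypothetical names, the input is assumed to be syntactically valid, and "leave extensions unchanged" is approximated by copying everything from the first single-character subtag onward as-is.

```java
import java.util.Locale;

// Hypothetical illustration only; not the Jena implementation.
final class LangTagCaseSketch {
    static String canonicalCase(String tag) {
        String[] subtags = tag.split("-");
        StringBuilder sb = new StringBuilder();
        boolean inExtension = false;
        for (int i = 0; i < subtags.length; i++) {
            String s = subtags[i];
            if (i > 0) {
                sb.append('-');
                // Assumption: a non-initial one-character subtag starts the extensions.
                if (s.length() == 1) inExtension = true;
            }
            if (!inExtension) {
                s = s.toLowerCase(Locale.ROOT);              // default: lowercase
                if (i > 0 && s.length() == 2) {
                    s = s.toUpperCase(Locale.ROOT);          // non-initial two-letter: uppercase
                } else if (i > 0 && s.length() == 4) {
                    s = Character.toUpperCase(s.charAt(0)) + s.substring(1); // four-letter: titlecase
                }
            }
            sb.append(s);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(canonicalCase("EN-gb"));       // en-GB
        System.out.println(canonicalCase("zh-hant-tw"));  // zh-Hant-TW
    }
}
```

Because the rule depends only on subtag position and length, no registry lookup is needed, which matches the note that preferred forms are not replaced.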
-
-