1 line
8.3 KiB
Plaintext
1 line
8.3 KiB
Plaintext
|
0 HEAD
1 CHAR UTF-8
1 SOUR REGISTERED_SOURCE_NAME
1 GEDC
2 VERS 5.5
2 FORM LINEAGE-LINKED
1 NOTE UTF-8 transmission test.
2 CONT The transmission does NOT start with a byte order mark (BOM)
2 CONT Each line is terminated using carriage return.
2 CONT This GEDCOM transmission contains a charcter set test. It consists
2 CONT of a single family (two parents, many children). The parents are used
2 CONT to test the cyrillic and greek letters. In both 'persons' the
2 CONT BIRT.PLAC tag contains some capital and the DEAT.PLAC tag some
2 CONT small letters of alphabet.
2 CONT The children contain some combined letters and special charcters.
2 CONT The NAME tag of each 'person' is the name of the characters tested
2 CONT within the person.
2 CONT The first children contain some special characters. Here the strings
2 CONT given in BIRT.PLAC and DEAT.PLAC are 'character name (test character), ...'
2 CONT where 'character name'is the name of the character (like 'british pound')
2 CONT and 'test character' is a single byte representing this character
2 CONT in ANSEL.
2 CONT The last children contain some combined characters. The name tag gives
2 CONT the name of the non-spacing character tested within the 'person'.
2 CONT Within the name the hex-values of the non-spacing character is given
2 CONT UNICODE. The DEAT.PLAC tag contains all latin characters which are
2 CONT combined with the non-spacing character tested here and which have
2 CONT a UNICODE code point. The BIRT.PLAC tag contain the same letters
2 CONT without the non-spacing part.
2 CONT Example: One 'person' is named 'ring above'. The BIRT.PLAC
2 CONT tag contains all latin letters which have a UNICODE code point if
2 CONT combined with a ring above. The DEAT.PLAC tag contain the same
2 CONT charcters combined with this ring.
2 CONT Note: Not all charcters can be displayed on all computers.
2 CONT This strongly depends on the installed fonts and codepages.
2 CONT This file based on the following source:
2 CONT www.unicode.org delivered the connection from the code point names
2 CONT to the actual values. Note, that much more UNICODE characters are
2 CONT possible (like the chinese alphabet).
1 SUBM @SUBMITTER@
1 DATE 20 JAN 1998
0 @SUBMITTER@ SUBM
1 NAME /H. Eichmann/
1 ADDR email: h.eichmann@@gmx.de
0 @FATHER@ INDI
1 NAME /cyrillic/
1 BIRT
2 PLAC АБВГДЕЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯ
1 DEAT
2 PLAC абвгдежзийклмнопрстуфхцчшщъыьэюя
1 SEX M
1 FAMS @FAMILY@
0 @MOTHER@ INDI
1 NAME /greek/
1 BIRT
2 PLAC ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
1 DEAT
2 PLAC αβγδεζηθικλμνξοπρςστυφχψω
1 SEX F
1 FAMS @FAMILY@
0 @CHILD0@ INDI
1 FAMC @FAMILY@
1 NAME /Special Characters 0/
1 BIRT
2 PLAC capital L with stroke (Ł), capital O with stroke (Ø), capital D with stroke (Đ), capital thorn (Þ)
1 DEAT
2 PLAC capital AE (Æ), capital ligature OE (Œ), modified prime (ʹ), middle dot (·), music flat sign (♭)
0 @CHILD1@ INDI
1 FAMC @FAMILY@
1 NAME /Special Characters 1/
1 BIRT
2 PLAC registered sign (®), plus-minus sign (±), capital O with horn (Ơ), capital U with horn (Ư)
1 DEAT
2 PLAC modifier right half ring (ʾ), modifier left half ring (ʿ), small L with stroke (ł), small O with stroke (ø), small D with stroke (đ)
0 @CHILD2@ INDI
1 FAMC @FAMILY@
1 NAME /Special Characters 2/
1 BIRT
2 PLAC small thorn (þ), small AE (æ), small ligature OE (œ), modified double prime (ʺ)
1 DEAT
2 PLAC small dotless i (ı), pound sign (£), small eth (ð), small O with horn (ơ), small U with horn (ư)
0 @CHILD3@ INDI
1 FAMC @FAMILY@
1 NAME /Special Characters 3/
1 BIRT
2 PLAC degree sign (°), script small L (ℓ), sound recording copyright (℗), copyright sign (©)
1 DEAT
2 PLAC music sharp sign (♯), inverted question mark (¿), inverted exclamation mark (¡), small sharp S (ß)
0 @CHILD4@ INDI
1 FAMC @FAMILY@
1 NAME code: 0309/HOOK ABOVE/
1 BIRT
2 PLAC AEIOU,Yaeio,uy
1 DEAT
2 PLAC ẢẺỈỎỦ,Ỷảẻỉỏ,ủỷ
0 @CHILD5@ INDI
1 FAMC @FAMILY@
1 NAME code: 0300/GRAVE/
1 BIRT
2 PLAC AEIOU,WY
|