gramps/data/tests/imp_UTF_8_NOBOM_CR.ged

1 line
8.3 KiB
Plaintext
Raw Normal View History

0 HEAD 1 CHAR UTF-8 1 SOUR REGISTERED_SOURCE_NAME 1 GEDC 2 VERS 5.5 2 FORM LINEAGE-LINKED 1 NOTE UTF-8 transmission test. 2 CONT The transmission does NOT start with a byte order mark (BOM) 2 CONT Each line is terminated using carriage return. 2 CONT This GEDCOM transmission contains a charcter set test. It consists 2 CONT of a single family (two parents, many children). The parents are used 2 CONT to test the cyrillic and greek letters. In both 'persons' the 2 CONT BIRT.PLAC tag contains some capital and the DEAT.PLAC tag some 2 CONT small letters of alphabet. 2 CONT The children contain some combined letters and special charcters. 2 CONT The NAME tag of each 'person' is the name of the characters tested 2 CONT within the person. 2 CONT The first children contain some special characters. Here the strings 2 CONT given in BIRT.PLAC and DEAT.PLAC are 'character name (test character), ...' 2 CONT where 'character name'is the name of the character (like 'british pound') 2 CONT and 'test character' is a single byte representing this character 2 CONT in ANSEL. 2 CONT The last children contain some combined characters. The name tag gives 2 CONT the name of the non-spacing character tested within the 'person'. 2 CONT Within the name the hex-values of the non-spacing character is given 2 CONT UNICODE. The DEAT.PLAC tag contains all latin characters which are 2 CONT combined with the non-spacing character tested here and which have 2 CONT a UNICODE code point. The BIRT.PLAC tag contain the same letters 2 CONT without the non-spacing part. 2 CONT Example: One 'person' is named 'ring above'. The BIRT.PLAC 2 CONT tag contains all latin letters which have a UNICODE code point if 2 CONT combined with a ring above. The DEAT.PLAC tag contain the same 2 CONT charcters combined with this ring. 2 CONT Note: Not all charcters can be displayed on all computers. 2 CONT This strongly depends on the installed fonts and codepages. 2 CONT This file based on the following source: 2 CONT www.unicode.org delivered the connection from the code point names 2 CONT to the actual values. Note, that much more UNICODE characters are 2 CONT possible (like the chinese alphabet). 1 SUBM @SUBMITTER@ 1 DATE 20 JAN 1998 0 @SUBMITTER@ SUBM 1 NAME /H. Eichmann/ 1 ADDR email: h.eichmann@@gmx.de 0 @FATHER@ INDI 1 NAME /cyrillic/ 1 BIRT 2 PLAC АБВГДЕЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯ 1 DEAT 2 PLAC абвгдежзийклмнопрстуфхцчшщъыьэюя 1 SEX M 1 FAMS @FAMILY@ 0 @MOTHER@ INDI 1 NAME /greek/ 1 BIRT 2 PLAC ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ 1 DEAT 2 PLAC αβγδεζηθικλμνξοπρςστυφχψω 1 SEX F 1 FAMS @FAMILY@ 0 @CHILD0@ INDI 1 FAMC @FAMILY@ 1 NAME /Special Characters 0/ 1 BIRT 2 PLAC capital L with stroke (Ł), capital O with stroke (Ø), capital D with stroke (Đ), capital thorn (Þ) 1 DEAT 2 PLAC capital AE (Æ), capital ligature OE (Œ), modified prime (ʹ), middle dot (·), music flat sign (♭) 0 @CHILD1@ INDI 1 FAMC @FAMILY@ 1 NAME /Special Characters 1/ 1 BIRT 2 PLAC registered sign (®), plus-minus sign (±), capital O with horn (Ơ), capital U with horn (Ư) 1 DEAT 2 PLAC modifier right half ring (ʾ), modifier left half ring (ʿ), small L with stroke (ł), small O with stroke (ø), small D with stroke (đ) 0 @CHILD2@ INDI 1 FAMC @FAMILY@ 1 NAME /Special Characters 2/ 1 BIRT 2 PLAC small thorn (þ), small AE (æ), small ligature OE (œ), modified double prime (ʺ) 1 DEAT 2 PLAC small dotless i (ı), pound sign (£), small eth (ð), small O with horn (ơ), small U with horn (ư) 0 @CHILD3@ INDI 1 FAMC @FAMILY@ 1 NAME /Special Characters 3/ 1 BIRT 2 PLAC degree sign (°), script small L (), sound recording copyright (℗), copyright sign (©) 1 DEAT 2 PLAC music sharp sign (♯), inverted question mark (¿), inverted exclamation mark (¡), small sharp S (ß) 0 @CHILD4@ INDI 1 FAMC @FAMILY@ 1 NAME code: 0309/HOOK ABOVE/ 1 BIRT 2 PLAC AEIOU,Yaeio,uy 1 DEAT 2 PLAC ẢẺỈỎỦ,Ỷảẻỉỏ,ủỷ 0 @CHILD5@ INDI 1 FAMC @FAMILY@ 1 NAME code: 0300/GRAVE/ 1 BIRT 2 PLAC AEIOU,WY