-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extending SPLAC to include ADDR and PLAC? #536
Comments
In 40 plus years of using GEDCOM I've never used the Address_Structure! As far as I'm concerned it is useless! But to each their own! If I need to include an address for a place it is just another entry in the PLAC text. Just like if I need to add a grave marker (because I know the exact location of that marker) I put either the word "Grave" or if I know whatever the cemetery uses to locate the grave (i.e. row, section, columbarium number, "At Sea", etc.) As far as items that are bigger than a point (Y/X coordinates) we should also have a polygon option in a "PLAC" node. I also want a TEXT tag that can use a markup/markdown language option to create a formated description of the place being saved! If I was part of the SPLAC committee these are things I would add, but I'm not!! |
Discussion in GEDCOM Steering Committee 20 AUG 2024:
|
A somewhat related topic: in my opinion, |
In my tree, since around 1850 or so, there were a kind of "housecards" used. Those cards were for 1 specific house in some suburb or street (depending on the way they did addressing) On those cards is a lot of important info, names, relations of people living there, dates of birth, when they arrived and left and where they came from or went to. So yes I have addresses in my tree, but only from that time and later. So there should be some way of expressing that. But like Luther said, thats more an EXID thing, street (suburb) and housenumber. All the rest can go in SPLAC's. Further, as I already mentioned, in the forums of the program I use, people asked to have the leftmost of the PLAC jurisdictions to be an address. Probably for the same reason. So a postal address is not just for today, for lawyers and such. |
Tineke, If I’m reading you correctly, these “housecards” sound like Source_Records and therefore assert names, relations, birth dates, residence and other “facts”. Just like a census that also asserts facts, I would create a Source_Record for the census (i.e. 1920 US Census) with a date of the enumeration and associate it with all of the facts it asserts including the Residence, I would do the same with your “housecard”. In both cases I add the house and street number to the PLAC tag. In my application we have extended the application to have additional information about each item in the comma delimited PLAC payload that provides detail about the item/location including images and text/history. Meaning I can view information about the division, address, country etc. of any place in my database! |
I understand what you say. But yes, also a SOUR. |
@Norwegian-Sardines: 2 PLAC myhouse, myvillage, mycounty, mystate, mycountry
3 FORM building, city, county, state, country
2 ADDR mystreet myhousenumber
3 CONT zip mymunicipality The PLAC hierarchy is NOT the same as the ADDR hierarchy. I cannot put the ADDR as part of PLAC, as PLAC has no zip code, and as PLAC has the jurisdiction "city", which the address does not have, and municipality is used by the postal address, but not part of PLAC. This is not only theory, in my own postal address you will not find the city, but the municipality. And in the PLAC I do not show the municipality, but the city.. myhouse can be a farm name, or something like that. It refers to the same location as the address does, however the address is like an EXID (see Luther's comment), which - to make it more complicated, may vary from time to time. Some hundreds years ago little villages had no street names, only house numbers. Later the street names were introduced, together with new house numbers. Then the postal code was introduced, however that system has been modified twice since then. So if we add ADDR to the SPLAC records, we will have an own substructure for the address in the SPLAC record. And that substructure must have a DATE substructure to show the time dependency. |
O and the extended information you mention, will now go in the different SPLAC's, that is if they chooses SPLAC not inside PLAC. |
In v5.5.1 GEDCOM, without enough fields to capture enough information about an “artifact source” what I’ve done for documents that were created (like a legal document), the author (AUTH) is the lawyer/writer/scribe of the document and any “location based” information would follow the suggestion for unpublished work suggesting that we use the Source_Publication_Facts payload and input the city, state of the writer of the document! Maybe Source_Record needs a template for other data just like I proposed from creating a Citation_Record! |
Albert,
I don’t understand the difference between a city and a municipality. In Norway, people who live on a farm live in a Kommune not a city, I put the Kommune name rather than a city in the PLAC hierarchy. In the US we have rural townships, I use these when locating a farm, because city does not make sense! I agree that you need to have an address and zip code, but what are addresses and zipcodes for? Mailing letters! Personally I think GEDCOM should not be used to create mailing labels, but to each their own! |
On the other hand, if you did want to use GEDCOM to maintain an address book, I think putting the Address_Structure inside the Place_Structure is incorrect! I’d rather see it used as a stand alone Fact with a date range rather than as a subtag of all facts, having the zip code and a mailing label address layout for facts like |
In my region (Germany) we have: In some parts of Germany we have another administration level in between municipality and county. By the given hierarchy a city is administrated by a municipality. The postal address in most cases is build using the municipality, not the city (sometimes the names of these levels are the same, then you do not see the difference). And the postal address is build by using street name and house number, not the farm name. The farm name is part of PLAC, a different street name with house number (pointing to this farm) is not part of PLAC. What are the addresses for? To store this information as it is given by several sources. Like civil birth and death records, books with all addresses of people living in a city in a defined year (like census), letters written from sender and his address to receiver and his address, contracts of two parties identified by name and address. Often the address is a very important information to identify the individual: Someone with same name is living in the same village at the same time. Zip codes are very important to identify municipalities (we have a lot of the them with same name, however in different counties or states). If you do not document the addresses, you do not need the ADDR. Your decision. If you do document, you need it. And GEDCOM is to transfer all data we collect in our genealogical research. |
Hi all, Chosing the right name for something can help tremendously in understanding new things. Should the topmost step be "WORLD" because then we can link events on the Atlantic and such, directly to JURIS "WORLD" and we would also have a correct PLAC according to the definition. Now the "mailing" Addresses. They give confusion, thats why I started by saying we should leave out CITY, STAE, POST and CTRY. The other type of address, as I mentioned on the "housecards", they have a street and a housenumber, or, as Albert said, in older times they were just housenumbers in a small village. Maybe we should better call those "Positions". They are not used to send a letter, only to position an event on a map. (Hence I said they all might have a MAP structure) It can very well be we cannot really locate those "address"-JURIS's (anymore) because we have no idea where that street and housenumber might have been. They could be destroyed for some reason (flood, bombing, anything) or might have disappeared because new houses were build there. Maybe we dont know the smallest JURIS, and we only know the City, then that is the lowest step of the staircase. As @Norwegian-Sardines and others said, the way of addressing a "position" varies much accross countries and even inside one country. The sequence of the steps is not the same everywhere. JURIS should have the possibility (as Norwegian-Sardines mentioned) to store extra information to describe a city or a state or more. Adding pictures should also be possible. I would like to add 1 extra TAG here. |
I'm still going through all of what you wrote, so I'll answer smaller bits as I find them!
For me (and maybe this is just my interpretation) a "Place" does not stop with "jurisdictions"! If I have a In addition, a place can have multiple parent places. Let's take an example: The city of Gdansk in what is not Poland was for a time part of the Prussian Empire and before that was officially in the "Kingdom of Poland". Individuals born in various parts of Poland during times of annexation and control could have birth certificates stating Prussia. The hierarchy for Gdansk should include all possible parent jurisdictions with From/To dates. My own grandfather was born in "The Hungarian Empire" the town has changed names but it is now in Serbia! We need alternate names with date-range (from/to)! One other reason I want to have separate We could name |
Hi Tinneke, The _LOC record has a broad set of subtags. You find it in http://genealogy.net/GEDCOM/GEDCOM551%20GEDCOM-L%20Addendum-R2.pdf on page 13, defined as extension to 5.5.1 (it works with 7.0, too) |
Albert, I don’t see any difference between the word “place” (a particular position or point in space) and “location” (a particular place or position) Oxford Dictionary. So the only reason I use the tag |
Other software has an extension that extends the |
Thanks for all input. Now we wait what the committee decides. |
Discussed in steering committee
There are multiple open issues here that deserve additional thought, design, and discussion. Some of that will happen in the open meeting on SPLAC, announced in #538, though this is a big enough topic that the conversation will probably extend beyond that meeting too. |
If the <<Address_Structure>> is to be maintained as subtags of PLAC and Shared_Place_Records I think we should revisit the design of the <<Address_Structure>>. Currently the <<Address_Structure>> has the following definition:
Which for the most part is only valuable when either viewed by a human or used when creating a mailing label. The individual data points within the structure can not be parsed into meaningful parts and is of little value beyond what we could put into a The other tags found in the <<Address_Structure>> {ADR1, ADR2, ADR3, CITY, STAE, POST, CTRY} (as noted in the GEDCOM Specification) are best used by systems that may "have structured their addresses for indexing and sorting." and are subject to deprecation. This may be a good time (v7.1 GEDCOM) to actually deprecate these subtags. We could also then instead of having a "Structure" just implement the |
@Norwegian-Sardines |
In advance of the upcoming meeting I'd like to express some thoughts for discussion on the topic of the These thoughts are not about a specific solution (structure or design), but instead some of the elements I'd like to see in the solution.
This list can be discussed, revised and augmented and can be used as a starting point for any design ideas. Thanks |
I would like to add:
Adding to 6.: for bigger elements not only coordinates, but the area |
I think addition number 12 references design of the solution rather than a desire! A solution may not include "higher levels" as indicated in previous discussions! But this can be discussed in detail tomorrow. |
I agree "pointer" is part of a solution. Desire is to document all hierarchical connections in between places on different levels at any time. |
Maybe the <<Address_Structure>> as @Norwegian-Sardines said, should be an ADDR with CONT and CONC indeed. And the Addressing label type, should not that have the possibility to have a Barcode added? |
What I do for Addresses of places is: 2 PLAC ChurchRoad 12, My Town, My State, My Country I don’t need an <<Address_Structure>> for this case. |
Some comments from a genealogy software provider ...
1 BIRT 1 BURI One more hierarchary level. All possible, but makes it not easier, because not needed in every case.
That's why I am not a big fan of these hierarchy in PLAC tags and prefer the town/city on first place in PLAC tag (What is your birth place? ChurchRoad 12!). Conclusion: Dirk (www.ahnenblatt.com) |
My birth place:
What does this mean? If it is not needed don’t put it in! If I did not know the hospital name,
In my software two things are output:
Since GEDCOM v5.5.1 does not have a shorter (abbreviated) value the whole string is output on the screen and/or report. Simple! NOTE: My proposal would be to include an abbreviated place name! |
If it has to be there then include it, but in the USA a town could have dozens of zip codes and it provides no extra value locating the town on the map. In another country the zip code may be assigned to a town and used to differentiate two towns with the same name such as, “MyTown 1234, Country” from “MyTown 5678, Country”. |
The meaning of each part between the commas should be defined in HEAD.PLAC.FORM. In your case ... 1 PLAC ... where "City Center Hospital" is the Place name. About street numbers and zip code: I personally would prefer to have only City as PLAC value and more details in _LOC/SPLAC record (i.e. State, Country) and <ADDRESS_STRUCTURE> (or SADDR record - i.e. name of hospital or cemetery). Dirk |
I don’t use Place Statistics start with the highest order (country), all levels should be consistent as you go down. I see no problem! If I need to know the number of people that had an event in a city that city is described once and reused! |
I personally think the Address_Structure as defined in the GEDCOM v5.5.1 specification has no value in a Genealogical Database.
This structure (as described above) is best used to send letters and mail to the individual. First, a large percentage of individual in most genealogical databases are dead so they will not be getting mail from me! Second, Using the Address_Structure for anything other than a person's residence does not make sense, why send a letter to their church, hospital, or the place they lived in a census 50 years ago? Third, I am not a fan of having address information for the living in a GEDCOM that can be shared to and used by others. Privacy for the living is important and we have no way to prevent a However, I can understand if you want to record an address for an individual's residence. I think it should be record in the NOTE:
I am interesting in zip-code if it defines a place uniquely where using just the city or region does not. |
If you do not use PLAC.FORM, how do you know what place this PLAC: 2 PLAC Kansas, USA
3 EXID KANSASEM33MU
4 TYPE http://gov.genealogy.net/ So it is the city in Clark County in the Arkansas state. With the hierarchical structure of _LOC records as defined by GEDCOM-L group no problem: |
I don't really care what type of place this is! It is the place where an event happened. If I take the text "Kansas, USA" and put it into Google Maps it will find the State and it will show on a map. If you transmitted "Chicago, USA" or "Chicago, Illinois" it would find it too! If I transmitted "Kansas City, USA" it would need more clarification, Missouri or Kansas? So you should be transmitting: This is why it is imperative that a unique name be transmitted that can be found on a map! How do you know what/where is with the |
Zip code is one of the most safe ways to identify a place, where the same name is used for several places. So it is one of the easiest ways for a safe description of places. I have seen a lot of GEDCOM files in the wild using the zip code within the PLAC payload of 5.5 / 5.5.1, like |
Oh, and by the way. For my clients and family in Norway I would actually transmit or display.
Just Like I would enter: or
or
|
If I used a well managed, maintained and curated offsite database then these values would have a more disciplined naming structure and therefore also not need |
I agree to a point. In my town of 47000 people we have 9 different zip codes, which one do I use? |
Personally in your case were each town has only one zip code you could enter the following:
But in really in my example the county without the zip code still makes the place unique. NOTE: I removed the comma between the town and the zip code because they go together to make a unique place and should not violate the GEDCOM Standard for creating a non administrative level. |
Now I have read the complete discussion here and would agree to all of @mother10 's proposals. As I understand PLAC is going down just to the city and gets an SPLAC record for more detailed information. ADDR is in my view not just a postal address, but a more detailed description of the place, building or location. This will get BLDNG records to avoid redundancies and add more details like coordinates. What I also like is the idea of a new tag CITYNAME. I would wish that it is not necessary, because PLAC has already only the value which is presented to the user (like CITYNAME purpose). But there are so many GEDCOMs around which have these comma-formatted PLAC values. About PLAC, FORM and commas: In GEDCOM 5.3 there are several samples ... No pre-defined jurisdictions? Should all terms be written in lower case or does the spelling not matter? Do these terms always have to be in English? What about the use of additional jurisdictions? Each software can invent "own" jurisdiction terms (like "building name" or "street with number"), which in the end only exist in this software. And there is software which doesn't use FORM tags, even if users enter these comma-separated PLAC jurisdictions (on their own risk). This would be no problem if software handles this comma-thing internally and presents users only the name of the city. But to keep the right hierarchy levels of a PLAC value is in most cases the duty of the user. I never came across software that is checking numbers of commas in all PLAC values and gives warnings. What about merging/adding/importing GEDCOM files? Will FORM values and PLAC values/hierarchies (number of commas) be changed? It should be checked - but in reality it is not. Every software has built its own strategy how to deal with these comma-separated values (including "empty" commas) when generating outputs. In my eyes, with introducing SPLAC, the PLAC/FORM comma-separated hierarchy structure should be declared dead. There should be no inforcement in the GEDCOM documentation to still do it. It should be replaced by new SPLAC + BLDNG records (final name of BLDNG record can be discussed - I would suggest LOC). Dirk |
Hi, it took a while. But I finally managed to get my draft pullrequest up there: #552 . It has an extra Examples page to see the proposal "working". After I had worked out the PERIODS a bit, I also wanted to see how things might look with [GEDCOM-L's _LOC extension] inside. The problem about a town in Serbia, "moving" over different parents, was really huge, so I made a scheme first before I tried to put that in Maybe start with the example page before diving into the specifications. Have fun! |
Meanwhile I've put together a design document for an alternative SPLAC (Shared_Place_Record) that does not use GEDCOM-L type hierarchy due to the problems pointed out here regarding:
Which would require every GEDCOM to include the same historic geographic database. This historic geographic database ought to be part of a genealogy application - not included in every genealogy file. Unfortunately I'm not fully acquainted with GITHUB PR, check out and the like so I've created what I normally do when I propose an update to a document or application: This is my first blush at the topic, so I may revise it as I get input from others, or see flaws in the document. |
Maybe I should clarify a bit about that huge Serbia-Subotica example in my PR: That I entered all that info, does NOT mean a user should dig up all that information. That in my examples I wrote out a lot coming from that scheme, is just to give an idea how it will look if there are events during that whole period. And the only things about a place and its parent(s) that will be in the GEDCOM is statistical info about the location, the parent, its name and such. The GEDCOM will not contain an historical treatise about how that place was conquered defended and all that kind of thing. So certainly not the whole historybook about that place. Just the basic facts needed to define it in that time period. To help clarify why and how someone in our GEDCOM did and lived like he did in that time. Thats what I think. About not being able to put up a PR: But I would really really have liked a simple step by step of how to do a PR. There is lots of info in the Github docs, I read it many times, but it still was very difficult, and I got lots of complaints by the Github system, refusing my uploads. |
@Norwegian-Sardines: Thank you for providing your proposal. For me the hierarchical relations in between the SPLAC records are essential, and they are missing in the proposal. By this the proposal is requiring to put in the hierarchy by list into every record by SPLAC.PLAC. What about Next point: In your proposal the offical name of a location is missing. In many cases it is not in the source of the event (as there we often find additional data to identify the location, but not the hierarchical list you are using in your SPLAC record). The name in the example is "Tetendorf" (given in official registries of places in Lower Saxony), in the genealogical source it may be given as "Tetendorf bei Soltau" (which you put in the calling record, same in my application), but nowhere you have Tetendorf, Heidekreis, Niedersachsen, Deutschland or any thing like that in the sources. That is an artificial construction which is not needed anywhere. Using the hierarchical structure of GEDCOM-L location records, my application can find all people with any event (or other search criteria as name, occupation, year of birth) in the county of Heidekreis. The application does this by checking the record county Heidekreis, and all "sub"records linked to Heidekreis in any generation. I will implement an extension to build the hierarchical structure of location records as long as GEDCOM standard does not offer it itself to make sure this functionality is covered by GEDCOM transmission. You argument, the GEDCOM-L solution implies that all applications / genealogies must implement the same data of places, does not work for me. I have the external link to GOV (gov.genealogy.net), and when importing an additional GEDCOM file my application automatically merges location records with same ID in GOV. So it is open to every researcher, which data he puts in the record. GOV offers a webservice, so my application can optionally download the data from GOV. As this is a very time consuming process, it is much faster to transfer the downloaded information by GEDCOM than to download the same information again and again. Especially looking for all people in a county would not run if the hierarchical structure within the county had to be downloaded first in this call. |
I waited a while to respond to Albert in case a nonpartisan question was ask or observation was made. Sorry for the lengthy reply! Albert: For me the hierarchical relations in between the SPLAC records are essential, and they are missing in the proposal. By this the proposal is requiring to put in the hierarchy by list into every record by SPLAC.PLAC Replay: First, why is it “essential” for the transmitting application to provide a hierarchy of superior entities as separate set of record instances in the GEDCOM? Second, when an application user identifies a place in their genealogy, they most likely will not know or have access to a correct list of entities in the place hierarchy. Their software may or may not include a mechanism that provides a hierarchy of superior entities, if they have a list, that list may be inaccurate, not reliable for the time of the event or match the list provided by GOV. It is not possible for all GEDCOMs to provide a complete list of superior place entities and while many applications try to create the illusion of completeness, they are not 100% complete. GOV itself has missing levels in many of its place entities and has many missing historical and modern-day combinations. In the proposal a 100% complete “list” is not required at all, it does not have to have all levels of hierarchy. The “list” only needs to contain enough information to uniquely define the “Place of the Event” so that it can be found on a map or in person. Albert: Let us take the county of nowadays Heidekreis in Lower Saxony, Germany. Until 2011 it was named "Landkreis Soltau-Fallingbostel". And in 1977 it was established by merging the former counties of Soltau and Fallingbostel. Reply: An application user would see a Source Artifact stating that an event occurred in Tetendorf, Soltau, Deutschland and enter it in the .PLAC tag. At some later date they could create an SPLAC record that has a SPLAC.PLAC entry of Tetendorf, Heidekreis, Lower Saxony, Germany. The application user probably does not care about the intermittent names or understand the dynamics of the history of German States and Districts, just the current name. If they were interacting with German speakers, they may want to include a German spelling of the place, but it is not required. The transmission would contain:
Albert: What about 2 PLAC Soltau, Niedersachsen, Deutschland?? Here it is not clear, whether it is the city of Soltau (we have only one city "Soltau" in Lower Saxony/Niedersachsen, or whether it is the county in Lower Saxony/Niedersachsen. What is missing? The type of the location: TYPE. GEDCOM-L allows to classify the records by its type. And this includes church structures, so you can document, in which parish your village is, of which churches are operating in a city. Reply: If Soltau is unique within Niedersachsen then why do I as the transmitter care if the entity is a city, farm, county? I can enter Soltau, Niedersachsen, Deutschland into any search engine and it will return the German town that is Soltau! For that matter it looks like Soltau is unique within Germany as well. A Germany novice may only know a minimal amount about the various entities in Germany and can’t know that Soltau is a city, farm or other and can’t be required to enter the complete hierarchy, whether that hierarchy is a list or a set of hierarchical records! I’ve never seen a major application have an input field for the PLAC.FORM information. I’ve also asked around to people who have used other software that I have not used (they have listed several dozen). None of them indicate the use of or possible entry of the PLAC.FORM payload. Additionally, the FORM payload has no defined set of values, or languages to use. A term used in one country means something different in another country. The use of a TYPE value is fine for the GOV database and is helpful for a GOV subscriber. It is useless for individuals that don’t subscribe to GOV, speak languages not found in GOV, or have entities they what to include in their transmission not described in the GOV database. This concept should be contained as part of GOV (as a source of information) not in a GEDCOM transmission. Albert: Next point: In your proposal the offical name of a location is missing. In many cases it is not in the source of the event (as there we often find additional data to identify the location, but not the hierarchical list you are using in your SPLAC record). The name in the example is "Tetendorf" (given in official registries of places in Lower Saxony), in the genealogical source it may be given as "Tetendorf bei Soltau" (which you put in the calling record, same in my application), but nowhere you have Tetendorf, Heidekreis, Niedersachsen, Deutschland or any thing like that in the sources. That is an artificial construction which is not needed anywhere. Reply: The GEDCOM should not be responsible for transmitting the official name unless that is either part of the “Source Artifact”, or the creator of the transmission enters it in the SPLAC when they create the record. The transmission does not require (nor do most applications) or care about the official name of the entity. Most GEDCOMs I have received Do not Spell out “United States of America”, “Kingdom of Norway “, “Kongeriket Norge” (in bokmål), “Kongeriket Noreg” (in Nynorsk) or “Bundesrepublik Deutschland”. Unless the application subscribes to a geo-locating and describing database, the GEDCOM data sent in the transmission is completely the responsibility of the data entry person. Albert: Using the hierarchical structure of GEDCOM-L location records, my application can find all people with any event (or other search criteria as name, occupation, year of birth) in the county of Heidekreis. The application does this by checking the record county Heidekreis, and all "sub"records linked to Heidekreis in any generation. I will implement an extension to build the hierarchical structure of location records as long as GEDCOM standard does not offer it itself to make sure this functionality is covered by GEDCOM transmission. Reply: In your example this is true, If and Only If, Heidekreis is unique within the database. If the city name was “Paris” we could have multiple record instances within a single database. If the application subscribes to the GOV database a GOV-ID may be present, but it is very presumptive that any application will automatically subscribe to this database. Rather, a software application can subscribe to any number of providers, create their own database or have no geolocation database. A second problem that if an application does subscribe to the GOV database, does that application bring in all of the data found in the database for the entity or just what the application user needs or cares about. For example, they may only care about the entities current name and locator. The rest could be in their GEDCOM creating unnecessary bloat! The information about the hierarchy of entities in the GOV database is great as a reference for entities they support, but GOV does not support all place entities in the world where an event may occur. Keep this information and the hierarchy with GOV, don’t carry it within the GEDCOM transmission. Albert: You argument, the GEDCOM-L solution implies that all applications / genealogies must implement the same data of places, does not work for me. I have the external link to GOV (gov.genealogy.net), and when importing an additional GEDCOM file my application automatically merges location records with same ID in GOV. Reply: The proposal has no issue with linking to the GOV database, this linking is provided by the proposal. We need to separate what the GOV Place Directory (“GOV”) does from the function of the GEDCOM transmission! GOV as a tool is a fine place (although incomplete) to research and document places in the world. The GOV-ID can be used as a source of information about a place entity and referencing GOV adds some value to a GEDCOM. However, the structure that GEDCOM-L suggests for inclusion in Standard GEDCOM, which I can only assume mimics the structure of the GOV database, increases the size of a transmission by its need to create record instances for all entity levels superior to the targeted place. These additional record instances have limited value in the transmission. The example above, “Tetendorf, Heidekreis, Lower Saxony, Germany” requires four SPLAC records, one for each of the levels in the hierarchy to be created and transmitted. The three additional record instances (for Heidekreis, Lower Saxony, and Germany) may never be referenced by any Fact/Event PLAC tags within the transmission and therefore are overkill. In my proposal only one SPLAC instance is required. For application databases that use the GEDCOM record structure in some form as their schema the overhead of multiple SPLAC levels would be problematic especially if a place entity could have multiple superior place entities based on time or language. Language comes into play when an entity was part of a French speaking superior entity and later became part of a German Speaking superior entity. A far bigger issue is when the bottom level is a common entity name like “Paris” or “Lincoln” where locating the correct Lincoln or Paris requires at least one (possibly more) higher level entity instances to be called to determine the correct target entity (“Paris”). Before the SPLAC can be connected to the Fact/Event all possible Paris or Lincoln combinations in the GEDCOM must be reviewed to determine the correct one (and if none is found then a new SPLAC hierarchy added) before the SPLAC can be connected. For example, the user knows that the person was born in Paris, Ohio, under the Multi-level SPLAC design the application will have to look several levels up to determine if they have Paris, Ohio (in the correct township and county) or Paris, Michigan or any one of more than a dozen cities named Paris in the USA. There are more than 40 Cities named Lincoln in the USA and more in the world! Additionally, using the GEDCOM-L record (again we are not talking about the GOV database) where an entity has multiple superior entities, such as your “Tetendorf” example that is based on date, we could take different paths to find the correct superior entity, (Soltau, Landkreis Soltau-Fallingbostel, or Heidekreis). The user of the application may not have or know the date when the event occurred and therefore could either not take the right path to the next level or not select any next levels! If the user creates an SPLAC record themselves without knowing all the levels, they could just enter Tentendof, Germany and be done with the data entry at that time! |
Soltau is not unique in Lower Saxony. It was a county, and it is a city. If the data transferred by GEDCOM neither have the type of the place, nor the next level object, an user not familiar with the situation will treat the place "Soltau" as a city. And if that is wrong, because the original source cited the county, then the user will not find any sources of the event in any archive of the city of Soltau, if the event took place in another place within the county of Soltau, and not in the city of Soltau. Next point is: Putting in a place, I only need to identify the next level place object and link to it. For all places linked to the higher level place object, I only once have to put its data including its next level object (or more, if there were more depending on date periods). By the link the data structure automatically transfers this information to all lower level objects. It is much easier than to look for the complete hierarchy again and again for all lowest level places. So within a place object, no other information about any higher level object is needed only the link to next level. This reduces the time to build the place records, and it reduces the size of data in the GEDCOM file, as multiple information about in which state a county is located, is not written for all places in that county, but only for the county itself. If you want, you can create the list of hierarchies for a place from the hierarchical database, and put it in your reports. And you can do that with the hierarchy exactly matching to the date of the event. That is the only use case where you really need this hierarchical list. All the other points: We will not find together. I need the data, @Norwegian-Sardines does not. So I need to export them, he does not. As long as GEDCOM standard does not offer the possibility to export the data I need, I stay with the extensions we have agreed on in GEDCOM-L list. The question is, whether other applications use the link to next level objects, too. Then this link should be within GEDCOM standard so these applications can share the data. |
This is 100% true and if I found a source (like a bible) and it said the person was born in Soltau, Germany. I would not know that they were born in the city or the county. Under GEDOM-L design I could only create a 2 SPLAC record instances 1) Soltua 2) Germany because I don't have access to anything that might help. If I thought to go to Wikipedia I might find that it would say:
So do I create three additional SPLAC records (Lüneburg Heath, Heidekreis, and Lower Saxony) and then I would not know what TYPE to use for "Lüneburg Heath" and "Lower Saxony". By then I'm getting bored with data entry and saying forget it "Soltau, Germany it is"!!! As you have noted People not familiar with the setup of divisions in other countries (and possibly their own) don't invest time in researching the place. Most people living in the Chicago, Illinois, USA area don't know that "Cook County" (the county which Chicago and many of its suburbs) has more than 30 Townships (a division between county and city) and never input the division. Some cities are found in multiple counties, which one is the next higher level in GEDCOM-L? I don't understand your second point! If I have 8 Paris City SPLAC instances in your GEDCOM-L hierarchy and only know that it is in the State of Ohio, I don't know the potential two levels (county and township) between the State SPLAC instance and the City SPLAC instance. |
@Norwegian-Sardines :
... or similar were also included.
Based on your example...
... I think it should be the PLAC information (here: Paris). A corresponding statement/recommendation in the text would provide clarity (or have I overlooked it?).
I would not see the new SPLAC structure as a replacement for ADDR and would rather enter “Highland Cemetery, John Doe Grave” as ADDR. For me, ADDR are therefore not always mandatory postal addresses.
Dirk |
Dirk: I had hoped that the comma-separated location entities would be replaced by Dirk: How is a beginner to know that he has to enter comma-separated elements in ascending order in his software? Dirk: It should be clearer which of the place texts would then be used for output. Especially with extensive list output, a place does not have to be repeated each time with all its place entities.
NOTE: This record includes multiple Place Descriptions based on the history of the city over time. Only the first
Paris would need two date-based links to superior levels 1) Lamar County 2) Red River County The second issue is that the Record Instance of “Paris” has no identifier in it to indicate that it is the Paris that will eventually get us into a Texas land mass. Paris Ohio would start with Paris at the lowest level too! Dirk: I would not see the new Dirk: The software can then generate an additional glossary page in which, among other things, “Hmbg = Hamburg” can be found as an explanation. Or have I misinterpreted Dirk: I don't particularly like the combination of |
Are we mixing now GEDCOM data transfer and internal application processes? |
How many of those links/events are in
Could some of them be in Lamar County, Mississippi? |
My point here is that Lamar County (and potentially any place entity) is not unique within the GOV database, it exists within the context of its higher place entities. In this case the State of Texas, the State of Mississippi, or the State of Georgia, So using the record instance named "Lamar" or "Lamar County" as the entry point to find all events that occurred in that county must also identify the State within the United States. And of course, as stated previously, Lamar was also in the Country of "The Republic of Texas". So this is why the hierarchy of different record instances has issues, and they are compounded if you want to include historical place names, place names by language, and place names that are not unique in the world due to their contextual connections to other place entities. |
Lamar County in Texas has 133 locations in the default database. You are correct, we have to pick this SPLAC record for the County in Texas when searching. My application does it automatically, if you choose it as parent SPLAC record. If you only look for the name "Lamar County", you get the places in other states of the US, too. But that is shown in the result, as the search result will list the name of location, its type, its county, its state, and the country as well as the GPS coordinates. By the way, do you know an external database which shows all the historic links in US? GOV could do that, if somebody enters the data. GOV is based on the community of researchers cooperating to enter all these data... |
No I don’t know of an external DB of historical links for the USA. Sorry, I have enough to do, can’t use up my time entering data for other people! |
Since I started reading GEDCOM, I have always wondered why on some places in the GEDCOM, ADDR had to be used, and on other places PLAC.
Inside ADDR we have tags for CITY, STAE, POST and CTRY.
But inside PLAC we have jurisdictions that do the same?!
On places in GEDCOM, where only PLAC is allowed now, people start using an address as the leftmost, smallest, jurisdiction, because they want to denote an address rather then a Place.
Or they use the name of a church as the leftmost jurisdiction.
Now I have seen the proposels for SPLAC (See #520 and #527 ) of which I want to add to 520 "Adding SPLAC beside PLAC", because that is more in line with what I will write here.
The comparison in that proposel, was made with NOTE and SNOTE.
But I think that should be with REPO and SOUR.
Why?
1 (one) Repository can contain many sources. At an event, we can link to a Source, which can link to a Repository.
I wonder if addresses are not the same.
1 (one) City, can have many addresses. So at an event we should have just an address described, and that address should point to a City (SPLAC). (which in turn points to, a state, which ... etc just as in the new spec of GEDCOM)
The address at the event should NOT have CITY, STAE, POST and CTRY.
Because that way we have kind of the same information on more places in the GEDCOM.
And to maybe make things clearer, we should not call it ADDR, but maybe BUILDING (or an abbreviation of that like BLDNG)
BUILDING can be someones home, or a church or a castle, or a university, a Harbour etc.
Because in 1 home, more children can be born, and in 1 church many children can be baptised, or people can have their wedding, BUILDING should also be a record like structure, not a property of something.
In fact, BUILDING is the smallest form of SPLAC.
It should have a name or a title so it can be identified for a user.
So where ever it is now allowed to have either ADDR or PLAC, we will now have BUILDING there.
That just describes an address in user understandable text. And points to the CITY SPLAC entity.
I looked at addresses around the world, and that seems way to complicated to "catch" in a specification.
So thats why I say, inside the BUILDING, the address is written as complete as the user wants, but that text will only be output by a program (shown to the user), NOT interpreted, it is not used to define where it is on a map, because of the complications.
Defining a BUILDING on a map is done by the link to the "first" SPLAC in the chain.
Now because there can be more BUILDINGs in a City, inside BUILDING we can also use MAP with its 2 coördinates to pinpoint its position in a City.
If that position is not present, it will be put at the center of the city it belongs to (the default position of the City itself). On top of others that have no further positioning.
By adding MAP to BUILDING, it will now also be possible to denote the birthplace when a child is born on a ship or in a plane, or in a car somewhere, because a hospital could not be reached in time. Same is valid for a Death at sea.
So instead of dying in the "Atlantic Ocean", which is so huge we have no idea where that might have been happening, we maybe are able to figure out from the route a ship took, and the date of the death, where it might have been and show a bit better in that immense ocean where it happened.
So BUILDING could also be a Plane or a Boat.
To me I think BUILDING should also have a NOTE structure and a SOURCE citation.
And SPLAC would have a CHAN and a CREA.
Area's:
I remember, to have seen, I think Luther, talking about people wanting an area to denote an event.
That is also possible, we define RADIUS under MAP, and give meters, or kilometers ar anything, and we have a circle with a center, denoting the area where things took place.
1 MAP
2 LATI N18.150944
2 LONG E168.150944
2 RADIUS 5.4KM
Instead of RADIUS we could also have SQUARE or RECT like:
1 MAP
2 LATI N18.150944
2 LONG E168.150944
2 RECT X:+5.7KM Y:-5.7KM
Depending on the + or - the Coordinates denote which corner it is. (Both +, the corner is the left bottom, both minus, that corner is the top right)
But to me RADIUS seems easier.
My guess is this will also reduce GEDCOM size, as a lot of "doubled text" (from all the ADDR's that are in fact the same) will be removed and move into the corresponding SPLAC records.
If BUILDING has a MAP structure and SPLAC too, the MAP of BUILDING should be used to show on a map, as thats more precise. The MAP of a City points to an arbitrary point inside the City. A default point, in case the BUILDINGs pointing to that City, have no Map structure.
In the SPLAC beside PLAC md file, I think there is an SPLAC record missing under RECORD :=
For the TYPE I would choose to have everything in Uppercase, so CITY, COUNTY, STATE, COUNTRY
I think it might be more clear if there also was an example with a real placename.
Maybe, in case of the above example for an Ocean, have a TYPE OCEAN too?
And maybe a Type AIR?
These last 2 Types have no other SPLACs that they point to, or that point to those I presume.
Some other thing about SPLAC:
Now it has coördinates. But what if there would be more ways of denoting a place then just LAT and LONG and an Address.
Then it might need a TYPE to tell what mapping system is used. And maybe more then 1 mapping system can be used for 1 SPLAC?
There seem to be other systems like What3words, UTM coding, Plus Codes, and maybe more.
But that does not seem very common yet?
I am sure I forgot things, but I wanted to add it here, to maybe inspire someone.
The text was updated successfully, but these errors were encountered: