FreeUKGen / FreeCENMigration

Issue tracking for project migrating FreeCEN to FreeCEN2 genealogy record database and search engine architecture. Code developed here is based on that developed in MyopicVicar
https://www.freecen.org.uk
Apache License 2.0
4 stars 3 forks source link

307808132 Invalid text in gazetteer? (Ali) #1813

Open FreeREGcomputer opened 1 month ago

FreeREGcomputer commented 1 month ago

Issue reported by AH001 at 2024-10-02 14:15:32 UTC Time: 2024-10-02T13:15:52+00:00 Session ID: eab012a4b15dd704af68bfc41e5c927b Problem Page URL: /freecen2_places/66fd2f20b18f7cf8bfbaef57?locale=en Previous Page URL: https://www.freecen.org.uk/freecen2_places/66fd2f20b18f7cf8bfbaef57/edit?locale=en Reported Issue: Invalid text allowed in gazetteer but not in csv??, although place is in gazetteer it is shown as a warning, ie not in gazetteer, in csv.

Sweden Soderhamn

Screenshot

geoffj-FUG commented 1 month ago

I do not understand the problem here.

The invalid text is presumably the dots over the o in the place name. The anglicised version is also recorded. A search to match a place of birth will presumably match the anglicised name.

We are aware that UTF8 has not been implemented on the csv collection. There is a story that covers this.

Characters that are not currently recognised are appearing in the Gazetteer and that is deliberate. Irish names in particular require this to be able to record the Gaelic name. We are collecting these names so that the Gazetteer is comprehensive and will not need to be revisited.

In this case the place name is in Sweden and the letter o is recorded correctly.

This issue will be addressed during development of POB searching as UTF8 is implemented.

Geoff

From: Vino-S @.> Sent: Wednesday, 16 October 2024 7:16 PM To: FreeUKGen/FreeCENMigration @.> Cc: Geoff J @.>; Assign @.> Subject: Re: [FreeUKGen/FreeCENMigration] 307808132 Invalid text in gazetteer? (Ali) (Issue #1813)

Assigned #1813 https://github.com/FreeUKGen/FreeCENMigration/issues/1813 to @geoffj-FUG https://github.com/geoffj-FUG .

— Reply to this email directly, view it on GitHub https://github.com/FreeUKGen/FreeCENMigration/issues/1813#event-14672074858 , or unsubscribe https://github.com/notifications/unsubscribe-auth/AKCPIFO3LTU5JIN65RG5R53Z3YVDLAVCNFSM6AAAAABPIDHOK6VHI2DSMVQWIX3LMV45UABCJFZXG5LFIV3GK3TUJZXXI2LGNFRWC5DJN5XDWMJUGY3TEMBXGQ4DKOA . You are receiving this because you were assigned. https://github.com/notifications/beacon/AKCPIFMUUDFFHWSUW54VFKTZ3YVDLA5CNFSM6AAAAABPIDHOK6WGG33NNVSW45C7OR4XAZNWJFZXG5LFIV3GK3TUJZXXI2LGNFRWC5DJN5XKUY3PNVWWK3TUL5UWJTYAAAAAG2UGDBVA.gif Message ID: @. @.> >

geoffj-FUG commented 1 month ago

I do not understand the problem here.

The invalid text is presumably the dots over the o in the place name. The anglicised version is also recorded. A search to match a place of birth will presumably match the anglicised name.

We are aware that UTF8 has not been implemented on the csv collection. There is a story that covers this.

Characters that are not currently recognised in csv and vld files are appearing in the Gazetteer and that is deliberate. Irish names in particular require this to be able to record the Gaelic name. We are collecting these names so that the Gazetteer is comprehensive and will not need to be revisited.

In this case the place name is in Sweden and the letter o is recorded correctly in the Gazetteer.

This issue will be addressed during development of POB searching as UTF8 is implemented.

Geoff

From: Vino-S @.> Sent: Wednesday, 16 October 2024 7:16 PM To: FreeUKGen/FreeCENMigration @.> Cc: Geoff J @.>; Assign @.> Subject: Re: [FreeUKGen/FreeCENMigration] 307808132 Invalid text in gazetteer? (Ali) (Issue #1813)

Assigned #1813 https://github.com/FreeUKGen/FreeCENMigration/issues/1813 to @geoffj-FUG https://github.com/geoffj-FUG .

— Reply to this email directly, view it on GitHub https://github.com/FreeUKGen/FreeCENMigration/issues/1813#event-14672074858 , or unsubscribe https://github.com/notifications/unsubscribe-auth/AKCPIFO3LTU5JIN65RG5R53Z3YVDLAVCNFSM6AAAAABPIDHOK6VHI2DSMVQWIX3LMV45UABCJFZXG5LFIV3GK3TUJZXXI2LGNFRWC5DJN5XDWMJUGY3TEMBXGQ4DKOA . You are receiving this because you were assigned. https://github.com/notifications/beacon/AKCPIFMUUDFFHWSUW54VFKTZ3YVDLA5CNFSM6AAAAABPIDHOK6WGG33NNVSW45C7OR4XAZNWJFZXG5LFIV3GK3TUJZXXI2LGNFRWC5DJN5XKUY3PNVWWK3TUL5UWJTYAAAAAG2UGDBVA.gif Message ID: < @.> @.>