FreeUKGen / FreeCENMigration

Issue tracking for project migrating FreeCEN to FreeCEN2 genealogy record database and search engine architecture. Code developed here is based on that developed in MyopicVicar
https://www.freecen.org.uk
Apache License 2.0
4 stars 3 forks source link

Resolve the Red source issues in Geoff's Source report #1369

Closed Captainkirkdawson closed 2 years ago

Captainkirkdawson commented 2 years ago

2/3rds of way through. Completed NFK

Captainkirkdawson commented 2 years ago

Finished DTS; there are 45 left to do

Captainkirkdawson commented 2 years ago

@geoffj-FUG I have effectively completed the reds. However some of the large urban centre entries ae a mess as you have noted. I would like to spend some time getting then into a consistent structure. Before starting I wish to confirm that place naming should follow the larger smaller rule i.e. Leeds Beeston should be the format NOT Beeston Leeds. There is also the question of brackets. Do we include or not include? We are inconsistent. (The internal standard format used by the system drops them regardless)

Captainkirkdawson commented 2 years ago

IMO if we are doing the bigger smaller rule we should bracket so as to indicate that is what has been done. ie. Leeds (West)

geoffj-FUG commented 2 years ago

Kirk

Yes, Bigger to smaller.

Brackets are essential. We are looking at locking down duplicate entries by testing strings. So Wraxall (Nailsea) has to be differentiated from Wraxall (Shepton Mallet). The searching of POBs will need to take account of multiple lat/long entries if and when it ever happens.

Is Leeds West a place or not? I would validate it as the alternative Leeds. However, Leeds is a big area. West Leeds may be a legitimate place name. It is not in Genuki list of Leeds Parishes. So Leeds West is really Leeds West Riding of Yorkshire? It may well be a FC1 abbreviation given that it is Yorkshire. It should not even be in the Gazetteer. If it was entered as Leeds (West) then the system would not validate Leeds West and somebody would change it.

YKS is one of the big problems with Gazetteer entries. The number that I have in Anne’s spreadsheet of place names that do not have web addresses is incredible.

Geoff

geoffj-FUG commented 2 years ago

Kirk

Further to my previous email.

If Leeds West is going to be entered to aid validation because it recurs in many pieces in many years, it should be an alternative name to show that it is Leeds. If should not be entered as a primary name.

The problem is changing people’s habits. Everything was entered in one list in FC1. No alternatives.

Geoff

Captainkirkdawson commented 2 years ago

@geoffj-FUG Leeds (West) would validate Leeds West because we validate on the standard name which is always shown on the Freecen2_place detail display

Captainkirkdawson commented 2 years ago

@geoffj-FUG your argument about having all quadrants as alternates to the primary in cities makes logical sense. The counter argument is that doing so creates a single place with many hundreds of thousands of records and loss of specificity not to mention maxing out the results for common names. Hence I would urge retention of the sectorization.

Captainkirkdawson commented 2 years ago

I have cleaned up leeds and think it looks better!!

Captainkirkdawson commented 2 years ago

All red items dealt with

geoffj-FUG commented 2 years ago

Thank you.

I will get the missing web address and invalid sources sorted out and then we can run another download and see how things look then.

The missing web addresses are being delegated to the Coordinators. Pat has already sent out a message to them that it is going to happen.

Geoff

From: Kirk Dawson @.> Sent: Wednesday, 9 March 2022 10:30 AM To: FreeUKGen/FreeCENMigration @.> Cc: geoffj-FUG @.>; Mention @.> Subject: Re: [FreeUKGen/FreeCENMigration] Resolve the Red source issues in Geoff's Source report (Issue #1369)

All red items dealt with

— Reply to this email directly, view it on GitHub https://github.com/FreeUKGen/FreeCENMigration/issues/1369#issuecomment-1062428517 , or unsubscribe https://github.com/notifications/unsubscribe-auth/AKCPIFNLEVIKXQIRCV4O5DLU67WJFANCNFSM5NUE7YEA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub . You are receiving this because you were mentioned. https://github.com/notifications/beacon/AKCPIFN7HDMQT3UCO3MU6H3U67WJFA5CNFSM5NUE7YEKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOH5JV6ZI.gif Message ID: @. @.> >

geoffj-FUG commented 2 years ago

Kirk

OK I understand

Geoff

From: Kirk Dawson @.> Sent: Wednesday, 9 March 2022 8:36 AM To: FreeUKGen/FreeCENMigration @.> Cc: geoffj-FUG @.>; Mention @.> Subject: Re: [FreeUKGen/FreeCENMigration] Resolve the Red source issues in Geoff's Source report (Issue #1369)

@geoffj-FUG https://github.com/geoffj-FUG your argument about having all quadrants as alternates to the primary in cities makes logical sense. The counter argument is that doing so creates a single place with many hundreds of thousands of records and loss of specificity not to mention maxing out the results for common names. Hence I would urge retention of the sectorization.

— Reply to this email directly, view it on GitHub https://github.com/FreeUKGen/FreeCENMigration/issues/1369#issuecomment-1062290296 , or unsubscribe https://github.com/notifications/unsubscribe-auth/AKCPIFJ2QTY4Z3GBPKSYPSTU67I47ANCNFSM5NUE7YEA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub . You are receiving this because you were mentioned. https://github.com/notifications/beacon/AKCPIFL4QIBMZSI4QUM5LK3U67I47A5CNFSM5NUE7YEKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOH5IUG6A.gif Message ID: @. @.> >

DeniseColbert commented 2 years ago

Done, closing