TMContent / Lib_UNO

All the articles collapsed into one library
Creative Commons Zero v1.0 Universal

Fix Duplicate Articles #50

Closed romichg closed 9 years ago

romichg commented 9 years ago

There are currently 24 duplicate articles. They are listed in the DUPES view on https://uno.teammentor.net. We should remove the duplicates and provide a mapping from each removed GUID to the GUID that remains.

romichg commented 9 years ago

@SergeTruth removed the duplicate articles. We now need to replace any links to the removed articles with links to the valid ones. Here is the mapping. @michaelhidalgo, let's use a script to search all articles for each Dupe GUID and replace it with the Good GUID below.

Dupe GUID   Good GUID
8e2cab7b-bab6-488d-9487-d93b3f479b27    fe9afb4a-790d-413f-9b77-4258314c1dc4
8504cbf2-1d26-4451-8432-edd3b469b041    80daaf63-19c7-4d69-beec-ec255660f82d
00000000-0000-0000-0000-0000005ab7c3    00000000-0000-0000-0000-00000043b1b9
204c4dad-aa90-491f-ab60-c458edb07938    a495f2bb-cc74-4679-a01e-cc0580b3d404
1a5a55be-606e-4bef-a0ae-10c684928012    02cca0fd-72ee-42ec-a0e4-386cc9051e63
8d739a7a-1cae-43ae-985b-7eb01f2685fe    deeb51ee-f75a-4e6a-a5f7-de2f27aa2beb
a61fa5bb-c310-41c3-b22d-f17a91fe188a    deeb51ee-f75a-4e6a-a5f7-de2f27aa2beb
93ab7db1-45dc-4e8d-a51b-2c5b328f01da    e0e1e06b-51cf-47ae-ad7f-0cff6ff4f683
41a04d3f-0bb5-4904-bb28-134a109f9ce0    99e25582-fc50-4721-9a07-050ca26e3e66
00000000-0000-0000-0000-0000005ae4db    00000000-0000-0000-0000-0000001ab0c1
8eaf228f-15c2-4c50-a15b-91b3bb213d68    58b12785-2ca7-48b0-89c5-8bee7aaa1cf8
4a98a893-8b2c-4a8d-9b89-1247e5418263    63ac333b-41e4-4aba-b72f-c5b7591d6398
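
A minimal sketch of such a search-and-replace script, written in Python. It assumes the articles are UTF-8 text files matching `*.xml` under a local checkout and that links contain the GUID verbatim in lowercase; the function name `replace_dupe_guids` and the file pattern are illustrative assumptions, not part of TeamMentor.

```python
from pathlib import Path

# Dupe GUID -> Good GUID, copied from the mapping above.
GUID_MAP = {
    "8e2cab7b-bab6-488d-9487-d93b3f479b27": "fe9afb4a-790d-413f-9b77-4258314c1dc4",
    "8504cbf2-1d26-4451-8432-edd3b469b041": "80daaf63-19c7-4d69-beec-ec255660f82d",
    "00000000-0000-0000-0000-0000005ab7c3": "00000000-0000-0000-0000-00000043b1b9",
    "204c4dad-aa90-491f-ab60-c458edb07938": "a495f2bb-cc74-4679-a01e-cc0580b3d404",
    "1a5a55be-606e-4bef-a0ae-10c684928012": "02cca0fd-72ee-42ec-a0e4-386cc9051e63",
    "8d739a7a-1cae-43ae-985b-7eb01f2685fe": "deeb51ee-f75a-4e6a-a5f7-de2f27aa2beb",
    "a61fa5bb-c310-41c3-b22d-f17a91fe188a": "deeb51ee-f75a-4e6a-a5f7-de2f27aa2beb",
    "93ab7db1-45dc-4e8d-a51b-2c5b328f01da": "e0e1e06b-51cf-47ae-ad7f-0cff6ff4f683",
    "41a04d3f-0bb5-4904-bb28-134a109f9ce0": "99e25582-fc50-4721-9a07-050ca26e3e66",
    "00000000-0000-0000-0000-0000005ae4db": "00000000-0000-0000-0000-0000001ab0c1",
    "8eaf228f-15c2-4c50-a15b-91b3bb213d68": "58b12785-2ca7-48b0-89c5-8bee7aaa1cf8",
    "4a98a893-8b2c-4a8d-9b89-1247e5418263": "63ac333b-41e4-4aba-b72f-c5b7591d6398",
}


def replace_dupe_guids(root, guid_map=GUID_MAP, pattern="*.xml"):
    """Rewrite files under `root` that mention a Dupe GUID; return the changed paths."""
    changed = []
    for path in sorted(Path(root).rglob(pattern)):  # pattern is an assumption
        text = path.read_text(encoding="utf-8")
        updated = text
        for dupe, good in guid_map.items():
            updated = updated.replace(dupe, good)
        if updated != text:  # only touch files that actually contained a Dupe GUID
            path.write_text(updated, encoding="utf-8")
            changed.append(path)
    return changed
```

No Good GUID in the mapping is itself a Dupe GUID, so the order of replacements does not matter. Running it on a copy of the content first and reviewing the returned list of changed files is probably the safer way to use it.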

michaelhidalgo commented 9 years ago

Hi @roman87, I've been working on this issue.

However, in sme/LibUno I'm still able to access the Dupe articles. I just want to make sure we are working with the updated version. My branch is dev.

[Screenshot: 2015-05-29 at 16:37:14]

michaelhidalgo commented 9 years ago

Actually, I can still see the article, but with [DUPE] in its title: https://uno.teammentor.net/article/8e2cab7b-bab6-488d-9487-d93b3f479b27

[Screenshot: 2015-05-29 at 16:42:17]

michaelhidalgo commented 9 years ago

Ok, so I did some scripting here.

I searched for all the duped GUIDs, but I did not find any article containing a reference to any of them.

I searched all 2925 files on disk from the sme dev branch.

This is the script I used.

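// `files` is assumed to hold the paths of all article files (defined earlier in the script).
// Counts the files whose contents mention the GUID, skipping any file whose own path contains it.
Func<string,int> articlesContainingGuid = (guid) =>
{
    int counter = 0;
    foreach (var file in files)
    {
        var content = file.fileContents();
        if (!file.contains(guid) && content.contains(guid))
        {
            counter++;
        }
    }
    return counter;
};
return articlesContainingGuid("4a98a893-8b2c-4a8d-9b89-1247e5418263");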
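
The same check can be run for every Dupe GUID in one pass. Below is a rough Python equivalent of the script above, offered only as a sketch: `count_references` is a hypothetical helper, and the `*.xml` pattern and UTF-8 encoding are assumptions about how the articles are stored on disk.

```python
from pathlib import Path


def count_references(root, guids, pattern="*.xml"):
    """Return {guid: number of files mentioning it}.

    As in the script above, a file whose own path contains the GUID
    (presumably the duplicate article itself) is not counted.
    """
    counts = {guid: 0 for guid in guids}
    for path in Path(root).rglob(pattern):  # pattern is an assumption
        text = path.read_text(encoding="utf-8", errors="replace")
        for guid in guids:
            if guid not in str(path) and guid in text:
                counts[guid] += 1
    return counts
```

Passing the Dupe GUID column of the mapping as `guids` gives one count per duplicate; a result of all zeros would match the finding that no article references the removed GUIDs.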

SergeTruth commented 9 years ago

I think I fixed this?

romichg commented 9 years ago

Hmm... ok. We will need to run the dupes script again on the beta content once beta is ready, to confirm.

romichg commented 9 years ago

This is done.