pulibrary / special_collections

Deprecated - Special Collections ( formerly RBSC) Drupal 7 Project
https://library.princeton.edu/special-collections/
GNU General Public License v2.0
2 stars 0 forks source link

libweb urls should be removed #679

Closed carolyncole closed 2 months ago

carolyncole commented 3 years ago

Analysis of the sql show all of these libweb urls still existing in the special collections site:

libweb_urls_uniq.txt

each server should be analyzed and removed

carolyncole commented 3 years ago

The dump file with new lines was created by loading the production database locally and then running

lando ssh
mysqldump -h database -u drupal7 -p drupal7 --extended-insert=FALSE >dump_newline.sql
exit

This files was created by searching the sql dump file with new lines

grep http://libweb dump_newline.sql | grep -v revision | grep -v webform_submitted_data  | grep -v linkchecker_link | grep -v field_revision_field_direct_database_link >> libweb.txt
grep http://libweb dump_newline.sql | grep -v revision | grep -v webform_submitted_data  | grep -v linkchecker_link | grep -v field_revision_field_direct_database_link > libweb.txt
awk -F'http' '{print $2}' libweb.txt > libweb_awk.txt

Use virtual studio change all to clean up the links and then

sort libweb_awk.txt |uniq > libweb_urls_uniq.txt