jayvdb / pypidb

PyPI client side database with SCM/VCS URLs
Apache License 2.0

regex101.com leads to wrong result #116

Open jayvdb opened 4 years ago

jayvdb commented 4 years ago
INFO     pypidb._pypi:_pypi.py:536 looking up aws-cdk-assets
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://pypi.org/pypi/aws-cdk-assets/json skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://pypi.org/pypi/aws-cdk-assets/json skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://pypi.org/pypi/aws-cdk-assets/json
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://pypi.org/pypi/aws-cdk-assets/json
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://pypi.org/pypi/aws-cdk-assets/json skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://pypi.org/pypi/aws-cdk-assets/json')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://pypi.org/pypi/aws-cdk-assets/json occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://pypi.org/pypi/aws-cdk-assets/json" in the cache
DEBUG    cachecontrol.controller:controller.py:163 Returning cached "301 Moved Permanently" response (ignoring date and etag information)
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://pypi.org/pypi/aws-cdk.assets/json skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://pypi.org/pypi/aws-cdk.assets/json skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://pypi.org/pypi/aws-cdk.assets/json
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://pypi.org/pypi/aws-cdk.assets/json
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://pypi.org/pypi/aws-cdk.assets/json skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://pypi.org/pypi/aws-cdk.assets/json')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://pypi.org/pypi/aws-cdk.assets/json occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://pypi.org/pypi/aws-cdk.assets/json" in the cache
DEBUG    cachecontrol.controller:controller.py:179 Current age based on date: 147086
DEBUG    cachecontrol.controller:controller.py:201 Freshness lifetime from expires: 431944
DEBUG    cachecontrol.controller:controller.py:219 The response is "fresh", returning cached response
DEBUG    cachecontrol.controller:controller.py:220 431944 > 147086
DEBUG    pypidb._pypi:_pypi.py:556 rule DefaultRule aws-cdk-assets/aws-cdk-assets = aws-cdk.assets None {}
DEBUG    pypidb._pypi:_pypi.py:519 emails []
DEBUG    pypidb._pypi:_pypi.py:311 queue [UrlSet({'https://github.com/aws/aws-cdk.git', 'https://pypi.org/project/aws-cdk.assets/1.33.0/', 'https://pypi.org/project/aws-cdk.assets/', 'https://github.com/aws/aws-cdk'}), Text('Integratio...'), Text('## AWS CDK...'), Name('aws-cdk.assets')]
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: {'https://github.com/aws/aws-cdk.git', 'https://pypi.org/project/aws-cdk.assets/1.33.0/', 'https://pypi.org/project/aws-cdk.assets/', 'https://github.com/aws/aws-cdk'}
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/aws/aws-cdk.git')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/aws/aws-cdk.git')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('aws', 'aws-cdk') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://pypi.org/project/aws-cdk.assets/1.33.0/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://pypi.org/project/aws-cdk.assets/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/aws/aws-cdk')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/aws/aws-cdk')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('aws', 'aws-cdk') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:355 --results from {'https://github.com/aws/aws-cdk.git', 'https://pypi.org/project/aws-cdk.assets/1.33.0/', 'https://pypi.org/project/aws-cdk.assets/', 'https://github.com/aws/aws-cdk'}: ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk']
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['https://pypi.org/project/aws-cdk.assets/', 'https://pypi.org/project/aws-cdk.assets/1.33.0/']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from None added urls ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity.get_best_match(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk'])
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk'], cond = <function _slash_endswith at 0x7f641e407280>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk'], cond = <function _endswith at 0x7f641e407310>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk'], cond = <function _python_and_name_match at 0x7f641e4073a0>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk'], cond = <function _only_name_match at 0x7f641e407430>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk'], cond = <function get_best_match.<locals>.<lambda> at 0x7f641606cdc0>)
DEBUG    pypidb._similarity:_similarity.py:101 counter - Counter({'https://github.com/aws/aws-cdk': 2})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:_similarity.py:244 !!!!finding similar: ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_most_similar(name = 'aws-cdk-assets', urls = {'https://github.com/aws/aws-cdk'})
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:_similarity.py:118 computed similarity: {'https://github.com/aws/aws-cdk': 0.23809523809523814}
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.23809523809523814 https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:129 nearest_counter Counter({0.23809523809523814: 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity._get_most_similar
DEBUG    pypidb._similarity:_similarity.py:255 common, similar: https://github.com/aws/aws-cdk - https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:257 most common and most similar: https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity.get_best_match
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._pypi:_pypi.py:317 processing Text: summary
DEBUG    pypidb._pypi:_pypi.py:383 @@ ran <function _url_extractor_wrapper at 0x7f641df40ca0> on text size 41 for 0 urls !!
DEBUG    pypidb._pypi:_pypi.py:388 extracted []
DEBUG    pypidb._pypi:_pypi.py:317 processing Text: description
DEBUG    pypidb._pypi:_pypi.py:383 @@ ran <function _url_extractor_wrapper at 0x7f641df40ca0> on text size 849 for 2 urls !!
DEBUG    pypidb._pypi:_pypi.py:388 extracted ['https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge', 'https://semver.org/']
DEBUG    pypidb._pypi:_pypi.py:317 processing Name: aws-cdk.assets @ Unknown
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://pypi.org/project/aws-cdk.assets/1.33.0/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule https://pypi.org/project/aws-cdk.assets/1.33.0/: False
INFO     pypidb._pypi:_pypi.py:451 r https://pypi.org/project/aws-cdk.assets/1.33.0/
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://pypi.org/project/aws-cdk.assets/1.33.0/ elapsed 0:00:00.000102
DEBUG    pypidb._pypi:_pypi.py:460 r https://pypi.org/project/aws-cdk.assets/1.33.0/ headers: {}
WARNING  pypidb._pypi:_pypi.py:463 https://pypi.org/project/aws-cdk.assets/1.33.0/: 500 Server Error: HTTPS Everywhere for url: https://pypi.org/project/aws-cdk.assets/1.33.0/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://pypi.org/project/aws-cdk.assets/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule https://pypi.org/project/aws-cdk.assets/: False
INFO     pypidb._pypi:_pypi.py:451 r https://pypi.org/project/aws-cdk.assets/
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://pypi.org/project/aws-cdk.assets/ elapsed 0:00:00.000119
DEBUG    pypidb._pypi:_pypi.py:460 r https://pypi.org/project/aws-cdk.assets/ headers: {}
WARNING  pypidb._pypi:_pypi.py:463 https://pypi.org/project/aws-cdk.assets/: 500 Server Error: HTTPS Everywhere for url: https://pypi.org/project/aws-cdk.assets/
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: summary
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: description
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://semver.org/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge not path appveyor/ci/
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge not path codecov/c/github/
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge not path github/commit-activity/y/
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge not path github/license/
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge not path lgtm/alerts/g/
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge not path lgtm/grade/python/g/
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge not path travis/
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:359 --none of this set: {'https://semver.org/', 'https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge'}
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge', 'https://semver.org/']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from description added urls []
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://semver.org/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule https://semver.org/: False
INFO     pypidb._pypi:_pypi.py:451 r https://semver.org/
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://semver.org/ skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://semver.org/ skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://semver.org/
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://semver.org/
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://semver.org/ skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://semver.org/')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://semver.org/ occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://semver.org/" in the cache
DEBUG    cachecontrol.controller:controller.py:179 Current age based on date: 153129
DEBUG    cachecontrol.controller:controller.py:201 Freshness lifetime from expires: 431944
DEBUG    cachecontrol.controller:controller.py:219 The response is "fresh", returning cached response
DEBUG    cachecontrol.controller:controller.py:220 431944 > 153129
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://semver.org/ elapsed 0:00:00.016949
DEBUG    pypidb._pypi:_pypi.py:460 r https://semver.org/ headers: {'Connection': 'keep-alive', 'Content-Length': '8519', 'Content-Type': 'text/html; charset=utf-8', 'Server': 'GitHub.com', 'Last-Modified': 'Wed, 07 Aug 2019 21:38:07 GMT', 'ETag': 'W/"5d4b44bf-69ad"', 'Access-Control-Allow-Origin': '*', 'expires': 'Fri, 24 Apr 2020 02:53:57 -0000', 'cache-control': 'public', 'Content-Encoding': 'gzip', 'X-Proxy-Cache': 'MISS', 'X-GitHub-Request-Id': '5904:3246:69A02:75500:5E728C71', 'Accept-Ranges': 'bytes', 'Date': 'Sun, 19 Apr 2020 02:54:53 GMT', 'Via': '1.1 varnish', 'Age': '437', 'X-Served-By': 'cache-sin18040-SIN', 'X-Cache': 'HIT', 'X-Cache-Hits': '1', 'X-Timer': 'S1587264894.512797,VS0,VE1', 'Vary': '', 'X-Fastly-Request-ID': 'bf1f0bd36257b1a4566dc4b5eddf874b071121e2', 'Warning': '110 - Automatically cached for 5 days, 0:00:00. Response might be stale'}
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge: False
INFO     pypidb._pypi:_pypi.py:451 r https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge elapsed 0:00:00.000133
DEBUG    pypidb._pypi:_pypi.py:460 r https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge headers: {}
WARNING  pypidb._pypi:_pypi.py:463 https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge: 500 Server Error: HTTPS Everywhere for url: https://img.shields.io/badge/cdk--constructs-experimental-important.svg?style=for-the-badge
DEBUG    pypidb._pypi:_pypi.py:317 processing Webpage: https://semver.org/
DEBUG    pypidb._pypi:_pypi.py:383 @@ ran <function _url_extractor_wrapper at 0x7f641df40ca0> on text size 26846 for 8 urls !!
DEBUG    pypidb._pypi:_pypi.py:388 extracted ['http://creativecommons.org/licenses/by/3.0/', 'http://schema.org', 'http://tom.preston-werner.com', 'http://tools.ietf.org/html/rfc2119', 'https://github.com/semver/semver/issues', 'https://regex101.com/r/Ly7O1x/3/', 'https://regex101.com/r/vkijKf/1/', 'https://semver.org/']
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: https://semver.org/
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://schema.org')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://semver.org/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://regex101.com/r/Ly7O1x/3/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://tools.ietf.org/html/rfc2119')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/semver/semver/issues')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/semver/semver/issues')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('semver', 'semver') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/semver/semver' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://regex101.com/r/vkijKf/1/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://tom.preston-werner.com')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://creativecommons.org/licenses/by/3.0/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:355 --results from {'http://schema.org', 'https://semver.org/', 'https://regex101.com/r/Ly7O1x/3/', 'http://tools.ietf.org/html/rfc2119', 'https://github.com/semver/semver/issues', 'https://regex101.com/r/vkijKf/1/', 'http://tom.preston-werner.com', 'http://creativecommons.org/licenses/by/3.0/'}: ['https://github.com/semver/semver']
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['http://creativecommons.org/licenses/by/3.0/', 'http://schema.org', 'http://tom.preston-werner.com', 'http://tools.ietf.org/html/rfc2119', 'https://regex101.com/r/Ly7O1x/3/', 'https://regex101.com/r/vkijKf/1/', 'https://semver.org/']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from https://semver.org/ added urls ['https://github.com/semver/semver']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity.get_best_match(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver'])
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver'], cond = <function _slash_endswith at 0x7f641e407280>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver'], cond = <function _endswith at 0x7f641e407310>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver'], cond = <function _python_and_name_match at 0x7f641e4073a0>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver'], cond = <function _only_name_match at 0x7f641e407430>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver'], cond = <function get_best_match.<locals>.<lambda> at 0x7f6416c60b80>)
DEBUG    pypidb._similarity:_similarity.py:101 counter - Counter({'https://github.com/aws/aws-cdk': 2, 'https://github.com/semver/semver': 2})
DEBUG    pypidb._similarity:__init__.py:177 Returned ['https://github.com/aws/aws-cdk', 'https://github.com/semver/semver'] from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:_similarity.py:244 !!!!finding similar: ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_most_similar(name = 'aws-cdk-assets', urls = {'https://github.com/semver/semver', 'https://github.com/aws/aws-cdk'})
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/semver/semver', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of semver semver to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.7142857142857143 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:_similarity.py:118 computed similarity: {'https://github.com/semver/semver': 0.7142857142857143, 'https://github.com/aws/aws-cdk': 0.23809523809523814}
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.7142857142857143 https://github.com/semver/semver
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.23809523809523814 https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:129 nearest_counter Counter({0.7142857142857143: 1, 0.23809523809523814: 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity._get_most_similar
DEBUG    pypidb._similarity:_similarity.py:255 common, similar: None - https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:263 most similar w/ identical commonality: https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity.get_best_match
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://semver.org/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:405 queue loop skipping already fetched https://semver.org/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://schema.org @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule http://schema.org: False
INFO     pypidb._pypi:_pypi.py:451 r http://schema.org
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url http://schema.org/ elapsed 0:00:00.000067
DEBUG    pypidb._pypi:_pypi.py:460 r http://schema.org/ headers: {}
WARNING  pypidb._pypi:_pypi.py:463 http://schema.org: 500 Server Error: HTTPS Everywhere for url: http://schema.org/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://regex101.com/r/Ly7O1x/3/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule https://regex101.com/r/Ly7O1x/3/: False
INFO     pypidb._pypi:_pypi.py:451 r https://regex101.com/r/Ly7O1x/3/
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://regex101.com/r/Ly7O1x/3/ skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://regex101.com/r/Ly7O1x/3/ skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://regex101.com/r/Ly7O1x/3/
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://regex101.com/r/Ly7O1x/3/
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://regex101.com/r/Ly7O1x/3/ skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://regex101.com/r/Ly7O1x/3/')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://regex101.com/r/Ly7O1x/3/ occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://regex101.com/r/Ly7O1x/3/" in the cache
DEBUG    cachecontrol.controller:controller.py:179 Current age based on date: 153116
DEBUG    cachecontrol.controller:controller.py:201 Freshness lifetime from expires: 431944
DEBUG    cachecontrol.controller:controller.py:219 The response is "fresh", returning cached response
DEBUG    cachecontrol.controller:controller.py:220 431944 > 153116
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://regex101.com/r/Ly7O1x/3/ elapsed 0:00:00.027027
DEBUG    pypidb._pypi:_pypi.py:460 r https://regex101.com/r/Ly7O1x/3/ headers: {'X-Powered-By': 'Express', 'Access-Control-Allow-Origin': '*', 'Content-Type': 'text/html; charset=utf-8', 'ETag': 'W/"3178-e2Ofmb4iU3FuJCFOpBq2OoDsFD4"', 'Vary': '', 'Content-Encoding': 'gzip', 'Date': 'Sun, 19 Apr 2020 02:55:06 GMT', 'Connection': 'keep-alive', 'expires': 'Fri, 24 Apr 2020 02:54:10 -0000', 'cache-control': 'public', 'Warning': '110 - Automatically cached for 5 days, 0:00:00. Response might be stale'}
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://tools.ietf.org/html/rfc2119 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule http://tools.ietf.org/html/rfc2119: False
INFO     pypidb._pypi:_pypi.py:451 r http://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of http://tools.ietf.org/html/rfc2119 skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of http://tools.ietf.org/html/rfc2119 skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: http://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: http://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of http://tools.ietf.org/html/rfc2119 skipped
DEBUG    urllib3.connectionpool:connectionpool.py:226 Starting new HTTP connection (1): tools.ietf.org:80
DEBUG    urllib3.connectionpool:connectionpool.py:433 http://tools.ietf.org:80 "HEAD /html/rfc2119 HTTP/1.1" 301 0
DEBUG    pypidb._adapters:_adapters.py:271 head http://tools.ietf.org/html/rfc2119 http://tools.ietf.org/html/rfc2119 <Response [301]> {'Date': 'Mon, 20 Apr 2020 21:27:59 GMT', 'Server': 'Apache/2.2.22 (Debian)', 'Location': 'https://tools.ietf.org/html/rfc2119', 'Vary': 'Accept-Encoding', 'Content-Length': '0', 'Content-Type': 'text/html; charset=iso-8859-1', 'Connection': 'Keep-Alive', 'Content-Encoding': 'gzip'} b''
INFO     https_everywhere.adapter:adapter.py:94 adapter responding to http://tools.ietf.org/html/rfc2119 with http://tools.ietf.org/html/rfc2119: {'Date': 'Mon, 20 Apr 2020 21:27:59 GMT', 'Server': 'Apache/2.2.22 (Debian)', 'Location': 'https://tools.ietf.org/html/rfc2119', 'Vary': 'Accept-Encoding', 'Content-Length': '0', 'Content-Type': 'text/html; charset=iso-8859-1', 'Connection': 'Keep-Alive', 'Content-Encoding': 'gzip'}
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://tools.ietf.org/html/rfc2119 skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://tools.ietf.org/html/rfc2119 skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://tools.ietf.org/html/rfc2119 skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://tools.ietf.org/html/rfc2119')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://tools.ietf.org/html/rfc2119 occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://tools.ietf.org/html/rfc2119" in the cache
DEBUG    cachecontrol.controller:controller.py:179 Current age based on date: 153122
DEBUG    cachecontrol.controller:controller.py:201 Freshness lifetime from expires: 431945
DEBUG    cachecontrol.controller:controller.py:219 The response is "fresh", returning cached response
DEBUG    cachecontrol.controller:controller.py:220 431945 > 153122
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://tools.ietf.org/html/rfc2119 elapsed 0:00:00.049948
DEBUG    pypidb._pypi:_pypi.py:460 r https://tools.ietf.org/html/rfc2119 headers: {'Date': 'Sun, 19 Apr 2020 02:55:01 GMT', 'Server': 'Apache/2.2.22 (Debian)', 'Content-Location': 'rfc2119.html', 'Vary': '', 'TCN': 'choice', 'Last-Modified': 'Mon, 13 Apr 2020 14:05:07 GMT', 'ETag': '"3caa6a-3b43-5a32c92014ac0;5a39b890dffe1"', 'Accept-Ranges': 'bytes', 'cache-control': 'public', 'expires': 'Fri, 24 Apr 2020 02:54:06 -0000', 'Content-Encoding': 'gzip', 'Strict-Transport-Security': 'max-age=3600', 'X-Frame-Options': 'SAMEORIGIN', 'X-Xss-Protection': '1; mode=block', 'X-Content-Type-Options': 'nosniff', 'X-Clacks-Overhead': 'GNU Terry Pratchett', 'Content-Length': '4585', 'Keep-Alive': 'timeout=5, max=100', 'Connection': 'Keep-Alive', 'Content-Type': 'text/html; charset=UTF-8', 'Warning': '110 - Automatically cached for 5 days, 0:00:00. Response might be stale'}
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://regex101.com/r/vkijKf/1/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule https://regex101.com/r/vkijKf/1/: False
INFO     pypidb._pypi:_pypi.py:451 r https://regex101.com/r/vkijKf/1/
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://regex101.com/r/vkijKf/1/ skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://regex101.com/r/vkijKf/1/ skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://regex101.com/r/vkijKf/1/
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://regex101.com/r/vkijKf/1/
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://regex101.com/r/vkijKf/1/ skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://regex101.com/r/vkijKf/1/')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://regex101.com/r/vkijKf/1/ occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://regex101.com/r/vkijKf/1/" in the cache
DEBUG    cachecontrol.controller:controller.py:179 Current age based on date: 153120
DEBUG    cachecontrol.controller:controller.py:201 Freshness lifetime from expires: 431944
DEBUG    cachecontrol.controller:controller.py:219 The response is "fresh", returning cached response
DEBUG    cachecontrol.controller:controller.py:220 431944 > 153120
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://regex101.com/r/vkijKf/1/ elapsed 0:00:00.003494
DEBUG    pypidb._pypi:_pypi.py:460 r https://regex101.com/r/vkijKf/1/ headers: {'X-Powered-By': 'Express', 'Access-Control-Allow-Origin': '*', 'Content-Type': 'text/html; charset=utf-8', 'ETag': 'W/"312c-uZAUga0sfinhMY6ltW+6vUfGhwI"', 'Vary': '', 'Content-Encoding': 'gzip', 'Date': 'Sun, 19 Apr 2020 02:55:03 GMT', 'Connection': 'keep-alive', 'expires': 'Fri, 24 Apr 2020 02:54:07 -0000', 'cache-control': 'public', 'Warning': '110 - Automatically cached for 5 days, 0:00:00. Response might be stale'}
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://tom.preston-werner.com @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule http://tom.preston-werner.com: False
INFO     pypidb._pypi:_pypi.py:451 r http://tom.preston-werner.com
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of http://tom.preston-werner.com/ skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of http://tom.preston-werner.com/ skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: http://tom.preston-werner.com/
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: http://tom.preston-werner.com/
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of http://tom.preston-werner.com/ skipped
DEBUG    urllib3.connectionpool:connectionpool.py:226 Starting new HTTP connection (1): tom.preston-werner.com:80
DEBUG    urllib3.connectionpool:connectionpool.py:433 http://tom.preston-werner.com:80 "HEAD / HTTP/1.1" 301 0
DEBUG    pypidb._adapters:_adapters.py:271 head http://tom.preston-werner.com/ http://tom.preston-werner.com/ <Response [301]> {'Cache-Control': 'public, max-age=0, must-revalidate', 'Content-Length': '47', 'Content-Type': 'text/plain', 'Date': 'Fri, 17 Apr 2020 22:36:53 GMT', 'Location': 'https://tom.preston-werner.com/', 'Age': '255066', 'Server': 'Netlify', 'X-NF-Request-ID': '0123aed4-76fc-4924-893b-f6bd27db24db-17293744', 'Connection': 'Keep-Alive'} b''
INFO     https_everywhere.adapter:adapter.py:94 adapter responding to http://tom.preston-werner.com/ with http://tom.preston-werner.com/: {'Cache-Control': 'public, max-age=0, must-revalidate', 'Content-Length': '47', 'Content-Type': 'text/plain', 'Date': 'Fri, 17 Apr 2020 22:36:53 GMT', 'Location': 'https://tom.preston-werner.com/', 'Age': '255066', 'Server': 'Netlify', 'X-NF-Request-ID': '0123aed4-76fc-4924-893b-f6bd27db24db-17293744', 'Connection': 'Keep-Alive'}
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://tom.preston-werner.com/ skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://tom.preston-werner.com/ skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://tom.preston-werner.com/
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://tom.preston-werner.com/
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://tom.preston-werner.com/ skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://tom.preston-werner.com/')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://tom.preston-werner.com/ occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://tom.preston-werner.com/" in the cache
DEBUG    cachecontrol.controller:controller.py:179 Current age based on date: 416494
DEBUG    cachecontrol.controller:controller.py:201 Freshness lifetime from expires: 695323
DEBUG    cachecontrol.controller:controller.py:219 The response is "fresh", returning cached response
DEBUG    cachecontrol.controller:controller.py:220 695323 > 416494
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://tom.preston-werner.com/ elapsed 0:00:00.043068
DEBUG    pypidb._pypi:_pypi.py:460 r https://tom.preston-werner.com/ headers: {'cache-control': 'public', 'Content-Type': 'text/html; charset=UTF-8', 'Date': 'Thu, 16 Apr 2020 01:45:29 GMT', 'Etag': '"0d03a5bde96c039d5e94c814f8237fd6-ssl"', 'Strict-Transport-Security': 'max-age=31536000', 'Age': '263379', 'Content-Length': '14212', 'Connection': 'keep-alive', 'Server': 'Netlify', 'X-NF-Request-ID': 'd26f0029-921e-4771-84fb-d1401657063b-26033572', 'expires': 'Fri, 24 Apr 2020 02:54:12 -0000', 'Warning': '110 - Automatically cached for 5 days, 0:00:00. Response might be stale'}
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://creativecommons.org/licenses/by/3.0/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule http://creativecommons.org/licenses/by/3.0/: False
INFO     pypidb._pypi:_pypi.py:451 r http://creativecommons.org/licenses/by/3.0/
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of http://creativecommons.org/licenses/by/3.0/ skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of http://creativecommons.org/licenses/by/3.0/ skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: http://creativecommons.org/licenses/by/3.0/
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: http://creativecommons.org/licenses/by/3.0/
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of http://creativecommons.org/licenses/by/3.0/ skipped
DEBUG    urllib3.connectionpool:connectionpool.py:226 Starting new HTTP connection (1): creativecommons.org:80
DEBUG    urllib3.connectionpool:connectionpool.py:433 http://creativecommons.org:80 "HEAD /licenses/by/3.0/ HTTP/1.1" 301 0
DEBUG    pypidb._adapters:_adapters.py:271 head http://creativecommons.org/licenses/by/3.0/ http://creativecommons.org/licenses/by/3.0/ <Response [301]> {'Date': 'Mon, 20 Apr 2020 21:28:00 GMT', 'Content-Type': 'text/html', 'Location': 'https://creativecommons.org/licenses/by/3.0/', 'Strict-Transport-Security': 'max-age=15768000', 'X-Content-Type-Options': 'nosniff', 'X-Frame-Options': 'deny', 'X-XSS-Protection': '1; mode=block', 'Cache-Control': 'max-age=1800', 'CF-Cache-Status': 'EXPIRED', 'Vary': 'Accept-Encoding', 'Server': 'cloudflare', 'CF-RAY': '5871f3da0f56991d-LAX', 'cf-request-id': '023b16bc470000991d1f9d5200000001', 'Connection': 'Keep-Alive', 'Set-Cookie': '__cfduid=d6a72e9dc421ecf8e6730c5b4fc29f9f61587418080; expires=Wed, 20-May-20 21:28:00 GMT; path=/; domain=.creativecommons.org; HttpOnly; SameSite=Lax', 'Content-Length': '0'} b''
INFO     https_everywhere.adapter:adapter.py:94 adapter responding to http://creativecommons.org/licenses/by/3.0/ with http://creativecommons.org/licenses/by/3.0/: {'Date': 'Mon, 20 Apr 2020 21:28:00 GMT', 'Content-Type': 'text/html', 'Location': 'https://creativecommons.org/licenses/by/3.0/', 'Strict-Transport-Security': 'max-age=15768000', 'X-Content-Type-Options': 'nosniff', 'X-Frame-Options': 'deny', 'X-XSS-Protection': '1; mode=block', 'Cache-Control': 'max-age=1800', 'CF-Cache-Status': 'EXPIRED', 'Vary': 'Accept-Encoding', 'Server': 'cloudflare', 'CF-RAY': '5871f3da0f56991d-LAX', 'cf-request-id': '023b16bc470000991d1f9d5200000001', 'Connection': 'Keep-Alive', 'Set-Cookie': '__cfduid=d6a72e9dc421ecf8e6730c5b4fc29f9f61587418080; expires=Wed, 20-May-20 21:28:00 GMT; path=/; domain=.creativecommons.org; HttpOnly; SameSite=Lax', 'Content-Length': '0'}
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://creativecommons.org/licenses/by/3.0/ elapsed 0:00:00.000059
DEBUG    pypidb._pypi:_pypi.py:460 r https://creativecommons.org/licenses/by/3.0/ headers: {}
WARNING  pypidb._pypi:_pypi.py:463 http://creativecommons.org/licenses/by/3.0/: 500 Server Error: HTTPS Everywhere for url: https://creativecommons.org/licenses/by/3.0/
DEBUG    pypidb._pypi:_pypi.py:317 processing Webpage: https://regex101.com/r/Ly7O1x/3/
DEBUG    pypidb._pypi:_pypi.py:383 @@ ran <function _url_extractor_wrapper at 0x7f641df40ca0> on text size 12664 for 7 urls !!
DEBUG    pypidb._pypi:_pypi.py:388 extracted ['http://enable-javascript.com/', 'https://github.com/firasdib/Regex101/issues', 'https://github.com/firasdib/Regex101/wiki', 'https://twitter.com/regex101', 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp', 'regex101.com', 'regex101.com/preview.png']
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: {'https://tools.ietf.org/html/rfc2119'}
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://tools.ietf.org/html/rfc2119')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:359 --none of this set: {'https://tools.ietf.org/html/rfc2119'}
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['https://tools.ietf.org/html/rfc2119']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from None added urls []
DEBUG    pypidb._pypi:_pypi.py:317 processing Webpage: http://tools.ietf.org/html/rfc2119
DEBUG    pypidb._pypi:_pypi.py:383 @@ ran <function _url_extractor_wrapper at 0x7f641df40ca0> on text size 15171 for 8 urls !!
DEBUG    pypidb._pypi:_pypi.py:388 extracted ['http://dublincore.org/documents/2008/08/04/dc-html/', 'http://purl.org/dc/elements/1.1/', 'http://www.w3.org/1999/xhtml', 'http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd', 'https://datatracker.ietf.org/doc/rfc2119', 'https://tools.ietf.org/tools/rfcmarkup/', 'https://www.rfc-editor.org/errata_search.php?rfc=2119', 'tools.ietf.org']
DEBUG    pypidb._pypi:_pypi.py:317 processing Webpage: https://regex101.com/r/vkijKf/1/
DEBUG    pypidb._pypi:_pypi.py:383 @@ ran <function _url_extractor_wrapper at 0x7f641df40ca0> on text size 12588 for 7 urls !!
DEBUG    pypidb._pypi:_pypi.py:388 extracted ['http://enable-javascript.com/', 'https://github.com/firasdib/Regex101/issues', 'https://github.com/firasdib/Regex101/wiki', 'https://twitter.com/regex101', 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp', 'regex101.com', 'regex101.com/preview.png']
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: {'https://tom.preston-werner.com/'}
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://tom.preston-werner.com/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:359 --none of this set: {'https://tom.preston-werner.com/'}
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['https://tom.preston-werner.com/']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from None added urls []
DEBUG    pypidb._pypi:_pypi.py:317 processing Webpage: http://tom.preston-werner.com
INFO     urlextract:urlextract_core.py:602 Unknown exception during gethostbyname(tomdoc.org) gaierror(-11)
DEBUG    pypidb._pypi:_pypi.py:383 @@ ran <function _url_extractor_wrapper at 0x7f641df40ca0> on text size 14206 for 67 urls !!
DEBUG    pypidb._pypi:_pypi.py:388 extracted ['@mail.ru', 'WebPulp.tv', 'github.com/mojombo', 'http://37signals.com/svn/posts/2486-bootstrapped-profitable-proud-github', 'http://bits.blogs.nytimes.com/2012/12/28/github-has-big-dreams-for-open-source-software-and-more/', 'http://blog.gravatar.com/2007/10/18/automattic-gravatar/', 'http://blogs.sitepoint.com/podcast-107-social-coding-with-githubs-tom-preston-werner/', 'http://confreaks.com/videos/712-rubyconf2011-github-flavored-ruby', 'http://confreaks.net/videos/297-lsrc2010-keynote-address', 'http://confreaks.net/videos/432-rubyconf2010-the-road-from-ruby-hacker-to-entrepreneur', 'http://developer.yahoo.com/yui/theater/video.php?v=prestonwerner-github', 'http://en.oreilly.com/rails2008/public/schedule/speaker/2520', 'http://gigaom.com/cloud/10-innovators-changing-the-game-for-internet-infrastructure/7/', 'http://images.businessweek.com/ss/09/04/0421_best_young_entrepreneurs/17.htm', 'http://lambdaphant.com/blog/interview-with-tom-preston-werner-from-github', 'http://ontwik.com/github/tom-werner-co-founder-of-github/', 'http://rubyconf2009.confreaks.com/19-nov-2009-10-25-bert-and-ernie-scaling-your-ruby-site-with-erlang-tom-preston-werner.html', 'http://sea.ucar.edu/event/unlocking-secrets-git', 'http://skillsmatter.com/podcast/agile-testing/optimizing-for-happiness', 'http://speakerdeck.com/u/mojombo/p/github-flavored-ruby', 'http://speakerdeck.com/u/mojombo/p/optimizing-for-happiness', 'http://video.mail.ru/corp/video/conference2010/12.html', 'http://vimeo.com/17118008', 'http://vimeo.com/35640883', 'http://vimeo.com/39016099', 'http://web20show.com/episodes/web20show-ep45-github', 'http://webpulp.tv/post/708686185/github-tom-preston-werner', 'http://wp-community.org/2006/12/26/episode-15-interviews-with-tom-werner-gravatar-and-jaimie-sirovich-seo-egghead/', 'http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html', 'http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner', 'http://www.flickr.com/photos/seldaek/4986315165/', 'http://www.infoq.com/interviews/erlang-and-github', 'http://www.infoq.com/interviews/preston-werner-powerset-github-ruby', 'http://www.infoq.com/presentations/preston-werner-conceptual-algorithms', 'http://www.justin.tv/c3oorg/b/272031754', 'http://www.linux-mag.com/cache/7486/1.html', 'http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/', 'http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf', 'http://www.rubyology.com/podcasts/show/66', 'http://www.rubyology.com/podcasts/show/67', 'http://www.viddler.com/explore/GreggPollack/videos/25/44', 'http://www.vimeo.com/1104583', 'http://www.w3.org/1999/xhtml', 'http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/', 'http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23', 'http://www.youtube.com/watch?v=Hi2V1x1AkD8', 'http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be', 'http://www.youtube.com/watch?v=P9jjDpWzsUI', 'http://www.youtube.com/watch?v=k2vJNNAQZlg', 'http://www.youtube.com/watch?v=weF-_dLYrzw', 'https://chatterbug.com/', 'https://feeds.feedburner.com/tom-preston-werner', 'https://github.com/', 'https://github.com/gollum/gollum/', 'https://github.com/mojombo/', 'https://github.com/mojombo/chronic/', 'https://github.com/mojombo/ernie/', 'https://github.com/mojombo/god/', 'https://github.com/mojombo/grit/', 'https://github.com/mojombo/proxymachine/', 'https://github.com/toml-lang/toml', 'https://jekyllrb.com', 'https://semver.org', 'https://twitter.com/mojombo/', 'https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner', 'https://www.wired.com/2013/09/github-for-anything/', 'twitter.com/mojombo']
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: https://regex101.com/r/Ly7O1x/3/
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://twitter.com/regex101')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/firasdib/Regex101/issues')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/firasdib/Regex101/issues')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('firasdib', 'Regex101') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'regex101.com/preview.png')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'regex101.com')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/firasdib/Regex101/wiki')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/firasdib/Regex101/wiki')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('firasdib', 'Regex101') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://enable-javascript.com/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:355 --results from {'https://twitter.com/regex101', 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp', 'https://github.com/firasdib/Regex101/issues', 'regex101.com/preview.png', 'regex101.com', 'https://github.com/firasdib/Regex101/wiki', 'http://enable-javascript.com/'}: ['https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101']
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['http://enable-javascript.com/', 'https://twitter.com/regex101', 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp', 'regex101.com', 'regex101.com/preview.png']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from https://regex101.com/r/Ly7O1x/3/ added urls ['https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity.get_best_match(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'])
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _slash_endswith at 0x7f641e407280>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _endswith at 0x7f641e407310>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _python_and_name_match at 0x7f641e4073a0>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _only_name_match at 0x7f641e407430>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function get_best_match.<locals>.<lambda> at 0x7f64160bc700>)
DEBUG    pypidb._similarity:_similarity.py:101 counter - Counter({'https://github.com/aws/aws-cdk': 2, 'https://github.com/semver/semver': 2, 'https://github.com/firasdib/Regex101': 2})
DEBUG    pypidb._similarity:__init__.py:177 Returned ['https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101'] from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:_similarity.py:244 !!!!finding similar: ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_most_similar(name = 'aws-cdk-assets', urls = {'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/aws/aws-cdk'})
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/semver/semver', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of semver semver to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.7142857142857143 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/firasdib/Regex101', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of firasdib regex101 to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6666666666666667 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:_similarity.py:118 computed similarity: {'https://github.com/semver/semver': 0.7142857142857143, 'https://github.com/firasdib/Regex101': 0.6666666666666667, 'https://github.com/aws/aws-cdk': 0.23809523809523814}
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.7142857142857143 https://github.com/semver/semver
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.6666666666666667 https://github.com/firasdib/Regex101
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.23809523809523814 https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:129 nearest_counter Counter({0.7142857142857143: 1, 0.6666666666666667: 1, 0.23809523809523814: 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity._get_most_similar
DEBUG    pypidb._similarity:_similarity.py:255 common, similar: None - https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:263 most similar w/ identical commonality: https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity.get_best_match
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://tools.ietf.org/html/rfc2119 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:427 reject rule https://tools.ietf.org/html/rfc2119: False
INFO     pypidb._pypi:_pypi.py:451 r https://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:84 cdn block of https://tools.ietf.org/html/rfc2119 skipped
DEBUG    pypidb._adapters:_adapters.py:108 domain block of https://tools.ietf.org/html/rfc2119 skipped
DEBUG    pypidb._adapters:_adapters.py:32 is_num = False; is_IP = False: https://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:42 is_IP = False: https://tools.ietf.org/html/rfc2119
DEBUG    pypidb._adapters:_adapters.py:46 IPblock of https://tools.ietf.org/html/rfc2119 skipped
DEBUG    https_everywhere.adapter:adapter.py:86 No implementation for get_redirect('https://tools.ietf.org/html/rfc2119')
DEBUG    https_everywhere.adapter:adapter.py:117 no redirection of https://tools.ietf.org/html/rfc2119 occurred
DEBUG    cachecontrol.controller:controller.py:126 Looking up "https://tools.ietf.org/html/rfc2119" in the cache
DEBUG    cachecontrol.controller:controller.py:179 Current age based on date: 153124
DEBUG    cachecontrol.controller:controller.py:201 Freshness lifetime from expires: 431945
DEBUG    cachecontrol.controller:controller.py:219 The response is "fresh", returning cached response
DEBUG    cachecontrol.controller:controller.py:220 431945 > 153124
DEBUG    pypidb._pypi:_pypi.py:455 r Response.url https://tools.ietf.org/html/rfc2119 elapsed 0:00:00.003330
DEBUG    pypidb._pypi:_pypi.py:460 r https://tools.ietf.org/html/rfc2119 headers: {'Date': 'Sun, 19 Apr 2020 02:55:01 GMT', 'Server': 'Apache/2.2.22 (Debian)', 'Content-Location': 'rfc2119.html', 'Vary': '', 'TCN': 'choice', 'Last-Modified': 'Mon, 13 Apr 2020 14:05:07 GMT', 'ETag': '"3caa6a-3b43-5a32c92014ac0;5a39b890dffe1"', 'Accept-Ranges': 'bytes', 'cache-control': 'public', 'expires': 'Fri, 24 Apr 2020 02:54:06 -0000', 'Content-Encoding': 'gzip', 'Strict-Transport-Security': 'max-age=3600', 'X-Frame-Options': 'SAMEORIGIN', 'X-Xss-Protection': '1; mode=block', 'X-Content-Type-Options': 'nosniff', 'X-Clacks-Overhead': 'GNU Terry Pratchett', 'Content-Length': '4585', 'Keep-Alive': 'timeout=5, max=100', 'Connection': 'Keep-Alive', 'Content-Type': 'text/html; charset=UTF-8', 'Warning': '110 - Automatically cached for 5 days, 0:00:00. Response might be stale'}
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: http://tools.ietf.org/html/rfc2119
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'tools.ietf.org')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://dublincore.org/documents/2008/08/04/dc-html/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://datatracker.ietf.org/doc/rfc2119')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://tools.ietf.org/tools/rfcmarkup/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://purl.org/dc/elements/1.1/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.w3.org/1999/xhtml')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://www.rfc-editor.org/errata_search.php?rfc=2119')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.rfc-editor.org/errata_search.php?rfc=2119 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:359 --none of this set: {'tools.ietf.org', 'http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd', 'http://dublincore.org/documents/2008/08/04/dc-html/', 'https://datatracker.ietf.org/doc/rfc2119', 'https://tools.ietf.org/tools/rfcmarkup/', 'http://purl.org/dc/elements/1.1/', 'http://www.w3.org/1999/xhtml', 'https://www.rfc-editor.org/errata_search.php?rfc=2119'}
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['http://dublincore.org/documents/2008/08/04/dc-html/', 'http://purl.org/dc/elements/1.1/', 'http://www.w3.org/1999/xhtml', 'http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd', 'https://datatracker.ietf.org/doc/rfc2119', 'https://tools.ietf.org/tools/rfcmarkup/', 'https://www.rfc-editor.org/errata_search.php?rfc=2119', 'tools.ietf.org']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from http://tools.ietf.org/html/rfc2119 added urls []
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: https://regex101.com/r/vkijKf/1/
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://twitter.com/regex101')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/firasdib/Regex101/issues')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'regex101.com/preview.png')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'regex101.com')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/firasdib/Regex101/wiki')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://enable-javascript.com/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:355 --results from {'https://twitter.com/regex101', 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp', 'https://github.com/firasdib/Regex101/issues', 'regex101.com/preview.png', 'regex101.com', 'https://github.com/firasdib/Regex101/wiki', 'http://enable-javascript.com/'}: ['https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101']
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['http://enable-javascript.com/', 'https://twitter.com/regex101', 'https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp', 'regex101.com', 'regex101.com/preview.png']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from https://regex101.com/r/vkijKf/1/ added urls ['https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity.get_best_match(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'])
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _slash_endswith at 0x7f641e407280>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _endswith at 0x7f641e407310>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _python_and_name_match at 0x7f641e4073a0>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function _only_name_match at 0x7f641e407430>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101'], cond = <function get_best_match.<locals>.<lambda> at 0x7f6416c60310>)
DEBUG    pypidb._similarity:_similarity.py:101 counter - Counter({'https://github.com/firasdib/Regex101': 4, 'https://github.com/aws/aws-cdk': 2, 'https://github.com/semver/semver': 2})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:_similarity.py:244 !!!!finding similar: ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_most_similar(name = 'aws-cdk-assets', urls = {'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/aws/aws-cdk'})
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/semver/semver', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of semver semver to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.7142857142857143 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/firasdib/Regex101', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of firasdib regex101 to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6666666666666667 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:_similarity.py:118 computed similarity: {'https://github.com/semver/semver': 0.7142857142857143, 'https://github.com/firasdib/Regex101': 0.6666666666666667, 'https://github.com/aws/aws-cdk': 0.23809523809523814}
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.7142857142857143 https://github.com/semver/semver
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.6666666666666667 https://github.com/firasdib/Regex101
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.23809523809523814 https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:129 nearest_counter Counter({0.7142857142857143: 1, 0.6666666666666667: 1, 0.23809523809523814: 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity._get_most_similar
DEBUG    pypidb._similarity:_similarity.py:255 common, similar: https://github.com/firasdib/Regex101 - https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:260 most common: https://github.com/firasdib/Regex101
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._similarity.get_best_match
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/firasdib/Regex101', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of firasdib regex101 to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6666666666666667 from pypidb._similarity._compute_similarity
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://tom.preston-werner.com/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://tom.preston-werner.com/
DEBUG    pypidb._pypi:_pypi.py:317 processing UrlSet: http://tom.preston-werner.com
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.youtube.com/watch?v=P9jjDpWzsUI')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=P9jjDpWzsUI subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://en.oreilly.com/rails2008/public/schedule/speaker/2520')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/mojombo/god/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/mojombo/god/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('mojombo', 'god') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/mojombo/god' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://bits.blogs.nytimes.com/2012/12/28/github-has-big-dreams-for-open-source-software-and-more/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/mojombo/chronic/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/mojombo/chronic/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('mojombo', 'chronic') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/mojombo/chronic' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://www.wired.com/2013/09/github-for-anything/')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url https://www.wired.com/2013/09/github-for-anything/ subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://semver.org')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.linux-mag.com/cache/7486/1.html')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.linux-mag.com/cache/7486/1.html subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://vimeo.com/35640883')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://confreaks.net/videos/432-rubyconf2010-the-road-from-ruby-hacker-to-entrepreneur')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://wp-community.org/2006/12/26/episode-15-interviews-with-tom-werner-gravatar-and-jaimie-sirovich-seo-egghead/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://web20show.com/episodes/web20show-ep45-github')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/mojombo/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/mojombo/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned False from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned False from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://blogs.sitepoint.com/podcast-107-social-coding-with-githubs-tom-preston-werner/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://confreaks.com/videos/712-rubyconf2011-github-flavored-ruby')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.rubyology.com/podcasts/show/66')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/66 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://37signals.com/svn/posts/2486-bootstrapped-profitable-proud-github')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://lambdaphant.com/blog/interview-with-tom-preston-werner-from-github')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://blog.gravatar.com/2007/10/18/automattic-gravatar/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://ontwik.com/github/tom-werner-co-founder-of-github/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://chatterbug.com/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.infoq.com/interviews/erlang-and-github')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/erlang-and-github subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.youtube.com/watch?v=weF-_dLYrzw')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=weF-_dLYrzw subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://gigaom.com/cloud/10-innovators-changing-the-game-for-internet-infrastructure/7/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://rubyconf2009.confreaks.com/19-nov-2009-10-25-bert-and-ernie-scaling-your-ruby-site-with-erlang-tom-preston-werner.html')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.viddler.com/explore/GreggPollack/videos/25/44')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.viddler.com/explore/GreggPollack/videos/25/44 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/toml-lang/toml')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/toml-lang/toml')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('toml-lang', 'toml') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/toml-lang/toml' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/mojombo/proxymachine/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/mojombo/proxymachine/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('mojombo', 'proxymachine') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/mojombo/proxymachine' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.flickr.com/photos/seldaek/4986315165/')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.flickr.com/photos/seldaek/4986315165/ subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.youtube.com/watch?v=Hi2V1x1AkD8')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Hi2V1x1AkD8 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/mojombo/grit/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/mojombo/grit/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('mojombo', 'grit') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/mojombo/grit' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.justin.tv/c3oorg/b/272031754')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.justin.tv/c3oorg/b/272031754 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.infoq.com/presentations/preston-werner-conceptual-algorithms')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/presentations/preston-werner-conceptual-algorithms subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'github.com/mojombo')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://twitter.com/mojombo/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'twitter.com/mojombo')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://developer.yahoo.com/yui/theater/video.php?v=prestonwerner-github')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://jekyllrb.com')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.infoq.com/interviews/preston-werner-powerset-github-ruby')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.infoq.com/interviews/preston-werner-powerset-github-ruby subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://speakerdeck.com/u/mojombo/p/github-flavored-ruby')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.youtube.com/watch?v=k2vJNNAQZlg')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=k2vJNNAQZlg subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = '@mail.ru')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/gollum/gollum/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/gollum/gollum/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('gollum', 'gollum') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/gollum/gollum' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned False from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned False from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://sea.ucar.edu/event/unlocking-secrets-git')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://skillsmatter.com/podcast/agile-testing/optimizing-for-happiness')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://video.mail.ru/corp/video/conference2010/12.html')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.rubyology.com/podcasts/show/67')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.rubyology.com/podcasts/show/67 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.vimeo.com/1104583')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.vimeo.com/1104583 subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://images.businessweek.com/ss/09/04/0421_best_young_entrepreneurs/17.htm')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://webpulp.tv/post/708686185/github-tom-preston-werner')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://feeds.feedburner.com/tom-preston-werner')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner not path web/20170225171913/https://gna.org/projects/pychart
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner not path web/20130829003029/http://code.larlet.fr/django-oauth/wiki/Home
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner not path web/2009*/http://svn.wiretooth.com/svn/open/pypicache/
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:70 url https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner not path web/20131016122638/http://vehq.ru/project/django-extended-attachments/
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/mojombo/ernie/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/mojombo/ernie/')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('mojombo', 'ernie') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/mojombo/ernie' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://www.w3.org/1999/xhtml')
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:_scm_url_cleaner.py:49 url http://www.w3.org/1999/xhtml subdomain www
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://confreaks.net/videos/297-lsrc2010-keynote-address')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://vimeo.com/17118008')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'WebPulp.tv')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://speakerdeck.com/u/mojombo/p/optimizing-for-happiness')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'http://vimeo.com/39016099')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned None from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
DEBUG    pypidb._pypi:_pypi.py:355 --results from {'http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23', 'http://www.youtube.com/watch?v=P9jjDpWzsUI', 'http://en.oreilly.com/rails2008/public/schedule/speaker/2520', 'https://github.com/mojombo/god/', 'http://bits.blogs.nytimes.com/2012/12/28/github-has-big-dreams-for-open-source-software-and-more/', 'https://github.com/mojombo/chronic/', 'https://www.wired.com/2013/09/github-for-anything/', 'https://semver.org', 'http://www.linux-mag.com/cache/7486/1.html', 'http://vimeo.com/35640883', 'http://confreaks.net/videos/432-rubyconf2010-the-road-from-ruby-hacker-to-entrepreneur', 'http://web20show.com/episodes/web20show-ep45-github', 'http://wp-community.org/2006/12/26/episode-15-interviews-with-tom-werner-gravatar-and-jaimie-sirovich-seo-egghead/', 'https://github.com/mojombo/', 'http://blogs.sitepoint.com/podcast-107-social-coding-with-githubs-tom-preston-werner/', 'http://confreaks.com/videos/712-rubyconf2011-github-flavored-ruby', 'http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf', 'http://www.rubyology.com/podcasts/show/66', 'http://37signals.com/svn/posts/2486-bootstrapped-profitable-proud-github', 'http://lambdaphant.com/blog/interview-with-tom-preston-werner-from-github', 'http://blog.gravatar.com/2007/10/18/automattic-gravatar/', 'http://ontwik.com/github/tom-werner-co-founder-of-github/', 'http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/', 'https://chatterbug.com/', 'http://www.infoq.com/interviews/erlang-and-github', 'http://www.youtube.com/watch?v=weF-_dLYrzw', 'http://gigaom.com/cloud/10-innovators-changing-the-game-for-internet-infrastructure/7/', 'http://rubyconf2009.confreaks.com/19-nov-2009-10-25-bert-and-ernie-scaling-your-ruby-site-with-erlang-tom-preston-werner.html', 'http://www.viddler.com/explore/GreggPollack/videos/25/44', 'https://github.com/toml-lang/toml', 
'https://github.com/mojombo/proxymachine/', 'http://www.flickr.com/photos/seldaek/4986315165/', 'http://www.youtube.com/watch?v=Hi2V1x1AkD8', 'https://github.com/mojombo/grit/', 'http://www.justin.tv/c3oorg/b/272031754', 'http://www.infoq.com/presentations/preston-werner-conceptual-algorithms', 'github.com/mojombo', 'https://twitter.com/mojombo/', 'twitter.com/mojombo', 'http://developer.yahoo.com/yui/theater/video.php?v=prestonwerner-github', 'http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/', 'https://jekyllrb.com', 'http://www.infoq.com/interviews/preston-werner-powerset-github-ruby', 'http://speakerdeck.com/u/mojombo/p/github-flavored-ruby', 'http://www.youtube.com/watch?v=k2vJNNAQZlg', 'http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be', '@mail.ru', 'https://github.com/gollum/gollum/', 'https://github.com/', 'http://sea.ucar.edu/event/unlocking-secrets-git', 'http://skillsmatter.com/podcast/agile-testing/optimizing-for-happiness', 'http://video.mail.ru/corp/video/conference2010/12.html', 'http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html', 'http://www.rubyology.com/podcasts/show/67', 'http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner', 'http://www.vimeo.com/1104583', 'http://images.businessweek.com/ss/09/04/0421_best_young_entrepreneurs/17.htm', 'http://webpulp.tv/post/708686185/github-tom-preston-werner', 'https://feeds.feedburner.com/tom-preston-werner', 'https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner', 'https://github.com/mojombo/ernie/', 'http://www.w3.org/1999/xhtml', 'http://confreaks.net/videos/297-lsrc2010-keynote-address', 'http://vimeo.com/17118008', 'WebPulp.tv', 'http://speakerdeck.com/u/mojombo/p/optimizing-for-happiness', 'http://vimeo.com/39016099'}: ['https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 
'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie']
DEBUG    pypidb._pypi:_pypi.py:361 queuing ['@mail.ru', 'WebPulp.tv', 'github.com/mojombo', 'http://37signals.com/svn/posts/2486-bootstrapped-profitable-proud-github', 'http://bits.blogs.nytimes.com/2012/12/28/github-has-big-dreams-for-open-source-software-and-more/', 'http://blog.gravatar.com/2007/10/18/automattic-gravatar/', 'http://blogs.sitepoint.com/podcast-107-social-coding-with-githubs-tom-preston-werner/', 'http://confreaks.com/videos/712-rubyconf2011-github-flavored-ruby', 'http://confreaks.net/videos/297-lsrc2010-keynote-address', 'http://confreaks.net/videos/432-rubyconf2010-the-road-from-ruby-hacker-to-entrepreneur', 'http://developer.yahoo.com/yui/theater/video.php?v=prestonwerner-github', 'http://en.oreilly.com/rails2008/public/schedule/speaker/2520', 'http://gigaom.com/cloud/10-innovators-changing-the-game-for-internet-infrastructure/7/', 'http://images.businessweek.com/ss/09/04/0421_best_young_entrepreneurs/17.htm', 'http://lambdaphant.com/blog/interview-with-tom-preston-werner-from-github', 'http://ontwik.com/github/tom-werner-co-founder-of-github/', 'http://rubyconf2009.confreaks.com/19-nov-2009-10-25-bert-and-ernie-scaling-your-ruby-site-with-erlang-tom-preston-werner.html', 'http://sea.ucar.edu/event/unlocking-secrets-git', 'http://skillsmatter.com/podcast/agile-testing/optimizing-for-happiness', 'http://speakerdeck.com/u/mojombo/p/github-flavored-ruby', 'http://speakerdeck.com/u/mojombo/p/optimizing-for-happiness', 'http://video.mail.ru/corp/video/conference2010/12.html', 'http://vimeo.com/17118008', 'http://vimeo.com/35640883', 'http://vimeo.com/39016099', 'http://web20show.com/episodes/web20show-ep45-github', 'http://webpulp.tv/post/708686185/github-tom-preston-werner', 'http://wp-community.org/2006/12/26/episode-15-interviews-with-tom-werner-gravatar-and-jaimie-sirovich-seo-egghead/', 'http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html', 
'http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner', 'http://www.flickr.com/photos/seldaek/4986315165/', 'http://www.infoq.com/interviews/erlang-and-github', 'http://www.infoq.com/interviews/preston-werner-powerset-github-ruby', 'http://www.infoq.com/presentations/preston-werner-conceptual-algorithms', 'http://www.justin.tv/c3oorg/b/272031754', 'http://www.linux-mag.com/cache/7486/1.html', 'http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/', 'http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf', 'http://www.rubyology.com/podcasts/show/66', 'http://www.rubyology.com/podcasts/show/67', 'http://www.viddler.com/explore/GreggPollack/videos/25/44', 'http://www.vimeo.com/1104583', 'http://www.w3.org/1999/xhtml', 'http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/', 'http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23', 'http://www.youtube.com/watch?v=Hi2V1x1AkD8', 'http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be', 'http://www.youtube.com/watch?v=P9jjDpWzsUI', 'http://www.youtube.com/watch?v=k2vJNNAQZlg', 'http://www.youtube.com/watch?v=weF-_dLYrzw', 'https://chatterbug.com/', 'https://feeds.feedburner.com/tom-preston-werner', 'https://jekyllrb.com', 'https://semver.org', 'https://twitter.com/mojombo/', 'https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner', 'https://www.wired.com/2013/09/github-for-anything/', 'twitter.com/mojombo']
INFO     pypidb._pypi:_pypi.py:365 aws-cdk.assets: from http://tom.preston-werner.com added urls ['https://github.com/gollum/gollum', 'https://github.com/mojombo/chronic', 'https://github.com/mojombo/ernie', 'https://github.com/mojombo/god', 'https://github.com/mojombo/grit', 'https://github.com/mojombo/proxymachine', 'https://github.com/toml-lang/toml']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity.get_best_match(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'])
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _slash_endswith at 0x7f641e407280>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _endswith at 0x7f641e407310>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _python_and_name_match at 0x7f641e4073a0>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _only_name_match at 0x7f641e407430>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function get_best_match.<locals>.<lambda> at 0x7f641cf5a4c0>)
DEBUG    pypidb._similarity:_similarity.py:101 counter - Counter({'https://github.com/firasdib/Regex101': 4, 'https://github.com/aws/aws-cdk': 2, 'https://github.com/semver/semver': 2, 'https://github.com/mojombo/god': 1, 'https://github.com/mojombo/chronic': 1, 'https://github.com/toml-lang/toml': 1, 'https://github.com/mojombo/proxymachine': 1, 'https://github.com/mojombo/grit': 1, 'https://github.com/gollum/gollum': 1, 'https://github.com/mojombo/ernie': 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:_similarity.py:244 !!!!finding similar: ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_most_similar(name = 'aws-cdk-assets', urls = {'https://github.com/semver/semver', 'https://github.com/mojombo/chronic', 'https://github.com/mojombo/ernie', 'https://github.com/mojombo/god', 'https://github.com/firasdib/Regex101', 'https://github.com/gollum/gollum', 'https://github.com/toml-lang/toml', 'https://github.com/aws/aws-cdk', 'https://github.com/mojombo/grit', 'https://github.com/mojombo/proxymachine'})
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/semver/semver', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of semver semver to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.7142857142857143 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/chronic', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo chronic to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6129032258064516 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/ernie', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo ernie to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6296296296296297 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/god', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo god to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/firasdib/Regex101', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of firasdib regex101 to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6666666666666667 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/gollum/gollum', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of gollum gollum to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6428571428571428 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/toml-lang/toml', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of tomllang toml to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6153846153846154 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/grit', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo grit to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6153846153846154 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/proxymachine', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo proxymachine to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6153846153846154 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:_similarity.py:118 computed similarity: {'https://github.com/semver/semver': 0.7142857142857143, 'https://github.com/mojombo/chronic': 0.6129032258064516, 'https://github.com/mojombo/ernie': 0.6296296296296297, 'https://github.com/mojombo/god': 0.6, 'https://github.com/firasdib/Regex101': 0.6666666666666667, 'https://github.com/gollum/gollum': 0.6428571428571428, 'https://github.com/toml-lang/toml': 0.6153846153846154, 'https://github.com/aws/aws-cdk': 0.23809523809523814, 'https://github.com/mojombo/grit': 0.6153846153846154, 'https://github.com/mojombo/proxymachine': 0.6153846153846154}
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.7142857142857143 https://github.com/semver/semver
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.6129032258064516 https://github.com/mojombo/chronic
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.6 https://github.com/mojombo/god
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.23809523809523814 https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:129 nearest_counter Counter({0.6153846153846154: 3, 0.7142857142857143: 1, 0.6129032258064516: 1, 0.6296296296296297: 1, 0.6: 1, 0.6666666666666667: 1, 0.6428571428571428: 1, 0.23809523809523814: 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity._get_most_similar
DEBUG    pypidb._similarity:_similarity.py:255 common, similar: https://github.com/firasdib/Regex101 - https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:260 most common: https://github.com/firasdib/Regex101
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._similarity.get_best_match
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/firasdib/Regex101', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of firasdib regex101 to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6666666666666667 from pypidb._similarity._compute_similarity
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://twitter.com/regex101 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://twitter.com/regex101
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: regex101.com/preview.png @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: regex101.com/preview.png
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: regex101.com @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: regex101.com
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://enable-javascript.com/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://enable-javascript.com/
DEBUG    pypidb._pypi:_pypi.py:317 processing Webpage: https://tools.ietf.org/html/rfc2119
INFO     pypidb._pypi:_pypi.py:373 Not processing text from https://tools.ietf.org/html/rfc2119
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: tools.ietf.org @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: tools.ietf.org
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://dublincore.org/documents/2008/08/04/dc-html/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://dublincore.org/documents/2008/08/04/dc-html/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://datatracker.ietf.org/doc/rfc2119 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://datatracker.ietf.org/doc/rfc2119
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://tools.ietf.org/tools/rfcmarkup/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://tools.ietf.org/tools/rfcmarkup/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://purl.org/dc/elements/1.1/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://purl.org/dc/elements/1.1/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.w3.org/1999/xhtml @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.w3.org/1999/xhtml
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://www.rfc-editor.org/errata_search.php?rfc=2119 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://www.rfc-editor.org/errata_search.php?rfc=2119
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://twitter.com/regex101 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://twitter.com/regex101
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://www.paypal.com/cgi-bin/webscr?cmd=_donations&amp
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: regex101.com/preview.png @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: regex101.com/preview.png
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: regex101.com @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: regex101.com
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://enable-javascript.com/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://enable-javascript.com/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.youtube.com/watch?v=7DoB0SCUtOk&list=SP055Epbe6d5aclKNAa8msO1VvDOJ8sYlS&index=23
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.youtube.com/watch?v=P9jjDpWzsUI @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.youtube.com/watch?v=P9jjDpWzsUI
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://en.oreilly.com/rails2008/public/schedule/speaker/2520 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://en.oreilly.com/rails2008/public/schedule/speaker/2520
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://bits.blogs.nytimes.com/2012/12/28/github-has-big-dreams-for-open-source-software-and-more/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://bits.blogs.nytimes.com/2012/12/28/github-has-big-dreams-for-open-source-software-and-more/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://www.wired.com/2013/09/github-for-anything/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://www.wired.com/2013/09/github-for-anything/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://semver.org @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://semver.org
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.linux-mag.com/cache/7486/1.html @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.linux-mag.com/cache/7486/1.html
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://vimeo.com/35640883 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://vimeo.com/35640883
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://wp-community.org/2006/12/26/episode-15-interviews-with-tom-werner-gravatar-and-jaimie-sirovich-seo-egghead/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://wp-community.org/2006/12/26/episode-15-interviews-with-tom-werner-gravatar-and-jaimie-sirovich-seo-egghead/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://confreaks.net/videos/432-rubyconf2010-the-road-from-ruby-hacker-to-entrepreneur @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://confreaks.net/videos/432-rubyconf2010-the-road-from-ruby-hacker-to-entrepreneur
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://web20show.com/episodes/web20show-ep45-github @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://web20show.com/episodes/web20show-ep45-github
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://confreaks.com/videos/712-rubyconf2011-github-flavored-ruby @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://confreaks.com/videos/712-rubyconf2011-github-flavored-ruby
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://blogs.sitepoint.com/podcast-107-social-coding-with-githubs-tom-preston-werner/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://blogs.sitepoint.com/podcast-107-social-coding-with-githubs-tom-preston-werner/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.mindmeister.com/fr/maps/show_public/60955251?title=tom-preston-schnitzelconf
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.rubyology.com/podcasts/show/66 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.rubyology.com/podcasts/show/66
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://37signals.com/svn/posts/2486-bootstrapped-profitable-proud-github @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://37signals.com/svn/posts/2486-bootstrapped-profitable-proud-github
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://lambdaphant.com/blog/interview-with-tom-preston-werner-from-github @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://lambdaphant.com/blog/interview-with-tom-preston-werner-from-github
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://blog.gravatar.com/2007/10/18/automattic-gravatar/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://blog.gravatar.com/2007/10/18/automattic-gravatar/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://ontwik.com/github/tom-werner-co-founder-of-github/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://ontwik.com/github/tom-werner-co-founder-of-github/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.wired.com/wiredenterprise/2013/09/github-for-anything/all/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://chatterbug.com/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://chatterbug.com/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.infoq.com/interviews/erlang-and-github @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.infoq.com/interviews/erlang-and-github
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.youtube.com/watch?v=weF-_dLYrzw @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.youtube.com/watch?v=weF-_dLYrzw
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://gigaom.com/cloud/10-innovators-changing-the-game-for-internet-infrastructure/7/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://gigaom.com/cloud/10-innovators-changing-the-game-for-internet-infrastructure/7/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://rubyconf2009.confreaks.com/19-nov-2009-10-25-bert-and-ernie-scaling-your-ruby-site-with-erlang-tom-preston-werner.html @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://rubyconf2009.confreaks.com/19-nov-2009-10-25-bert-and-ernie-scaling-your-ruby-site-with-erlang-tom-preston-werner.html
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.viddler.com/explore/GreggPollack/videos/25/44 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.viddler.com/explore/GreggPollack/videos/25/44
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: github.com/mojombo @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: github.com/mojombo
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: twitter.com/mojombo @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: twitter.com/mojombo
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.flickr.com/photos/seldaek/4986315165/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.flickr.com/photos/seldaek/4986315165/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.youtube.com/watch?v=Hi2V1x1AkD8 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.youtube.com/watch?v=Hi2V1x1AkD8
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.justin.tv/c3oorg/b/272031754 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.justin.tv/c3oorg/b/272031754
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.infoq.com/presentations/preston-werner-conceptual-algorithms @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.infoq.com/presentations/preston-werner-conceptual-algorithms
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://twitter.com/mojombo/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://twitter.com/mojombo/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://developer.yahoo.com/yui/theater/video.php?v=prestonwerner-github @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://developer.yahoo.com/yui/theater/video.php?v=prestonwerner-github
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/ @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.maestrosdelweb.com/editorial/tom-preston-werner-fundador-de-github/
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://jekyllrb.com @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://jekyllrb.com
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.infoq.com/interviews/preston-werner-powerset-github-ruby @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.infoq.com/interviews/preston-werner-powerset-github-ruby
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://speakerdeck.com/u/mojombo/p/github-flavored-ruby @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://speakerdeck.com/u/mojombo/p/github-flavored-ruby
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.youtube.com/watch?v=k2vJNNAQZlg @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.youtube.com/watch?v=k2vJNNAQZlg
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.youtube.com/watch?v=Ln-B_fs9QMY&feature=youtu.be
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: @mail.ru @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: @mail.ru
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://sea.ucar.edu/event/unlocking-secrets-git @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://sea.ucar.edu/event/unlocking-secrets-git
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://skillsmatter.com/podcast/agile-testing/optimizing-for-happiness @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://skillsmatter.com/podcast/agile-testing/optimizing-for-happiness
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://video.mail.ru/corp/video/conference2010/12.html @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://video.mail.ru/corp/video/conference2010/12.html
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.blognewcomb.com/blog/2007/08/powerset_interview_with_tom_pr.html
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.rubyology.com/podcasts/show/67 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.rubyology.com/podcasts/show/67
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.erlang-factory.com/conference/SFBayAreaErlangFactory2009/speakers/TomPrestonWerner
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.vimeo.com/1104583 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.vimeo.com/1104583
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://images.businessweek.com/ss/09/04/0421_best_young_entrepreneurs/17.htm @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://images.businessweek.com/ss/09/04/0421_best_young_entrepreneurs/17.htm
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://webpulp.tv/post/708686185/github-tom-preston-werner @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://webpulp.tv/post/708686185/github-tom-preston-werner
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://feeds.feedburner.com/tom-preston-werner @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://feeds.feedburner.com/tom-preston-werner
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: https://web.archive.org/web/20061214040137/http://joshuaink2006.johnoxton.co.uk/blog/240/seven-and-a-half-questions-for-tom-werner
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://www.w3.org/1999/xhtml @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://www.w3.org/1999/xhtml
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://confreaks.net/videos/297-lsrc2010-keynote-address @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://confreaks.net/videos/297-lsrc2010-keynote-address
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://vimeo.com/17118008 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://vimeo.com/17118008
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: WebPulp.tv @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: WebPulp.tv
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://speakerdeck.com/u/mojombo/p/optimizing-for-happiness @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://speakerdeck.com/u/mojombo/p/optimizing-for-happiness
DEBUG    pypidb._pypi:_pypi.py:317 processing Url: http://vimeo.com/39016099 @ Unknown
DEBUG    pypidb._pypi:_pypi.py:416 queue loop skipping >5: http://vimeo.com/39016099
DEBUG    pypidb._pypi:_pypi.py:503 fetched ['https://semver.org/', 'https://regex101.com/r/Ly7O1x/3/', 'http://tools.ietf.org/html/rfc2119', 'https://regex101.com/r/vkijKf/1/', 'http://tom.preston-werner.com', 'https://tools.ietf.org/html/rfc2119']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity.get_best_match(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'])
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _slash_endswith at 0x7f641e407280>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _endswith at 0x7f641e407310>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _python_and_name_match at 0x7f641e4073a0>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function _only_name_match at 0x7f641e407430>)
DEBUG    pypidb._similarity:__init__.py:177 Returned None from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_weighted_best(name = 'aws-cdk-assets', urls = ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie'], cond = <function get_best_match.<locals>.<lambda> at 0x7f641605f670>)
DEBUG    pypidb._similarity:_similarity.py:101 counter - Counter({'https://github.com/firasdib/Regex101': 4, 'https://github.com/aws/aws-cdk': 2, 'https://github.com/semver/semver': 2, 'https://github.com/mojombo/god': 1, 'https://github.com/mojombo/chronic': 1, 'https://github.com/toml-lang/toml': 1, 'https://github.com/mojombo/proxymachine': 1, 'https://github.com/mojombo/grit': 1, 'https://github.com/gollum/gollum': 1, 'https://github.com/mojombo/ernie': 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._similarity._get_weighted_best
DEBUG    pypidb._similarity:_similarity.py:244 !!!!finding similar: ['https://github.com/aws/aws-cdk', 'https://github.com/aws/aws-cdk', 'https://github.com/semver/semver', 'https://github.com/semver/semver', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/firasdib/Regex101', 'https://github.com/mojombo/god', 'https://github.com/mojombo/chronic', 'https://github.com/toml-lang/toml', 'https://github.com/mojombo/proxymachine', 'https://github.com/mojombo/grit', 'https://github.com/gollum/gollum', 'https://github.com/mojombo/ernie']
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._get_most_similar(name = 'aws-cdk-assets', urls = {'https://github.com/semver/semver', 'https://github.com/mojombo/chronic', 'https://github.com/mojombo/ernie', 'https://github.com/mojombo/god', 'https://github.com/firasdib/Regex101', 'https://github.com/gollum/gollum', 'https://github.com/toml-lang/toml', 'https://github.com/aws/aws-cdk', 'https://github.com/mojombo/grit', 'https://github.com/mojombo/proxymachine'})
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/semver/semver', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of semver semver to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.7142857142857143 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/chronic', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo chronic to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6129032258064516 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/ernie', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo ernie to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6296296296296297 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/god', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo god to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/firasdib/Regex101', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of firasdib regex101 to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6666666666666667 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/gollum/gollum', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of gollum gollum to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6428571428571428 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/toml-lang/toml', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of tomllang toml to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6153846153846154 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/aws/aws-cdk', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of aws awscdk to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.23809523809523814 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/grit', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo grit to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6153846153846154 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:__init__.py:157 Calling pypidb._similarity._compute_similarity(name = 'aws-cdk-assets', url = 'https://github.com/mojombo/proxymachine', algo = <bound method _Distance.dist of <abydos.distance._ratcliff_obershelp.RatcliffObershelp object at 0x7f642a746430>>, comp_op = <built-in function min>)
DEBUG    pypidb._similarity:_similarity.py:44 _similarity of mojombo proxymachine to awscdkassets
DEBUG    pypidb._similarity:__init__.py:177 Returned 0.6153846153846154 from pypidb._similarity._compute_similarity
DEBUG    pypidb._similarity:_similarity.py:118 computed similarity: {'https://github.com/semver/semver': 0.7142857142857143, 'https://github.com/mojombo/chronic': 0.6129032258064516, 'https://github.com/mojombo/ernie': 0.6296296296296297, 'https://github.com/mojombo/god': 0.6, 'https://github.com/firasdib/Regex101': 0.6666666666666667, 'https://github.com/gollum/gollum': 0.6428571428571428, 'https://github.com/toml-lang/toml': 0.6153846153846154, 'https://github.com/aws/aws-cdk': 0.23809523809523814, 'https://github.com/mojombo/grit': 0.6153846153846154, 'https://github.com/mojombo/proxymachine': 0.6153846153846154}
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.7142857142857143 https://github.com/semver/semver
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.6129032258064516 https://github.com/mojombo/chronic
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.6 https://github.com/mojombo/god
DEBUG    pypidb._similarity:_similarity.py:126 nearest: 0.23809523809523814 https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:129 nearest_counter Counter({0.6153846153846154: 3, 0.7142857142857143: 1, 0.6129032258064516: 1, 0.6296296296296297: 1, 0.6: 1, 0.6666666666666667: 1, 0.6428571428571428: 1, 0.23809523809523814: 1})
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/aws/aws-cdk' from pypidb._similarity._get_most_similar
DEBUG    pypidb._similarity:_similarity.py:255 common, similar: https://github.com/firasdib/Regex101 - https://github.com/aws/aws-cdk
DEBUG    pypidb._similarity:_similarity.py:260 most common: https://github.com/firasdib/Regex101
DEBUG    pypidb._similarity:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._similarity.get_best_match
DEBUG    pypidb._pypi:_pypi.py:570 _get_vcs_links(aws-cdk.assets) returned .. https://github.com/firasdib/Regex101
DEBUG    pypidb._pypi:_pypi.py:584 aws-cdk.assets: found https://github.com/firasdib/Regex101
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner.SCMURLCleaner.get_root(self = <pypidb._scm_url_cleaner.SCMURLCleaner object at 0x7f64161250d0>, url = 'https://github.com/firasdib/Regex101')
DEBUG    pypidb._scm_url_cleaner:__init__.py:157 Calling pypidb._scm_url_cleaner._github(url = 'https://github.com/firasdib/Regex101')
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned ('firasdib', 'Regex101') from pypidb._scm_url_cleaner._github
DEBUG    pypidb._scm_url_cleaner:__init__.py:177 Returned 'https://github.com/firasdib/Regex101' from pypidb._scm_url_cleaner.SCMURLCleaner.get_root
INFO     pypidb._db:_db.py:79 Adding mapping aws-cdk.assets = https://github.com/firasdib/Regex101
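
Reading the tail of the log above: the frequency count favours `https://github.com/firasdib/Regex101` (4 occurrences, harvested from the regex101.com pages) over `https://github.com/aws/aws-cdk` (2 occurrences), even though `_get_most_similar` returned `aws-cdk` with the smallest distance (0.238). The final result was the most common URL, not the most similar one. The snippet below is only a minimal sketch that replays those two steps on the values copied from the log. It is not pypidb's implementation, and the variable names are hypothetical.

```python
from collections import Counter

# Candidate URLs exactly as passed to get_best_match in the log.
urls = (
    ["https://github.com/aws/aws-cdk"] * 2
    + ["https://github.com/semver/semver"] * 2
    + ["https://github.com/firasdib/Regex101"] * 4
    + [
        "https://github.com/mojombo/god",
        "https://github.com/mojombo/chronic",
        "https://github.com/toml-lang/toml",
        "https://github.com/mojombo/proxymachine",
        "https://github.com/mojombo/grit",
        "https://github.com/gollum/gollum",
        "https://github.com/mojombo/ernie",
    ]
)

# Distances copied from the "computed similarity" log line (lower is closer).
distances = {
    "https://github.com/semver/semver": 0.7142857142857143,
    "https://github.com/mojombo/chronic": 0.6129032258064516,
    "https://github.com/mojombo/ernie": 0.6296296296296297,
    "https://github.com/mojombo/god": 0.6,
    "https://github.com/firasdib/Regex101": 0.6666666666666667,
    "https://github.com/gollum/gollum": 0.6428571428571428,
    "https://github.com/toml-lang/toml": 0.6153846153846154,
    "https://github.com/aws/aws-cdk": 0.23809523809523814,
    "https://github.com/mojombo/grit": 0.6153846153846154,
    "https://github.com/mojombo/proxymachine": 0.6153846153846154,
}

# Step 1: the most frequent candidate (matches the "most common" log line).
most_common = Counter(urls).most_common(1)[0][0]

# Step 2: the candidate with the smallest distance to the package name
# (matches what _get_most_similar returned in the log).
most_similar = min(distances, key=distances.get)

print("most common: ", most_common)
print("most similar:", most_similar)
```

The two steps disagree here, and the repeated regex101.com links are what tip the count. Any fix would presumably need to stop a single crawled third-party page from contributing several votes, or weigh the similarity result more heavily when it is this far ahead of the alternatives.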