issues
search
lipoja
/
URLExtract
URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.
MIT License
241
stars
61
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
IPv6?
#119
larseggert
opened
2 years ago
1
Passing custom cache_dir doesnt seem to actually save the tlds...txt file in that dir
#118
Yossi
opened
2 years ago
0
Wrong indices with uppercase characters in domain name
#117
tkrissuu
closed
1 year ago
1
TLD cache filelock error on read-only systems
#116
LaundroMat
opened
2 years ago
12
URLExtract no longer support Python 3.6 because of filelock recent changes
#115
za
opened
2 years ago
1
feat(typing): add typing
#114
georgettica
closed
1 year ago
8
add types to urlextract
#113
georgettica
closed
1 year ago
3
Version 1.5.0
#112
lipoja
closed
2 years ago
0
Update changelog and TLDs
#111
lipoja
closed
2 years ago
0
Fix incorrect indices when TLD is found twice
#110
lipoja
closed
2 years ago
0
Wrong indices when the domain name contains the same TLD twice
#109
tkrissuu
closed
2 years ago
3
Adding flake8 to tox
#108
lipoja
closed
2 years ago
0
Can't run test code: ModuleNotFoundError: No module named 'platformdirs'
#107
za
closed
2 years ago
2
Replace unmaintained appdirs with maintained platformdirs
#106
hugovk
closed
2 years ago
1
(style): initial commit formatting using black
#105
za
closed
2 years ago
21
(style): run isort on urlextract_core.py
#104
za
closed
2 years ago
2
(docs): update README and add docs how to run test code
#103
za
closed
2 years ago
3
Version 1.4.0
#102
lipoja
closed
2 years ago
0
Moving tests of has_url from doc-string to separate file
#101
lipoja
closed
2 years ago
0
Adding support to filter URLs with schema only
#100
lipoja
closed
2 years ago
0
Add support for py3.10
#99
lipoja
closed
2 years ago
0
Detect URLs starting with '//'
#98
lipoja
closed
2 years ago
0
dont url
#97
NeilRiver
closed
2 years ago
2
URL containing space is truncated
#95
begunrom
closed
2 years ago
3
//www.google.com cannot find such type of links
#94
akshayanandraut
closed
2 years ago
3
Adding python 3.10 to test
#93
lipoja
closed
3 years ago
0
move dns checking to dedicated class and add concurrency
#92
nicolasassi
opened
3 years ago
9
check dns concurrently to speed up lookup
#91
nicolasassi
opened
3 years ago
8
ossar test
#90
lipoja
closed
1 year ago
0
Adding CodeQL security scan
#89
lipoja
closed
3 years ago
0
Fixing caching
#87
lipoja
closed
3 years ago
0
Fixes RE for IPv4 addresses
#86
kak-bo-che
closed
3 years ago
1
Upgrade to GitHub-native Dependabot
#85
dependabot-preview[bot]
closed
3 years ago
0
Best practices for using a URLExtract object for speed?
#84
dfrankow
closed
3 years ago
5
GitHub actions
#83
lipoja
closed
3 years ago
0
URL Detection Problem
#82
ghost
closed
1 year ago
6
Adding skipping of whitespace before URL inside parentheses (issue #77)
#81
lipoja
closed
3 years ago
0
Removing deprecated methods `get_stop_chars`, `set_stop_chars`
#80
lipoja
closed
3 years ago
0
Adding case insensitive detection of TLDs (#76)
#79
lipoja
closed
3 years ago
0
Updating TLDs and realease
#78
lipoja
closed
3 years ago
0
Parenthesis in found urls
#77
javad94
closed
3 years ago
2
Case sensitivity in detecting URLs
#76
philshem
closed
3 years ago
2
Failing DNS tests
#75
lipoja
closed
3 years ago
1
Indices of found URLs
#74
BenoitTS
closed
3 years ago
1
Fix some inverted 'f's
#73
Yossi
closed
3 years ago
1
SyntaxError: (unicode error)
#72
bmfirst
closed
3 years ago
4
Indexes of found URLs
#71
javad94
closed
3 years ago
5
Bugfix for issue #41
#70
ghost
closed
4 years ago
1
Maximum results
#69
jayvdb
closed
4 years ago
2
pypidb issues
#68
jayvdb
opened
4 years ago
0
Previous
Next