issues
search
lipoja
/
URLExtract
URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.
MIT License
239
stars
61
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add `py.typed` marker to source and package
#164
Daverball
opened
3 months ago
0
[Errno 11002] Temporary failure in name resolution after using URLExtract
#163
jackjyq
opened
4 months ago
1
Release v1.9.0
#162
lipoja
closed
4 months ago
0
Fixing parsing of Markdown links
#161
lipoja
closed
4 months ago
0
Adding support for Python 3.12
#160
lipoja
closed
4 months ago
0
Accepting only ASCII characters left from TLD.
#159
lipoja
closed
4 months ago
0
Fixing filter of mixed case hostnames
#158
lipoja
closed
4 months ago
0
red flag from antiviruses
#157
Khoshbayani
opened
6 months ago
2
Invalid URLs accepted with subdomains
#156
carton-of-mice
opened
7 months ago
0
Wrong indices and repeated matches when hostname contains the TLD
#155
carton-of-mice
opened
8 months ago
0
Support for private/reserved/custom TLDs
#154
carton-of-mice
opened
8 months ago
0
Support non-unicode hostname
#153
frankdilo
opened
9 months ago
3
Extracting Markdown Text, doesn't process escaped \\ correctly
#152
kevintxu
closed
4 months ago
1
Bug with flag `allow_mixed_case_hostname=False`
#151
GokulNC
closed
4 months ago
4
Update shield images
#150
lipoja
closed
1 year ago
0
Release 1.8.0
#149
lipoja
closed
1 year ago
0
Adding ability to filter out mixed case host-names
#148
lipoja
closed
1 year ago
0
Fixing typos in strings
#147
lipoja
closed
1 year ago
0
Adding the ability to set stop characters inside of a scheme
#146
lipoja
closed
1 year ago
0
Update github action triggers for tox test
#145
lipoja
closed
1 year ago
0
Unable to detect t.me links
#144
WbMarker
closed
1 year ago
1
Handle upper-case false positives
#143
GokulNC
closed
1 year ago
9
Wrong indices and incomplete extraction when string contains similar urls
#142
variablenerd
closed
4 months ago
1
Add Python 3.11
#141
elliotwutingfeng
closed
1 year ago
0
Fix index issue with uppercase characters in domain names
#140
iwangpeng
closed
1 year ago
2
Release v1.7.1
#139
lipoja
closed
1 year ago
0
Check if url_parts.authority is not NoneType
#138
lipoja
closed
1 year ago
0
urlextract without authority causes AttributeError
#137
seanbreckenridge
closed
1 year ago
2
urlextract v1.7.0
#136
lipoja
closed
1 year ago
0
Correct handling when authority starts with @ symbol
#135
lipoja
closed
1 year ago
0
Remove unreserved characters from the beginning of found URL
#134
lipoja
closed
1 year ago
0
Fixing mypy checks
#133
lipoja
closed
1 year ago
0
Add type hints
#132
mimi89999
closed
1 year ago
1
Does Not extract the URL that is leading special character
#131
praneethpj
closed
1 year ago
0
ERROR: Can not download list of TLDs. (URLError: [Errno 104] Connection reset by peer)
#130
koliaok
closed
1 year ago
2
URLExtract() init really slow
#129
gilbd
opened
2 years ago
0
Issue 124: Remove travis-ci in the README
#128
za
closed
2 years ago
0
Release 1.6.0
#127
lipoja
closed
2 years ago
0
Updating tests with correct order of actual == expected
#126
lipoja
closed
2 years ago
0
Add a list of URLs allowed to extract
#125
khoben
closed
2 years ago
1
travis-ci seems no longer active repository
#124
za
closed
2 years ago
2
comma extracted at the end if url ends with comma
#123
amoldavsky
closed
4 months ago
3
should not grab email fragments
#122
amoldavsky
closed
1 year ago
1
left walk does not stop on various unicode chars
#121
amoldavsky
closed
4 months ago
1
fix-multiple-protocols-in-url
#120
amoldavsky
closed
1 year ago
3
IPv6?
#119
larseggert
opened
2 years ago
1
Passing custom cache_dir doesnt seem to actually save the tlds...txt file in that dir
#118
Yossi
opened
2 years ago
0
Wrong indices with uppercase characters in domain name
#117
tkrissuu
closed
1 year ago
1
TLD cache filelock error on read-only systems
#116
LaundroMat
opened
2 years ago
12
URLExtract no longer support Python 3.6 because of filelock recent changes
#115
za
opened
2 years ago
0
Next