issues
search
impresso
/
impresso-pycommons
Python module with bits of code (objects, functions) highly reusable within impresso.
http://impresso-pycommons.rtfd.io/
GNU Affero General Public License v3.0
3
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[rebuild] Extremely long articles
#42
mromanello
closed
5 years ago
1
[rebuild] add ability to filter by language
#41
mromanello
closed
5 years ago
0
missing text on articles spanning several pages
#40
e-maud
closed
5 years ago
1
[rebuild] solr rebuild of luxwort fails
#39
mromanello
closed
5 years ago
0
version 0.9.x
#38
mromanello
closed
5 years ago
0
first attempt to correct box coord issue #34
#37
e-maud
closed
5 years ago
0
[rebuild] fix hyphenation of text at region boundaries
#36
mromanello
opened
5 years ago
0
[uima] generate data for image layer
#35
mromanello
closed
5 years ago
0
[rebuild] implement rebuild for passim format
#34
mromanello
closed
5 years ago
0
[rebuild] repeated article parts
#33
mromanello
closed
4 years ago
3
[rebuild] adapt to compressed storage format for canonical data
#32
mromanello
closed
5 years ago
0
[rebuild] remove JSON schemas
#31
mromanello
closed
5 years ago
0
[rebuild] token length has values < 1
#30
mromanello
opened
5 years ago
0
S3 filter timebucket
#29
e-maud
closed
5 years ago
0
Serialization of rebuilt in UIMA format
#28
mromanello
closed
5 years ago
3
Config
#27
e-maud
closed
5 years ago
0
[rebuild] rebuild also advertisements (not only articles)
#26
mromanello
closed
5 years ago
0
S3 partition
#25
e-maud
closed
5 years ago
0
rebuild for NZZ data
#24
simon-clematide
closed
5 years ago
20
enable S3 access directly with dask
#23
e-maud
closed
5 years ago
6
[rebuild] for each article, add a field abt coordinate conversion
#22
mromanello
closed
6 years ago
0
[rebuild] handle `EndpointConnectionError`
#21
mromanello
closed
5 years ago
1
function to read rebuilt
#20
e-maud
closed
5 years ago
1
Cleanup
#19
e-maud
closed
6 years ago
1
Rebuild schema
#18
mromanello
closed
6 years ago
1
common command line tools for development
#17
simon-clematide
closed
5 years ago
1
manifest file for a collection
#16
simon-clematide
closed
5 years ago
4
[rebuild] finalize and validate JSON schema
#15
mromanello
closed
5 years ago
5
use `get_s3_resource` (boto3) instead of `get_bucket`
#14
mromanello
closed
6 years ago
0
Text rebuilder
#13
mromanello
closed
6 years ago
0
clean up S3-related functions in path/path_s3.py
#12
mromanello
closed
6 years ago
0
refactor dask logic: from futures to bags
#11
mromanello
closed
6 years ago
1
add a CLI for `text_rebuilder`
#10
mromanello
closed
5 years ago
0
package JSON files for S3 in a compressed way
#9
mromanello
closed
6 years ago
0
boto3 updates
#8
e-maud
opened
6 years ago
1
generate proper documentation
#7
e-maud
closed
5 years ago
0
remove extension from page id
#6
mromanello
closed
6 years ago
0
[rebuild] fields to add
#5
mromanello
closed
6 years ago
18
check encoding of JSON output
#4
mromanello
closed
6 years ago
2
organisation of s3
#3
mromanello
closed
5 years ago
5
"incompatible resolution" errors
#2
mromanello
opened
6 years ago
0
homogeneize detect methods
#1
e-maud
opened
6 years ago
0
Previous