CambioML
/
uniflow-llm-based-pdf-extraction-text-cleaning-data-clustering
LLM-based text extraction from unstructured data like PDFs, Words and HTMLs. Transform and cluster the text into your desired format. Less information loss, more interpretation, and faster R&D!
https://www.cambioml.com
Apache License 2.0
187
stars
56
forks
issues
Define the number of Questions-Answers pairs and q4 or q8 quantization
#238
Chasapas
opened
3 months ago
0
OSError: 0.1.0-small is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
#237
C0casio45
opened
3 months ago
0
Refactor to support multi flow configuration
#236
CambioML
opened
5 months ago
0
feat: WIP news feed and report generation
#235
CallmeNafiy
opened
5 months ago
0
feat: Pdf extractor
#234
CallmeNafiy
closed
6 months ago
0
Customized splitter config
#233
SayaZhang
closed
7 months ago
1
Bump up version to 0.0.31
#232
goldmermaid
closed
7 months ago
0
Encapsulated model output
#231
CallmeNafiy
opened
7 months ago
0
Gemma support new
#230
ZHIHANCHEN03
opened
7 months ago
0
Gemma support new
#229
ZHIHANCHEN03
closed
7 months ago
0
update transform and rater
#228
EdTeng1
opened
7 months ago
0
Bug: RecursiveSplitter removes all spaces
#227
vicshi06
opened
7 months ago
1
Request: Customized Chunk Size for RecursiveSplitter
#226
vicshi06
opened
7 months ago
0
In extract_pdf_nougat_qa.ipynb, ExtractClient(config) gets 'NoneType' object error
#225
larryyin
opened
8 months ago
0
Add summary prompt and bump version to 0.0.30
#224
goldmermaid
closed
8 months ago
0
add Documentation Github Issue Template
#223
jojortz
closed
8 months ago
0
Add Feature Request and Questions issues
#222
jojortz
closed
8 months ago
0
add bug report ISSUE_TEMPLATE
#221
jojortz
closed
8 months ago
0
Add auto splitter advanced for huggingface config
#220
ZHIHANCHEN03
closed
8 months ago
1
Bump up version to 0.0.29
#219
goldmermaid
closed
8 months ago
0
Fix autoflake failure case and update pre-commit to run unittests
#218
goldmermaid
closed
8 months ago
0
Bump up version to 0.0.28
#217
goldmermaid
closed
8 months ago
0
Add Gemma Support
#216
ZHIHANCHEN03
closed
7 months ago
0
Paper Comparison Summary Flow
#215
CallmeNafiy
closed
7 months ago
0
Bump up version to 0.0.27
#214
goldmermaid
closed
8 months ago
1
polish gmail filter notebook
#213
goldmermaid
closed
8 months ago
1
Refinement: setting batch_size for different models
#212
riboyuan99
closed
8 months ago
5
transform and rater tests
#211
EdTeng1
closed
7 months ago
2
Update gmail filter notebook
#210
goldmermaid
closed
8 months ago
0
Add google workspace email filter uniflow application
#209
goldmermaid
closed
8 months ago
0
TransformAzureOpenAI Implementation
#208
frank-suwen
closed
8 months ago
1
added crop labeling example using google multimodal flow using gemini-vision
#207
boqiny
closed
8 months ago
1
Add application folder for flow and tests.
#206
goldmermaid
closed
8 months ago
0
Add `.gitattributes`
#205
goldmermaid
closed
8 months ago
0
Use LLM to Auto Write Example TOC Readme
#204
goldmermaid
closed
8 months ago
0
Bump up version to 0.0.26
#203
goldmermaid
closed
8 months ago
0
Remove duplicated notebooks
#202
goldmermaid
closed
8 months ago
0
Added download entry for neuron model with batch_size = 8 and benchmarked neuron models with batch_size = 1,2,4,8
#201
riboyuan99
closed
8 months ago
1
Long text splitter
#200
ZHIHANCHEN03
closed
8 months ago
2
unittest_load
#199
Real3Lee
opened
8 months ago
0
finish test_extract_txt_flow
#198
jli943
closed
8 months ago
1
Bump up version to 0.0.25
#197
goldmermaid
closed
8 months ago
0
Remove duplicated notebooks
#196
goldmermaid
closed
8 months ago
0
Refactor pipeline class
#195
goldmermaid
closed
8 months ago
0
Polish Readme with the latest features
#194
goldmermaid
closed
8 months ago
0
Remove 0.1.0-small
#193
jojortz
closed
8 months ago
0
Add TransformOp, update it instantiation into ExtractHTMLFlow to add post_extract_op, update notebook
#192
goldmermaid
closed
8 months ago
1
Fix html parser duplicate content
#191
SayaZhang
closed
8 months ago
1
Add a web summary example
#190
goldmermaid
closed
8 months ago
1
Bump up version to 0.0.24
#189
goldmermaid
closed
9 months ago
0