René's URL Explorer Experiment


Title: unstructured-data · GitHub Topics · GitHub

Open Graph Title: Build software better, together

X Title: GitHub

Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

Open Graph Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

X Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

Opengraph URL: https://github.com

X: github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/topics/:topic_name(.:format)
route-controllertopics
route-actionshow
fetch-noncev2:f35562b0-d8bc-ba88-c035-dca2fc70e069
current-catalog-service-hash82c569b93da5c18ed649ebd4c2c79437db4611a6a1373e805a3cb001c64130b7
request-idB7B4:319547:39BBC47:4BB5353:696BA6BF
html-safe-nonce2731e2633b7afec62c97440b14a31517e2ded1d74ff0c0b19ef0f416cf5d1cda
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJCN0I0OjMxOTU0NzozOUJCQzQ3OjRCQjUzNTM6Njk2QkE2QkYiLCJ2aXNpdG9yX2lkIjoiMzczOTMzMDA1NDQ0OTc2ODEyNyIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac8d6c643ab8ed40726e08864ae3c3092dcab567832a9b239cc2330f9805b7ff06
github-keyboard-shortcutscopilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/topics/unstructured-data
og:site_nameGitHub
og:imagehttps://github.githubassets.com/assets/github-octocat-13c86b8b336d.png
og:image:typeimage/png
og:image:width1200
og:image:height620
twitter:site:id13334762
twitter:creatorgithub
twitter:creator:id13334762
twitter:cardsummary_large_image
twitter:imagehttps://github.githubassets.com/assets/github-logo-55c5b9a1fe52.png
twitter:image:width1200
twitter:image:height1200
hostnamegithub.com
expected-hostnamegithub.com
None5f99f7c1d70f01da5b93e5ca90303359738944d8ab470e396496262c66e60b8d
turbo-cache-controlno-preview
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release82560a55c6b2054555076f46e683151ee28a19bc
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/topics/unstructured-data#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftopics%2Funstructured-data
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftopics%2Funstructured-data
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2Ftopics%2Funstructured-data&source=header
Reloadhttps://patch-diff.githubusercontent.com/topics/unstructured-data
Reloadhttps://patch-diff.githubusercontent.com/topics/unstructured-data
Reloadhttps://patch-diff.githubusercontent.com/topics/unstructured-data
Explorehttps://patch-diff.githubusercontent.com/explore
Topicshttps://patch-diff.githubusercontent.com/topics
Trendinghttps://patch-diff.githubusercontent.com/trending
Collectionshttps://patch-diff.githubusercontent.com/collections
Eventshttps://patch-diff.githubusercontent.com/events
GitHub Sponsorshttps://patch-diff.githubusercontent.com/sponsors/explore
Star https://patch-diff.githubusercontent.com/login?return_to=%2Ftopic.unstructured-data
All 225 https://github.com/topics/unstructured-data
Python 83 https://github.com/topics/unstructured-data?l=python
Jupyter Notebook 52 https://github.com/topics/unstructured-data?l=jupyter+notebook
Java 13 https://github.com/topics/unstructured-data?l=java
Go 11 https://github.com/topics/unstructured-data?l=go
TypeScript 11 https://github.com/topics/unstructured-data?l=typescript
JavaScript 10 https://github.com/topics/unstructured-data?l=javascript
HTML 5 https://github.com/topics/unstructured-data?l=html
R 5 https://github.com/topics/unstructured-data?l=r
C 2 https://github.com/topics/unstructured-data?l=c
CSS 2 https://github.com/topics/unstructured-data?l=css
Most stars https://patch-diff.githubusercontent.com/topics/unstructured-data?o=desc&s=stars
Fewest stars https://patch-diff.githubusercontent.com/topics/unstructured-data?o=asc&s=stars
Most forks https://patch-diff.githubusercontent.com/topics/unstructured-data?o=desc&s=forks
Fewest forks https://patch-diff.githubusercontent.com/topics/unstructured-data?o=asc&s=forks
Recently updated https://patch-diff.githubusercontent.com/topics/unstructured-data?o=desc&s=updated
Least recently updated https://patch-diff.githubusercontent.com/topics/unstructured-data?o=asc&s=updated
https://patch-diff.githubusercontent.com/treeverse/dvc
treeversehttps://patch-diff.githubusercontent.com/treeverse
dvchttps://patch-diff.githubusercontent.com/treeverse/dvc
Star 15.3k https://patch-diff.githubusercontent.com/login?return_to=%2Ftreeverse%2Fdvc
Code https://patch-diff.githubusercontent.com/treeverse/dvc
Issues https://patch-diff.githubusercontent.com/treeverse/dvc/issues
Pull requests https://patch-diff.githubusercontent.com/treeverse/dvc/pulls
Discussions https://patch-diff.githubusercontent.com/treeverse/dvc/discussions
data-sciencehttps://patch-diff.githubusercontent.com/topics/data-science
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
aihttps://patch-diff.githubusercontent.com/topics/ai
developer-toolshttps://patch-diff.githubusercontent.com/topics/developer-tools
reproducibilityhttps://patch-diff.githubusercontent.com/topics/reproducibility
data-version-controlhttps://patch-diff.githubusercontent.com/topics/data-version-control
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
https://patch-diff.githubusercontent.com/voxel51/fiftyone
voxel51https://patch-diff.githubusercontent.com/voxel51
fiftyonehttps://patch-diff.githubusercontent.com/voxel51/fiftyone
Star 10.3k https://patch-diff.githubusercontent.com/login?return_to=%2Fvoxel51%2Ffiftyone
Code https://patch-diff.githubusercontent.com/voxel51/fiftyone
Issues https://patch-diff.githubusercontent.com/voxel51/fiftyone/issues
Pull requests https://patch-diff.githubusercontent.com/voxel51/fiftyone/pulls
visualizationhttps://patch-diff.githubusercontent.com/topics/visualization
pythonhttps://patch-diff.githubusercontent.com/topics/python
data-sciencehttps://patch-diff.githubusercontent.com/topics/data-science
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
computer-visionhttps://patch-diff.githubusercontent.com/topics/computer-vision
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
artificial-intelligencehttps://patch-diff.githubusercontent.com/topics/artificial-intelligence
developer-toolshttps://patch-diff.githubusercontent.com/topics/developer-tools
image-classificationhttps://patch-diff.githubusercontent.com/topics/image-classification
object-detectionhttps://patch-diff.githubusercontent.com/topics/object-detection
data-cleaninghttps://patch-diff.githubusercontent.com/topics/data-cleaning
active-learninghttps://patch-diff.githubusercontent.com/topics/active-learning
data-qualityhttps://patch-diff.githubusercontent.com/topics/data-quality
data-curationhttps://patch-diff.githubusercontent.com/topics/data-curation
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
vector-searchhttps://patch-diff.githubusercontent.com/topics/vector-search
data-centric-aihttps://patch-diff.githubusercontent.com/topics/data-centric-ai
https://patch-diff.githubusercontent.com/Zipstack/unstract
Zipstackhttps://patch-diff.githubusercontent.com/Zipstack
unstracthttps://patch-diff.githubusercontent.com/Zipstack/unstract
Star 6k https://patch-diff.githubusercontent.com/login?return_to=%2FZipstack%2Funstract
Code https://patch-diff.githubusercontent.com/Zipstack/unstract
Issues https://patch-diff.githubusercontent.com/Zipstack/unstract/issues
Pull requests https://patch-diff.githubusercontent.com/Zipstack/unstract/pulls
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
etl-pipelinehttps://patch-diff.githubusercontent.com/topics/etl-pipeline
llm-platformhttps://patch-diff.githubusercontent.com/topics/llm-platform
neo4j-labshttps://patch-diff.githubusercontent.com/neo4j-labs
llm-graph-builderhttps://patch-diff.githubusercontent.com/neo4j-labs/llm-graph-builder
Star 4.3k https://patch-diff.githubusercontent.com/login?return_to=%2Fneo4j-labs%2Fllm-graph-builder
Code https://patch-diff.githubusercontent.com/neo4j-labs/llm-graph-builder
Issues https://patch-diff.githubusercontent.com/neo4j-labs/llm-graph-builder/issues
Pull requests https://patch-diff.githubusercontent.com/neo4j-labs/llm-graph-builder/pulls
Discussions https://patch-diff.githubusercontent.com/neo4j-labs/llm-graph-builder/discussions
neo4jhttps://patch-diff.githubusercontent.com/topics/neo4j
graphhttps://patch-diff.githubusercontent.com/topics/graph
data-importhttps://patch-diff.githubusercontent.com/topics/data-import
knowledge-graphhttps://patch-diff.githubusercontent.com/topics/knowledge-graph
graphdbhttps://patch-diff.githubusercontent.com/topics/graphdb
graph-searchhttps://patch-diff.githubusercontent.com/topics/graph-search
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
raghttps://patch-diff.githubusercontent.com/topics/rag
langchainhttps://patch-diff.githubusercontent.com/topics/langchain
vectordbhttps://patch-diff.githubusercontent.com/topics/vectordb
genaihttps://patch-diff.githubusercontent.com/topics/genai
graph-raghttps://patch-diff.githubusercontent.com/topics/graph-rag
graphraghttps://patch-diff.githubusercontent.com/topics/graphrag
towhee-iohttps://patch-diff.githubusercontent.com/towhee-io
towheehttps://patch-diff.githubusercontent.com/towhee-io/towhee
Star 3.4k https://patch-diff.githubusercontent.com/login?return_to=%2Ftowhee-io%2Ftowhee
Code https://patch-diff.githubusercontent.com/towhee-io/towhee
Issues https://patch-diff.githubusercontent.com/towhee-io/towhee/issues
Pull requests https://patch-diff.githubusercontent.com/towhee-io/towhee/pulls
Discussions https://patch-diff.githubusercontent.com/towhee-io/towhee/discussions
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
computer-visionhttps://patch-diff.githubusercontent.com/topics/computer-vision
pipelinehttps://patch-diff.githubusercontent.com/topics/pipeline
image-processinghttps://patch-diff.githubusercontent.com/topics/image-processing
embeddingshttps://patch-diff.githubusercontent.com/topics/embeddings
transformerhttps://patch-diff.githubusercontent.com/topics/transformer
video-processinghttps://patch-diff.githubusercontent.com/topics/video-processing
feature-extractionhttps://patch-diff.githubusercontent.com/topics/feature-extraction
convolutional-networkshttps://patch-diff.githubusercontent.com/topics/convolutional-networks
vithttps://patch-diff.githubusercontent.com/topics/vit
feature-vectorhttps://patch-diff.githubusercontent.com/topics/feature-vector
image-retrievalhttps://patch-diff.githubusercontent.com/topics/image-retrieval
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
embedding-vectorshttps://patch-diff.githubusercontent.com/topics/embedding-vectors
milvushttps://patch-diff.githubusercontent.com/topics/milvus
vision-transformerhttps://patch-diff.githubusercontent.com/topics/vision-transformer
towheehttps://patch-diff.githubusercontent.com/topics/towhee
llmhttps://patch-diff.githubusercontent.com/topics/llm
ucbepichttps://patch-diff.githubusercontent.com/ucbepic
docetlhttps://patch-diff.githubusercontent.com/ucbepic/docetl
Star 3.4k https://patch-diff.githubusercontent.com/login?return_to=%2Fucbepic%2Fdocetl
Code https://patch-diff.githubusercontent.com/ucbepic/docetl
Issues https://patch-diff.githubusercontent.com/ucbepic/docetl/issues
Pull requests https://patch-diff.githubusercontent.com/ucbepic/docetl/pulls
pythonhttps://patch-diff.githubusercontent.com/topics/python
workflowhttps://patch-diff.githubusercontent.com/topics/workflow
datahttps://patch-diff.githubusercontent.com/topics/data
etlhttps://patch-diff.githubusercontent.com/topics/etl
semantic-datahttps://patch-diff.githubusercontent.com/topics/semantic-data
elthttps://patch-diff.githubusercontent.com/topics/elt
data-pipelineshttps://patch-diff.githubusercontent.com/topics/data-pipelines
agentshttps://patch-diff.githubusercontent.com/topics/agents
document-analysishttps://patch-diff.githubusercontent.com/topics/document-analysis
document-processinghttps://patch-diff.githubusercontent.com/topics/document-processing
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
unstructured-data-analysishttps://patch-diff.githubusercontent.com/topics/unstructured-data-analysis
llmhttps://patch-diff.githubusercontent.com/topics/llm
milvus-iohttps://patch-diff.githubusercontent.com/milvus-io
bootcamphttps://patch-diff.githubusercontent.com/milvus-io/bootcamp
Star 2.4k https://patch-diff.githubusercontent.com/login?return_to=%2Fmilvus-io%2Fbootcamp
Code https://patch-diff.githubusercontent.com/milvus-io/bootcamp
Issues https://patch-diff.githubusercontent.com/milvus-io/bootcamp/issues
Pull requests https://patch-diff.githubusercontent.com/milvus-io/bootcamp/pulls
pythonhttps://patch-diff.githubusercontent.com/topics/python
nlphttps://patch-diff.githubusercontent.com/topics/nlp
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
embeddingshttps://patch-diff.githubusercontent.com/topics/embeddings
question-answeringhttps://patch-diff.githubusercontent.com/topics/question-answering
image-classificationhttps://patch-diff.githubusercontent.com/topics/image-classification
image-recognitionhttps://patch-diff.githubusercontent.com/topics/image-recognition
image-searchhttps://patch-diff.githubusercontent.com/topics/image-search
semantic-searchhttps://patch-diff.githubusercontent.com/topics/semantic-search
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
audio-searchhttps://patch-diff.githubusercontent.com/topics/audio-search
raghttps://patch-diff.githubusercontent.com/topics/rag
milvushttps://patch-diff.githubusercontent.com/topics/milvus
vector-databasehttps://patch-diff.githubusercontent.com/topics/vector-database
llmhttps://patch-diff.githubusercontent.com/topics/llm
instill-aihttps://patch-diff.githubusercontent.com/instill-ai
instill-corehttps://patch-diff.githubusercontent.com/instill-ai/instill-core
Star 2.3k https://patch-diff.githubusercontent.com/login?return_to=%2Finstill-ai%2Finstill-core
Code https://patch-diff.githubusercontent.com/instill-ai/instill-core
Issues https://patch-diff.githubusercontent.com/instill-ai/instill-core/issues
Pull requests https://patch-diff.githubusercontent.com/instill-ai/instill-core/pulls
pythonhttps://patch-diff.githubusercontent.com/topics/python
apihttps://patch-diff.githubusercontent.com/topics/api
clihttps://patch-diff.githubusercontent.com/topics/cli
golanghttps://patch-diff.githubusercontent.com/topics/golang
open-sourcehttps://patch-diff.githubusercontent.com/topics/open-source
typescripthttps://patch-diff.githubusercontent.com/topics/typescript
aihttps://patch-diff.githubusercontent.com/topics/ai
pipelinehttps://patch-diff.githubusercontent.com/topics/pipeline
etlhttps://patch-diff.githubusercontent.com/topics/etl
developer-toolshttps://patch-diff.githubusercontent.com/topics/developer-tools
gpthttps://patch-diff.githubusercontent.com/topics/gpt
hacktoberfesthttps://patch-diff.githubusercontent.com/topics/hacktoberfest
low-codehttps://patch-diff.githubusercontent.com/topics/low-code
no-codehttps://patch-diff.githubusercontent.com/topics/no-code
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
llmhttps://patch-diff.githubusercontent.com/topics/llm
stable-diffusionhttps://patch-diff.githubusercontent.com/topics/stable-diffusion
generative-aihttps://patch-diff.githubusercontent.com/topics/generative-ai
nomic-aihttps://patch-diff.githubusercontent.com/nomic-ai
nomichttps://patch-diff.githubusercontent.com/nomic-ai/nomic
Star 1.9k https://patch-diff.githubusercontent.com/login?return_to=%2Fnomic-ai%2Fnomic
Code https://patch-diff.githubusercontent.com/nomic-ai/nomic
Issues https://patch-diff.githubusercontent.com/nomic-ai/nomic/issues
Pull requests https://patch-diff.githubusercontent.com/nomic-ai/nomic/pulls
pythonhttps://patch-diff.githubusercontent.com/topics/python
clusteringhttps://patch-diff.githubusercontent.com/topics/clustering
texthttps://patch-diff.githubusercontent.com/topics/text
embeddingshttps://patch-diff.githubusercontent.com/topics/embeddings
topic-modelinghttps://patch-diff.githubusercontent.com/topics/topic-modeling
duplicate-detectionhttps://patch-diff.githubusercontent.com/topics/duplicate-detection
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
NanoNetshttps://patch-diff.githubusercontent.com/NanoNets
docexthttps://patch-diff.githubusercontent.com/NanoNets/docext
Star 1.8k https://patch-diff.githubusercontent.com/login?return_to=%2FNanoNets%2Fdocext
Code https://patch-diff.githubusercontent.com/NanoNets/docext
Issues https://patch-diff.githubusercontent.com/NanoNets/docext/issues
Pull requests https://patch-diff.githubusercontent.com/NanoNets/docext/pulls
Discussions https://patch-diff.githubusercontent.com/NanoNets/docext/discussions
https://idp-leaderboard.org/https://idp-leaderboard.org/
nlphttps://patch-diff.githubusercontent.com/topics/nlp
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
ocrhttps://patch-diff.githubusercontent.com/topics/ocr
extractionhttps://patch-diff.githubusercontent.com/topics/extraction
documenthttps://patch-diff.githubusercontent.com/topics/document
onpremhttps://patch-diff.githubusercontent.com/topics/onprem
document-analysishttps://patch-diff.githubusercontent.com/topics/document-analysis
table-extractionhttps://patch-diff.githubusercontent.com/topics/table-extraction
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
raghttps://patch-diff.githubusercontent.com/topics/rag
onpremisehttps://patch-diff.githubusercontent.com/topics/onpremise
llmshttps://patch-diff.githubusercontent.com/topics/llms
vlmshttps://patch-diff.githubusercontent.com/topics/vlms
document-information-extractionhttps://patch-diff.githubusercontent.com/topics/document-information-extraction
ocr-onpremisehttps://patch-diff.githubusercontent.com/topics/ocr-onpremise
document-data-extractionhttps://patch-diff.githubusercontent.com/topics/document-data-extraction
onprem-visionhttps://patch-diff.githubusercontent.com/topics/onprem-vision
onprem-ocrhttps://patch-diff.githubusercontent.com/topics/onprem-ocr
llm-ocrhttps://patch-diff.githubusercontent.com/topics/llm-ocr
ocr-benchmarkhttps://patch-diff.githubusercontent.com/topics/ocr-benchmark
https://patch-diff.githubusercontent.com/shcherbak-ai/contextgem
shcherbak-aihttps://patch-diff.githubusercontent.com/shcherbak-ai
contextgemhttps://patch-diff.githubusercontent.com/shcherbak-ai/contextgem
Star 1.8k https://patch-diff.githubusercontent.com/login?return_to=%2Fshcherbak-ai%2Fcontextgem
Code https://patch-diff.githubusercontent.com/shcherbak-ai/contextgem
Issues https://patch-diff.githubusercontent.com/shcherbak-ai/contextgem/issues
Pull requests https://patch-diff.githubusercontent.com/shcherbak-ai/contextgem/pulls
Discussions https://patch-diff.githubusercontent.com/shcherbak-ai/contextgem/discussions
nlphttps://patch-diff.githubusercontent.com/topics/nlp
aihttps://patch-diff.githubusercontent.com/topics/ai
text-analysishttps://patch-diff.githubusercontent.com/topics/text-analysis
docxhttps://patch-diff.githubusercontent.com/topics/docx
data-extractionhttps://patch-diff.githubusercontent.com/topics/data-extraction
contract-analysishttps://patch-diff.githubusercontent.com/topics/contract-analysis
legaltechhttps://patch-diff.githubusercontent.com/topics/legaltech
docx2txthttps://patch-diff.githubusercontent.com/topics/docx2txt
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
document-intelligencehttps://patch-diff.githubusercontent.com/topics/document-intelligence
llmhttps://patch-diff.githubusercontent.com/topics/llm
docx2mdhttps://patch-diff.githubusercontent.com/topics/docx2md
prompt-engineeringhttps://patch-diff.githubusercontent.com/topics/prompt-engineering
llmshttps://patch-diff.githubusercontent.com/topics/llms
generative-aihttps://patch-diff.githubusercontent.com/topics/generative-ai
llm-frameworkhttps://patch-diff.githubusercontent.com/topics/llm-framework
llm-pipelinehttps://patch-diff.githubusercontent.com/topics/llm-pipeline
llm-extractionhttps://patch-diff.githubusercontent.com/topics/llm-extraction
dingodbhttps://patch-diff.githubusercontent.com/dingodb
dingohttps://patch-diff.githubusercontent.com/dingodb/dingo
Star 1.7k https://patch-diff.githubusercontent.com/login?return_to=%2Fdingodb%2Fdingo
Code https://patch-diff.githubusercontent.com/dingodb/dingo
Issues https://patch-diff.githubusercontent.com/dingodb/dingo/issues
Pull requests https://patch-diff.githubusercontent.com/dingodb/dingo/pulls
Discussions https://patch-diff.githubusercontent.com/dingodb/dingo/discussions
structured-datahttps://patch-diff.githubusercontent.com/topics/structured-data
servinghttps://patch-diff.githubusercontent.com/topics/serving
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
unified-sqlhttps://patch-diff.githubusercontent.com/topics/unified-sql
vector-databasehttps://patch-diff.githubusercontent.com/topics/vector-database
mysql-compatibilityhttps://patch-diff.githubusercontent.com/topics/mysql-compatibility
hybrid-searchhttps://patch-diff.githubusercontent.com/topics/hybrid-search
embedding-searchhttps://patch-diff.githubusercontent.com/topics/embedding-search
embedding-storehttps://patch-diff.githubusercontent.com/topics/embedding-store
key-value-distributed-storehttps://patch-diff.githubusercontent.com/topics/key-value-distributed-store
vector-oceanhttps://patch-diff.githubusercontent.com/topics/vector-ocean
real-time-semantic-searchhttps://patch-diff.githubusercontent.com/topics/real-time-semantic-search
yobix-aihttps://patch-diff.githubusercontent.com/yobix-ai
extractoushttps://patch-diff.githubusercontent.com/yobix-ai/extractous
Star 1.7k https://patch-diff.githubusercontent.com/login?return_to=%2Fyobix-ai%2Fextractous
Code https://patch-diff.githubusercontent.com/yobix-ai/extractous
Issues https://patch-diff.githubusercontent.com/yobix-ai/extractous/issues
Pull requests https://patch-diff.githubusercontent.com/yobix-ai/extractous/pulls
nlphttps://patch-diff.githubusercontent.com/topics/nlp
rusthttps://patch-diff.githubusercontent.com/topics/rust
pdfhttps://patch-diff.githubusercontent.com/topics/pdf
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
natural-language-processinghttps://patch-diff.githubusercontent.com/topics/natural-language-processing
ocrhttps://patch-diff.githubusercontent.com/topics/ocr
etlhttps://patch-diff.githubusercontent.com/topics/etl
tikahttps://patch-diff.githubusercontent.com/topics/tika
extractionhttps://patch-diff.githubusercontent.com/topics/extraction
docxhttps://patch-diff.githubusercontent.com/topics/docx
data-pipelineshttps://patch-diff.githubusercontent.com/topics/data-pipelines
pdf-parserhttps://patch-diff.githubusercontent.com/topics/pdf-parser
unstructuredhttps://patch-diff.githubusercontent.com/topics/unstructured
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
raghttps://patch-diff.githubusercontent.com/topics/rag
etl-pipelineshttps://patch-diff.githubusercontent.com/topics/etl-pipelines
llmhttps://patch-diff.githubusercontent.com/topics/llm
lotus-datahttps://patch-diff.githubusercontent.com/lotus-data
lotushttps://patch-diff.githubusercontent.com/lotus-data/lotus
Star 1.5k https://patch-diff.githubusercontent.com/login?return_to=%2Flotus-data%2Flotus
Code https://patch-diff.githubusercontent.com/lotus-data/lotus
Issues https://patch-diff.githubusercontent.com/lotus-data/lotus/issues
Pull requests https://patch-diff.githubusercontent.com/lotus-data/lotus/pulls
Discussions https://patch-diff.githubusercontent.com/lotus-data/lotus/discussions
pythonhttps://patch-diff.githubusercontent.com/topics/python
datahttps://patch-diff.githubusercontent.com/topics/data
pandashttps://patch-diff.githubusercontent.com/topics/pandas
semantic-searchhttps://patch-diff.githubusercontent.com/topics/semantic-search
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
llmhttps://patch-diff.githubusercontent.com/topics/llm
ai-data-processinghttps://patch-diff.githubusercontent.com/topics/ai-data-processing
semantic-operatorshttps://patch-diff.githubusercontent.com/topics/semantic-operators
llm-data-processinghttps://patch-diff.githubusercontent.com/topics/llm-data-processing
llm-document-processinghttps://patch-diff.githubusercontent.com/topics/llm-document-processing
emcfhttps://patch-diff.githubusercontent.com/emcf
thepipehttps://patch-diff.githubusercontent.com/emcf/thepipe
Star 1.5k https://patch-diff.githubusercontent.com/login?return_to=%2Femcf%2Fthepipe
Code https://patch-diff.githubusercontent.com/emcf/thepipe
Issues https://patch-diff.githubusercontent.com/emcf/thepipe/issues
Pull requests https://patch-diff.githubusercontent.com/emcf/thepipe/pulls
pythonhttps://patch-diff.githubusercontent.com/topics/python
pdfhttps://patch-diff.githubusercontent.com/topics/pdf
webhttps://patch-diff.githubusercontent.com/topics/web
scrapinghttps://patch-diff.githubusercontent.com/topics/scraping
openaihttps://patch-diff.githubusercontent.com/topics/openai
documenthttps://patch-diff.githubusercontent.com/topics/document
scrapershttps://patch-diff.githubusercontent.com/topics/scrapers
structured-datahttps://patch-diff.githubusercontent.com/topics/structured-data
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
multimodalhttps://patch-diff.githubusercontent.com/topics/multimodal
vision-transformerhttps://patch-diff.githubusercontent.com/topics/vision-transformer
large-language-modelshttps://patch-diff.githubusercontent.com/topics/large-language-models
vision-language-modelhttps://patch-diff.githubusercontent.com/topics/vision-language-model
tstanislawekhttps://patch-diff.githubusercontent.com/tstanislawek
awesome-document-understandinghttps://patch-diff.githubusercontent.com/tstanislawek/awesome-document-understanding
Star 1.5k https://patch-diff.githubusercontent.com/login?return_to=%2Ftstanislawek%2Fawesome-document-understanding
Code https://patch-diff.githubusercontent.com/tstanislawek/awesome-document-understanding
Issues https://patch-diff.githubusercontent.com/tstanislawek/awesome-document-understanding/issues
Pull requests https://patch-diff.githubusercontent.com/tstanislawek/awesome-document-understanding/pulls
nlphttps://patch-diff.githubusercontent.com/topics/nlp
pdfhttps://patch-diff.githubusercontent.com/topics/pdf
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
natural-language-processinghttps://patch-diff.githubusercontent.com/topics/natural-language-processing
awesomehttps://patch-diff.githubusercontent.com/topics/awesome
ocrhttps://patch-diff.githubusercontent.com/topics/ocr
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
information-extractionhttps://patch-diff.githubusercontent.com/topics/information-extraction
awesome-listhttps://patch-diff.githubusercontent.com/topics/awesome-list
pdf-documentshttps://patch-diff.githubusercontent.com/topics/pdf-documents
document-analysishttps://patch-diff.githubusercontent.com/topics/document-analysis
rpahttps://patch-diff.githubusercontent.com/topics/rpa
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
robotic-process-automationhttps://patch-diff.githubusercontent.com/topics/robotic-process-automation
document-layout-analysishttps://patch-diff.githubusercontent.com/topics/document-layout-analysis
document-understandinghttps://patch-diff.githubusercontent.com/topics/document-understanding
key-information-extractionhttps://patch-diff.githubusercontent.com/topics/key-information-extraction
document-aihttps://patch-diff.githubusercontent.com/topics/document-ai
document-intelligencehttps://patch-diff.githubusercontent.com/topics/document-intelligence
intelligent-processinghttps://patch-diff.githubusercontent.com/topics/intelligent-processing
amphi-aihttps://patch-diff.githubusercontent.com/amphi-ai
amphi-etlhttps://patch-diff.githubusercontent.com/amphi-ai/amphi-etl
Star 1.3k https://patch-diff.githubusercontent.com/login?return_to=%2Famphi-ai%2Famphi-etl
Code https://patch-diff.githubusercontent.com/amphi-ai/amphi-etl
Issues https://patch-diff.githubusercontent.com/amphi-ai/amphi-etl/issues
Pull requests https://patch-diff.githubusercontent.com/amphi-ai/amphi-etl/pulls
data-sciencehttps://patch-diff.githubusercontent.com/topics/data-science
datahttps://patch-diff.githubusercontent.com/topics/data
etlhttps://patch-diff.githubusercontent.com/topics/etl
data-analysishttps://patch-diff.githubusercontent.com/topics/data-analysis
structured-datahttps://patch-diff.githubusercontent.com/topics/structured-data
data-pipelineshttps://patch-diff.githubusercontent.com/topics/data-pipelines
data-preparationhttps://patch-diff.githubusercontent.com/topics/data-preparation
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
datatransformationhttps://patch-diff.githubusercontent.com/topics/datatransformation
analytics-automationhttps://patch-diff.githubusercontent.com/topics/analytics-automation
Renumicshttps://patch-diff.githubusercontent.com/Renumics
spotlighthttps://patch-diff.githubusercontent.com/Renumics/spotlight
Star 1.2k https://patch-diff.githubusercontent.com/login?return_to=%2FRenumics%2Fspotlight
Code https://patch-diff.githubusercontent.com/Renumics/spotlight
Issues https://patch-diff.githubusercontent.com/Renumics/spotlight/issues
Pull requests https://patch-diff.githubusercontent.com/Renumics/spotlight/pulls
audiohttps://patch-diff.githubusercontent.com/topics/audio
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
videohttps://patch-diff.githubusercontent.com/topics/video
computer-visionhttps://patch-diff.githubusercontent.com/topics/computer-vision
timeserieshttps://patch-diff.githubusercontent.com/topics/timeseries
imageshttps://patch-diff.githubusercontent.com/topics/images
exploratory-data-analysishttps://patch-diff.githubusercontent.com/topics/exploratory-data-analysis
data-visualizationhttps://patch-diff.githubusercontent.com/topics/data-visualization
hacktoberfesthttps://patch-diff.githubusercontent.com/topics/hacktoberfest
mesheshttps://patch-diff.githubusercontent.com/topics/meshes
data-curationhttps://patch-diff.githubusercontent.com/topics/data-curation
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
data-centric-aihttps://patch-diff.githubusercontent.com/topics/data-centric-ai
Open-Source-Legalhttps://patch-diff.githubusercontent.com/Open-Source-Legal
OpenContractshttps://patch-diff.githubusercontent.com/Open-Source-Legal/OpenContracts
Star 1.1k https://patch-diff.githubusercontent.com/login?return_to=%2FOpen-Source-Legal%2FOpenContracts
Code https://patch-diff.githubusercontent.com/Open-Source-Legal/OpenContracts
Issues https://patch-diff.githubusercontent.com/Open-Source-Legal/OpenContracts/issues
Pull requests https://patch-diff.githubusercontent.com/Open-Source-Legal/OpenContracts/pulls
agenthttps://patch-diff.githubusercontent.com/topics/agent
etlhttps://patch-diff.githubusercontent.com/topics/etl
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
etl-pipelinehttps://patch-diff.githubusercontent.com/topics/etl-pipeline
vector-databasehttps://patch-diff.githubusercontent.com/topics/vector-database
llmhttps://patch-diff.githubusercontent.com/topics/llm
prompt-engineeringhttps://patch-diff.githubusercontent.com/topics/prompt-engineering
agentic-aihttps://patch-diff.githubusercontent.com/topics/agentic-ai
https://patch-diff.githubusercontent.com/databricks/lilac
databrickshttps://patch-diff.githubusercontent.com/databricks
lilachttps://patch-diff.githubusercontent.com/databricks/lilac
Star 1.1k https://patch-diff.githubusercontent.com/login?return_to=%2Fdatabricks%2Flilac
Code https://patch-diff.githubusercontent.com/databricks/lilac
Issues https://patch-diff.githubusercontent.com/databricks/lilac/issues
Pull requests https://patch-diff.githubusercontent.com/databricks/lilac/pulls
Discussions https://patch-diff.githubusercontent.com/databricks/lilac/discussions
artificial-intelligencehttps://patch-diff.githubusercontent.com/topics/artificial-intelligence
data-analysishttps://patch-diff.githubusercontent.com/topics/data-analysis
unstructured-datahttps://patch-diff.githubusercontent.com/topics/unstructured-data
dataset-analysishttps://patch-diff.githubusercontent.com/topics/dataset-analysis
Curate this topic https://github.com/github/explore/tree/master/CONTRIBUTING.md?source=add-description-unstructured-data
Learn more https://docs.github.com/en/articles/classifying-your-repository-with-topics
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.