René's URL Explorer Experiment


Title: quantization · GitHub Topics · GitHub

Open Graph Title: Build software better, together

X Title: GitHub

Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

Open Graph Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

X Description: GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

Opengraph URL: https://github.com

X: github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/topics/:topic_name(.:format)
route-controllertopics
route-actionshow
fetch-noncev2:d1c5cb23-7ad6-9a7b-88cb-5c59dcdb104e
current-catalog-service-hash82c569b93da5c18ed649ebd4c2c79437db4611a6a1373e805a3cb001c64130b7
request-idC4B2:193CDB:3C86129:4F64933:696BA6CE
html-safe-nonce7e1b826614ed418b2ecd8a236d325659890d15e7c1874ec46b9da83fd08ffbbe
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDNEIyOjE5M0NEQjozQzg2MTI5OjRGNjQ5MzM6Njk2QkE2Q0UiLCJ2aXNpdG9yX2lkIjoiNjcxNzQzMDIzMTEyMDcxNzUxOCIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac2b83372c108c4af65949b073fc9436b0542bd2667d32496961cd189e562092f6
github-keyboard-shortcutscopilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/topics/quantization
og:site_nameGitHub
og:imagehttps://github.githubassets.com/assets/github-octocat-13c86b8b336d.png
og:image:typeimage/png
og:image:width1200
og:image:height620
twitter:site:id13334762
twitter:creatorgithub
twitter:creator:id13334762
twitter:cardsummary_large_image
twitter:imagehttps://github.githubassets.com/assets/github-logo-55c5b9a1fe52.png
twitter:image:width1200
twitter:image:height1200
hostnamegithub.com
expected-hostnamegithub.com
None5f99f7c1d70f01da5b93e5ca90303359738944d8ab470e396496262c66e60b8d
turbo-cache-controlno-preview
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release82560a55c6b2054555076f46e683151ee28a19bc
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/topics/quantization#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftopics%2Fquantization
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftopics%2Fquantization
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2Ftopics%2Fquantization&source=header
Reloadhttps://patch-diff.githubusercontent.com/topics/quantization
Reloadhttps://patch-diff.githubusercontent.com/topics/quantization
Reloadhttps://patch-diff.githubusercontent.com/topics/quantization
Explorehttps://patch-diff.githubusercontent.com/explore
Topicshttps://patch-diff.githubusercontent.com/topics
Trendinghttps://patch-diff.githubusercontent.com/trending
Collectionshttps://patch-diff.githubusercontent.com/collections
Eventshttps://patch-diff.githubusercontent.com/events
GitHub Sponsorshttps://patch-diff.githubusercontent.com/sponsors/explore
Star https://patch-diff.githubusercontent.com/login?return_to=%2Ftopic.quantization
All 1,047 https://github.com/topics/quantization
Python 520 https://github.com/topics/quantization?l=python
Jupyter Notebook 233 https://github.com/topics/quantization?l=jupyter+notebook
C++ 46 https://github.com/topics/quantization?l=c%2B%2B
MATLAB 30 https://github.com/topics/quantization?l=matlab
C 27 https://github.com/topics/quantization?l=c
JavaScript 18 https://github.com/topics/quantization?l=javascript
Java 16 https://github.com/topics/quantization?l=java
Rust 16 https://github.com/topics/quantization?l=rust
HTML 9 https://github.com/topics/quantization?l=html
TypeScript 9 https://github.com/topics/quantization?l=typescript
Most stars https://patch-diff.githubusercontent.com/topics/quantization?o=desc&s=stars
Fewest stars https://patch-diff.githubusercontent.com/topics/quantization?o=asc&s=stars
Most forks https://patch-diff.githubusercontent.com/topics/quantization?o=desc&s=forks
Fewest forks https://patch-diff.githubusercontent.com/topics/quantization?o=asc&s=forks
Recently updated https://patch-diff.githubusercontent.com/topics/quantization?o=desc&s=updated
Least recently updated https://patch-diff.githubusercontent.com/topics/quantization?o=asc&s=updated
https://patch-diff.githubusercontent.com/hiyouga/LlamaFactory
hiyougahttps://patch-diff.githubusercontent.com/hiyouga
LlamaFactoryhttps://patch-diff.githubusercontent.com/hiyouga/LlamaFactory
Star 65.9k https://patch-diff.githubusercontent.com/login?return_to=%2Fhiyouga%2FLlamaFactory
Code https://patch-diff.githubusercontent.com/hiyouga/LlamaFactory
Issues https://patch-diff.githubusercontent.com/hiyouga/LlamaFactory/issues
Pull requests https://patch-diff.githubusercontent.com/hiyouga/LlamaFactory/pulls
Discussions https://patch-diff.githubusercontent.com/hiyouga/LlamaFactory/discussions
nlphttps://patch-diff.githubusercontent.com/topics/nlp
agenthttps://patch-diff.githubusercontent.com/topics/agent
aihttps://patch-diff.githubusercontent.com/topics/ai
transformershttps://patch-diff.githubusercontent.com/topics/transformers
moehttps://patch-diff.githubusercontent.com/topics/moe
llamahttps://patch-diff.githubusercontent.com/topics/llama
gpthttps://patch-diff.githubusercontent.com/topics/gpt
lorahttps://patch-diff.githubusercontent.com/topics/lora
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
gemmahttps://patch-diff.githubusercontent.com/topics/gemma
fine-tuninghttps://patch-diff.githubusercontent.com/topics/fine-tuning
pefthttps://patch-diff.githubusercontent.com/topics/peft
large-language-modelshttps://patch-diff.githubusercontent.com/topics/large-language-models
llmhttps://patch-diff.githubusercontent.com/topics/llm
rlhfhttps://patch-diff.githubusercontent.com/topics/rlhf
instruction-tuninghttps://patch-diff.githubusercontent.com/topics/instruction-tuning
qlorahttps://patch-diff.githubusercontent.com/topics/qlora
qwenhttps://patch-diff.githubusercontent.com/topics/qwen
deepseekhttps://patch-diff.githubusercontent.com/topics/deepseek
llama3https://patch-diff.githubusercontent.com/topics/llama3
SYSTRANhttps://patch-diff.githubusercontent.com/SYSTRAN
faster-whisperhttps://patch-diff.githubusercontent.com/SYSTRAN/faster-whisper
Star 20.4k https://patch-diff.githubusercontent.com/login?return_to=%2FSYSTRAN%2Ffaster-whisper
Code https://patch-diff.githubusercontent.com/SYSTRAN/faster-whisper
Issues https://patch-diff.githubusercontent.com/SYSTRAN/faster-whisper/issues
Pull requests https://patch-diff.githubusercontent.com/SYSTRAN/faster-whisper/pulls
Discussions https://patch-diff.githubusercontent.com/SYSTRAN/faster-whisper/discussions
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
inferencehttps://patch-diff.githubusercontent.com/topics/inference
transformerhttps://patch-diff.githubusercontent.com/topics/transformer
speech-recognitionhttps://patch-diff.githubusercontent.com/topics/speech-recognition
openaihttps://patch-diff.githubusercontent.com/topics/openai
speech-to-texthttps://patch-diff.githubusercontent.com/topics/speech-to-text
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
whisperhttps://patch-diff.githubusercontent.com/topics/whisper
https://patch-diff.githubusercontent.com/ymcui/Chinese-LLaMA-Alpaca
ymcuihttps://patch-diff.githubusercontent.com/ymcui
Chinese-LLaMA-Alpacahttps://patch-diff.githubusercontent.com/ymcui/Chinese-LLaMA-Alpaca
Star 19k https://patch-diff.githubusercontent.com/login?return_to=%2Fymcui%2FChinese-LLaMA-Alpaca
Code https://patch-diff.githubusercontent.com/ymcui/Chinese-LLaMA-Alpaca
Issues https://patch-diff.githubusercontent.com/ymcui/Chinese-LLaMA-Alpaca/issues
Pull requests https://patch-diff.githubusercontent.com/ymcui/Chinese-LLaMA-Alpaca/pulls
Discussions https://patch-diff.githubusercontent.com/ymcui/Chinese-LLaMA-Alpaca/discussions
nlphttps://patch-diff.githubusercontent.com/topics/nlp
llamahttps://patch-diff.githubusercontent.com/topics/llama
lorahttps://patch-diff.githubusercontent.com/topics/lora
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
alpacahttps://patch-diff.githubusercontent.com/topics/alpaca
plmhttps://patch-diff.githubusercontent.com/topics/plm
pre-trained-language-modelshttps://patch-diff.githubusercontent.com/topics/pre-trained-language-models
large-language-modelshttps://patch-diff.githubusercontent.com/topics/large-language-models
llmhttps://patch-diff.githubusercontent.com/topics/llm
llama-2https://patch-diff.githubusercontent.com/topics/llama-2
alpaca-2https://patch-diff.githubusercontent.com/topics/alpaca-2
https://patch-diff.githubusercontent.com/UFund-Me/Qbot
UFund-Mehttps://patch-diff.githubusercontent.com/UFund-Me
Qbothttps://patch-diff.githubusercontent.com/UFund-Me/Qbot
Star 15.9k https://patch-diff.githubusercontent.com/login?return_to=%2FUFund-Me%2FQbot
Code https://patch-diff.githubusercontent.com/UFund-Me/Qbot
Issues https://patch-diff.githubusercontent.com/UFund-Me/Qbot/issues
Pull requests https://patch-diff.githubusercontent.com/UFund-Me/Qbot/pulls
https://ufund-me.github.io/Qbothttps://ufund-me.github.io/Qbot
https://github.com/Charmve/iQuanthttps://github.com/Charmve/iQuant
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
bitcoinhttps://patch-diff.githubusercontent.com/topics/bitcoin
blockchainhttps://patch-diff.githubusercontent.com/topics/blockchain
fintechhttps://patch-diff.githubusercontent.com/topics/fintech
quantitative-financehttps://patch-diff.githubusercontent.com/topics/quantitative-finance
trademarkshttps://patch-diff.githubusercontent.com/topics/trademarks
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
fundshttps://patch-diff.githubusercontent.com/topics/funds
strategieshttps://patch-diff.githubusercontent.com/topics/strategies
backtesthttps://patch-diff.githubusercontent.com/topics/backtest
quantitative-tradinghttps://patch-diff.githubusercontent.com/topics/quantitative-trading
pytradehttps://patch-diff.githubusercontent.com/topics/pytrade
qlibhttps://patch-diff.githubusercontent.com/topics/qlib
quant-tradehttps://patch-diff.githubusercontent.com/topics/quant-trade
trade-bothttps://patch-diff.githubusercontent.com/topics/trade-bot
quant-traderhttps://patch-diff.githubusercontent.com/topics/quant-trader
https://patch-diff.githubusercontent.com/bitsandbytes-foundation/bitsandbytes
bitsandbytes-foundationhttps://patch-diff.githubusercontent.com/bitsandbytes-foundation
bitsandbyteshttps://patch-diff.githubusercontent.com/bitsandbytes-foundation/bitsandbytes
Sponsor https://patch-diff.githubusercontent.com/sponsors/bitsandbytes-foundation
Star 7.9k https://patch-diff.githubusercontent.com/login?return_to=%2Fbitsandbytes-foundation%2Fbitsandbytes
Code https://patch-diff.githubusercontent.com/bitsandbytes-foundation/bitsandbytes
Issues https://patch-diff.githubusercontent.com/bitsandbytes-foundation/bitsandbytes/issues
Pull requests https://patch-diff.githubusercontent.com/bitsandbytes-foundation/bitsandbytes/pulls
Discussions https://patch-diff.githubusercontent.com/bitsandbytes-foundation/bitsandbytes/discussions
machine-learninghttps://patch-diff.githubusercontent.com/topics/machine-learning
pytorchhttps://patch-diff.githubusercontent.com/topics/pytorch
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
llmhttps://patch-diff.githubusercontent.com/topics/llm
qlorahttps://patch-diff.githubusercontent.com/topics/qlora
kornelskihttps://patch-diff.githubusercontent.com/kornelski
pngquanthttps://patch-diff.githubusercontent.com/kornelski/pngquant
Star 5.5k https://patch-diff.githubusercontent.com/login?return_to=%2Fkornelski%2Fpngquant
Code https://patch-diff.githubusercontent.com/kornelski/pngquant
Issues https://patch-diff.githubusercontent.com/kornelski/pngquant/issues
Pull requests https://patch-diff.githubusercontent.com/kornelski/pngquant/pulls
chttps://patch-diff.githubusercontent.com/topics/c
palettehttps://patch-diff.githubusercontent.com/topics/palette
qualityhttps://patch-diff.githubusercontent.com/topics/quality
pnghttps://patch-diff.githubusercontent.com/topics/png
png-compressionhttps://patch-diff.githubusercontent.com/topics/png-compression
conversionhttps://patch-diff.githubusercontent.com/topics/conversion
smallerhttps://patch-diff.githubusercontent.com/topics/smaller
stdinhttps://patch-diff.githubusercontent.com/topics/stdin
image-optimizationhttps://patch-diff.githubusercontent.com/topics/image-optimization
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
pngquanthttps://patch-diff.githubusercontent.com/topics/pngquant
AutoGPTQhttps://patch-diff.githubusercontent.com/AutoGPTQ
AutoGPTQhttps://patch-diff.githubusercontent.com/AutoGPTQ/AutoGPTQ
Star 5k https://patch-diff.githubusercontent.com/login?return_to=%2FAutoGPTQ%2FAutoGPTQ
Code https://patch-diff.githubusercontent.com/AutoGPTQ/AutoGPTQ
Issues https://patch-diff.githubusercontent.com/AutoGPTQ/AutoGPTQ/issues
Pull requests https://patch-diff.githubusercontent.com/AutoGPTQ/AutoGPTQ/pulls
Discussions https://patch-diff.githubusercontent.com/AutoGPTQ/AutoGPTQ/discussions
nlphttps://patch-diff.githubusercontent.com/topics/nlp
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
transformershttps://patch-diff.githubusercontent.com/topics/transformers
inferencehttps://patch-diff.githubusercontent.com/topics/inference
pytorchhttps://patch-diff.githubusercontent.com/topics/pytorch
transformerhttps://patch-diff.githubusercontent.com/topics/transformer
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
large-language-modelshttps://patch-diff.githubusercontent.com/topics/large-language-models
llmshttps://patch-diff.githubusercontent.com/topics/llms
OpenNMThttps://patch-diff.githubusercontent.com/OpenNMT
CTranslate2https://patch-diff.githubusercontent.com/OpenNMT/CTranslate2
Star 4.3k https://patch-diff.githubusercontent.com/login?return_to=%2FOpenNMT%2FCTranslate2
Code https://patch-diff.githubusercontent.com/OpenNMT/CTranslate2
Issues https://patch-diff.githubusercontent.com/OpenNMT/CTranslate2/issues
Pull requests https://patch-diff.githubusercontent.com/OpenNMT/CTranslate2/pulls
deep-neural-networkshttps://patch-diff.githubusercontent.com/topics/deep-neural-networks
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
cpphttps://patch-diff.githubusercontent.com/topics/cpp
neonhttps://patch-diff.githubusercontent.com/topics/neon
machine-translationhttps://patch-diff.githubusercontent.com/topics/machine-translation
openmphttps://patch-diff.githubusercontent.com/topics/openmp
parallel-computinghttps://patch-diff.githubusercontent.com/topics/parallel-computing
cudahttps://patch-diff.githubusercontent.com/topics/cuda
inferencehttps://patch-diff.githubusercontent.com/topics/inference
avxhttps://patch-diff.githubusercontent.com/topics/avx
intrinsicshttps://patch-diff.githubusercontent.com/topics/intrinsics
avx2https://patch-diff.githubusercontent.com/topics/avx2
neural-machine-translationhttps://patch-diff.githubusercontent.com/topics/neural-machine-translation
opennmthttps://patch-diff.githubusercontent.com/topics/opennmt
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
gemmhttps://patch-diff.githubusercontent.com/topics/gemm
mklhttps://patch-diff.githubusercontent.com/topics/mkl
thrusthttps://patch-diff.githubusercontent.com/topics/thrust
transformer-modelshttps://patch-diff.githubusercontent.com/topics/transformer-models
onednnhttps://patch-diff.githubusercontent.com/topics/onednn
nunchaku-aihttps://patch-diff.githubusercontent.com/nunchaku-ai
nunchakuhttps://patch-diff.githubusercontent.com/nunchaku-ai/nunchaku
Star 3.6k https://patch-diff.githubusercontent.com/login?return_to=%2Fnunchaku-ai%2Fnunchaku
Code https://patch-diff.githubusercontent.com/nunchaku-ai/nunchaku
Issues https://patch-diff.githubusercontent.com/nunchaku-ai/nunchaku/issues
Pull requests https://patch-diff.githubusercontent.com/nunchaku-ai/nunchaku/pulls
Discussions https://patch-diff.githubusercontent.com/nunchaku-ai/nunchaku/discussions
fluxhttps://patch-diff.githubusercontent.com/topics/flux
lorahttps://patch-diff.githubusercontent.com/topics/lora
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
iclrhttps://patch-diff.githubusercontent.com/topics/iclr
diffusion-modelshttps://patch-diff.githubusercontent.com/topics/diffusion-models
mlsyshttps://patch-diff.githubusercontent.com/topics/mlsys
comfyuihttps://patch-diff.githubusercontent.com/topics/comfyui
genaihttps://patch-diff.githubusercontent.com/topics/genai
iclr2025https://patch-diff.githubusercontent.com/topics/iclr2025
huggingfacehttps://patch-diff.githubusercontent.com/huggingface
optimumhttps://patch-diff.githubusercontent.com/huggingface/optimum
Star 3.3k https://patch-diff.githubusercontent.com/login?return_to=%2Fhuggingface%2Foptimum
Code https://patch-diff.githubusercontent.com/huggingface/optimum
Issues https://patch-diff.githubusercontent.com/huggingface/optimum/issues
Pull requests https://patch-diff.githubusercontent.com/huggingface/optimum/pulls
traininghttps://patch-diff.githubusercontent.com/topics/training
optimizationhttps://patch-diff.githubusercontent.com/topics/optimization
intelhttps://patch-diff.githubusercontent.com/topics/intel
transformershttps://patch-diff.githubusercontent.com/topics/transformers
inferencehttps://patch-diff.githubusercontent.com/topics/inference
pytorchhttps://patch-diff.githubusercontent.com/topics/pytorch
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
onnxhttps://patch-diff.githubusercontent.com/topics/onnx
tflitehttps://patch-diff.githubusercontent.com/topics/tflite
onnxruntimehttps://patch-diff.githubusercontent.com/topics/onnxruntime
graphcorehttps://patch-diff.githubusercontent.com/topics/graphcore
habanahttps://patch-diff.githubusercontent.com/topics/habana
https://patch-diff.githubusercontent.com/neuralmagic/deepsparse
neuralmagichttps://patch-diff.githubusercontent.com/neuralmagic
deepsparsehttps://patch-diff.githubusercontent.com/neuralmagic/deepsparse
Star 3.2k https://patch-diff.githubusercontent.com/login?return_to=%2Fneuralmagic%2Fdeepsparse
Code https://patch-diff.githubusercontent.com/neuralmagic/deepsparse
Issues https://patch-diff.githubusercontent.com/neuralmagic/deepsparse/issues
Pull requests https://patch-diff.githubusercontent.com/neuralmagic/deepsparse/pulls
nlphttps://patch-diff.githubusercontent.com/topics/nlp
performancehttps://patch-diff.githubusercontent.com/topics/performance
computer-visionhttps://patch-diff.githubusercontent.com/topics/computer-vision
inferencehttps://patch-diff.githubusercontent.com/topics/inference
machinelearninghttps://patch-diff.githubusercontent.com/topics/machinelearning
pruninghttps://patch-diff.githubusercontent.com/topics/pruning
object-detectionhttps://patch-diff.githubusercontent.com/topics/object-detection
pretrained-modelshttps://patch-diff.githubusercontent.com/topics/pretrained-models
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
cpushttps://patch-diff.githubusercontent.com/topics/cpus
onnxhttps://patch-diff.githubusercontent.com/topics/onnx
sparsificationhttps://patch-diff.githubusercontent.com/topics/sparsification
llm-inferencehttps://patch-diff.githubusercontent.com/topics/llm-inference
deepsparsehttps://patch-diff.githubusercontent.com/topics/deepsparse
huawei-noahhttps://patch-diff.githubusercontent.com/huawei-noah
Pretrained-Language-Modelhttps://patch-diff.githubusercontent.com/huawei-noah/Pretrained-Language-Model
Star 3.2k https://patch-diff.githubusercontent.com/login?return_to=%2Fhuawei-noah%2FPretrained-Language-Model
Code https://patch-diff.githubusercontent.com/huawei-noah/Pretrained-Language-Model
Issues https://patch-diff.githubusercontent.com/huawei-noah/Pretrained-Language-Model/issues
Pull requests https://patch-diff.githubusercontent.com/huawei-noah/Pretrained-Language-Model/pulls
pretrained-modelshttps://patch-diff.githubusercontent.com/topics/pretrained-models
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
knowledge-distillationhttps://patch-diff.githubusercontent.com/topics/knowledge-distillation
model-compressionhttps://patch-diff.githubusercontent.com/topics/model-compression
large-scale-distributedhttps://patch-diff.githubusercontent.com/topics/large-scale-distributed
thu-mlhttps://patch-diff.githubusercontent.com/thu-ml
SageAttentionhttps://patch-diff.githubusercontent.com/thu-ml/SageAttention
Star 3.1k https://patch-diff.githubusercontent.com/login?return_to=%2Fthu-ml%2FSageAttention
Code https://patch-diff.githubusercontent.com/thu-ml/SageAttention
Issues https://patch-diff.githubusercontent.com/thu-ml/SageAttention/issues
Pull requests https://patch-diff.githubusercontent.com/thu-ml/SageAttention/pulls
cudahttps://patch-diff.githubusercontent.com/topics/cuda
tritonhttps://patch-diff.githubusercontent.com/topics/triton
attentionhttps://patch-diff.githubusercontent.com/topics/attention
vithttps://patch-diff.githubusercontent.com/topics/vit
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
video-generationhttps://patch-diff.githubusercontent.com/topics/video-generation
mlsyshttps://patch-diff.githubusercontent.com/topics/mlsys
inference-accelerationhttps://patch-diff.githubusercontent.com/topics/inference-acceleration
efficient-attentionhttps://patch-diff.githubusercontent.com/topics/efficient-attention
llmhttps://patch-diff.githubusercontent.com/topics/llm
llm-infrahttps://patch-diff.githubusercontent.com/topics/llm-infra
video-generatehttps://patch-diff.githubusercontent.com/topics/video-generate
https://patch-diff.githubusercontent.com/IntelLabs/nlp-architect
IntelLabshttps://patch-diff.githubusercontent.com/IntelLabs
nlp-architecthttps://patch-diff.githubusercontent.com/IntelLabs/nlp-architect
Star 2.9k https://patch-diff.githubusercontent.com/login?return_to=%2FIntelLabs%2Fnlp-architect
Code https://patch-diff.githubusercontent.com/IntelLabs/nlp-architect
Issues https://patch-diff.githubusercontent.com/IntelLabs/nlp-architect/issues
Pull requests https://patch-diff.githubusercontent.com/IntelLabs/nlp-architect/pulls
Discussions https://patch-diff.githubusercontent.com/IntelLabs/nlp-architect/discussions
nlphttps://patch-diff.githubusercontent.com/topics/nlp
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
tensorflowhttps://patch-diff.githubusercontent.com/topics/tensorflow
nluhttps://patch-diff.githubusercontent.com/topics/nlu
transformershttps://patch-diff.githubusercontent.com/topics/transformers
pytorchhttps://patch-diff.githubusercontent.com/topics/pytorch
deeplearninghttps://patch-diff.githubusercontent.com/topics/deeplearning
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
berthttps://patch-diff.githubusercontent.com/topics/bert
dynethttps://patch-diff.githubusercontent.com/topics/dynet
aaron-xichenhttps://patch-diff.githubusercontent.com/aaron-xichen
pytorch-playgroundhttps://patch-diff.githubusercontent.com/aaron-xichen/pytorch-playground
Star 2.7k https://patch-diff.githubusercontent.com/login?return_to=%2Faaron-xichen%2Fpytorch-playground
Code https://patch-diff.githubusercontent.com/aaron-xichen/pytorch-playground
Issues https://patch-diff.githubusercontent.com/aaron-xichen/pytorch-playground/issues
Pull requests https://patch-diff.githubusercontent.com/aaron-xichen/pytorch-playground/pulls
pytorchhttps://patch-diff.githubusercontent.com/topics/pytorch
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
pytorch-tutorialhttps://patch-diff.githubusercontent.com/topics/pytorch-tutorial
pytorch-tutorialshttps://patch-diff.githubusercontent.com/topics/pytorch-tutorials
nunchaku-aihttps://patch-diff.githubusercontent.com/nunchaku-ai
ComfyUI-nunchakuhttps://patch-diff.githubusercontent.com/nunchaku-ai/ComfyUI-nunchaku
Star 2.7k https://patch-diff.githubusercontent.com/login?return_to=%2Fnunchaku-ai%2FComfyUI-nunchaku
Code https://patch-diff.githubusercontent.com/nunchaku-ai/ComfyUI-nunchaku
Issues https://patch-diff.githubusercontent.com/nunchaku-ai/ComfyUI-nunchaku/issues
Pull requests https://patch-diff.githubusercontent.com/nunchaku-ai/ComfyUI-nunchaku/pulls
Discussions https://patch-diff.githubusercontent.com/nunchaku-ai/ComfyUI-nunchaku/discussions
fluxhttps://patch-diff.githubusercontent.com/topics/flux
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
diffusionhttps://patch-diff.githubusercontent.com/topics/diffusion
mlsyshttps://patch-diff.githubusercontent.com/topics/mlsys
comfyuihttps://patch-diff.githubusercontent.com/topics/comfyui
genaihttps://patch-diff.githubusercontent.com/topics/genai
stochasticaihttps://patch-diff.githubusercontent.com/stochasticai
xTuringhttps://patch-diff.githubusercontent.com/stochasticai/xTuring
Star 2.7k https://patch-diff.githubusercontent.com/login?return_to=%2Fstochasticai%2FxTuring
Code https://patch-diff.githubusercontent.com/stochasticai/xTuring
Issues https://patch-diff.githubusercontent.com/stochasticai/xTuring/issues
Pull requests https://patch-diff.githubusercontent.com/stochasticai/xTuring/pulls
Discussions https://patch-diff.githubusercontent.com/stochasticai/xTuring/discussions
https://discord.gg/TgHXuSJEk6https://discord.gg/TgHXuSJEk6
adapterhttps://patch-diff.githubusercontent.com/topics/adapter
deep-learninghttps://patch-diff.githubusercontent.com/topics/deep-learning
llamahttps://patch-diff.githubusercontent.com/topics/llama
lorahttps://patch-diff.githubusercontent.com/topics/lora
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
language-modelhttps://patch-diff.githubusercontent.com/topics/language-model
mistralhttps://patch-diff.githubusercontent.com/topics/mistral
fine-tuninghttps://patch-diff.githubusercontent.com/topics/fine-tuning
pefthttps://patch-diff.githubusercontent.com/topics/peft
finetuninghttps://patch-diff.githubusercontent.com/topics/finetuning
mixed-precisionhttps://patch-diff.githubusercontent.com/topics/mixed-precision
gpt-2https://patch-diff.githubusercontent.com/topics/gpt-2
gpt-jhttps://patch-diff.githubusercontent.com/topics/gpt-j
llmhttps://patch-diff.githubusercontent.com/topics/llm
generative-aihttps://patch-diff.githubusercontent.com/topics/generative-ai
gen-aihttps://patch-diff.githubusercontent.com/topics/gen-ai
pytorchhttps://patch-diff.githubusercontent.com/pytorch
aohttps://patch-diff.githubusercontent.com/pytorch/ao
Star 2.6k https://patch-diff.githubusercontent.com/login?return_to=%2Fpytorch%2Fao
Code https://patch-diff.githubusercontent.com/pytorch/ao
Issues https://patch-diff.githubusercontent.com/pytorch/ao/issues
Pull requests https://patch-diff.githubusercontent.com/pytorch/ao/pulls
Discussions https://patch-diff.githubusercontent.com/pytorch/ao/discussions
traininghttps://patch-diff.githubusercontent.com/topics/training
sparsityhttps://patch-diff.githubusercontent.com/topics/sparsity
cudahttps://patch-diff.githubusercontent.com/topics/cuda
inferencehttps://patch-diff.githubusercontent.com/topics/inference
optimizerhttps://patch-diff.githubusercontent.com/topics/optimizer
pytorchhttps://patch-diff.githubusercontent.com/topics/pytorch
transformerhttps://patch-diff.githubusercontent.com/topics/transformer
offloadinghttps://patch-diff.githubusercontent.com/topics/offloading
llamahttps://patch-diff.githubusercontent.com/topics/llama
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
mxhttps://patch-diff.githubusercontent.com/topics/mx
brrrhttps://patch-diff.githubusercontent.com/topics/brrr
dtypeshttps://patch-diff.githubusercontent.com/topics/dtypes
float8https://patch-diff.githubusercontent.com/topics/float8
vllm-projecthttps://patch-diff.githubusercontent.com/vllm-project
llm-compressorhttps://patch-diff.githubusercontent.com/vllm-project/llm-compressor
Sponsor https://patch-diff.githubusercontent.com/sponsors/vllm-project
Star 2.6k https://patch-diff.githubusercontent.com/login?return_to=%2Fvllm-project%2Fllm-compressor
Code https://patch-diff.githubusercontent.com/vllm-project/llm-compressor
Issues https://patch-diff.githubusercontent.com/vllm-project/llm-compressor/issues
Pull requests https://patch-diff.githubusercontent.com/vllm-project/llm-compressor/pulls
Discussions https://patch-diff.githubusercontent.com/vllm-project/llm-compressor/discussions
sparsityhttps://patch-diff.githubusercontent.com/topics/sparsity
compressionhttps://patch-diff.githubusercontent.com/topics/compression
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
intelhttps://patch-diff.githubusercontent.com/intel
neural-compressorhttps://patch-diff.githubusercontent.com/intel/neural-compressor
Star 2.6k https://patch-diff.githubusercontent.com/login?return_to=%2Fintel%2Fneural-compressor
Code https://patch-diff.githubusercontent.com/intel/neural-compressor
Issues https://patch-diff.githubusercontent.com/intel/neural-compressor/issues
Pull requests https://patch-diff.githubusercontent.com/intel/neural-compressor/pulls
Discussions https://patch-diff.githubusercontent.com/intel/neural-compressor/discussions
sparsityhttps://patch-diff.githubusercontent.com/topics/sparsity
pruninghttps://patch-diff.githubusercontent.com/topics/pruning
quantizationhttps://patch-diff.githubusercontent.com/topics/quantization
knowledge-distillationhttps://patch-diff.githubusercontent.com/topics/knowledge-distillation
auto-tuninghttps://patch-diff.githubusercontent.com/topics/auto-tuning
int8https://patch-diff.githubusercontent.com/topics/int8
low-precisionhttps://patch-diff.githubusercontent.com/topics/low-precision
quantization-aware-traininghttps://patch-diff.githubusercontent.com/topics/quantization-aware-training
post-training-quantizationhttps://patch-diff.githubusercontent.com/topics/post-training-quantization
awqhttps://patch-diff.githubusercontent.com/topics/awq
int4https://patch-diff.githubusercontent.com/topics/int4
large-language-modelshttps://patch-diff.githubusercontent.com/topics/large-language-models
gptqhttps://patch-diff.githubusercontent.com/topics/gptq
smoothquanthttps://patch-diff.githubusercontent.com/topics/smoothquant
sparsegpthttps://patch-diff.githubusercontent.com/topics/sparsegpt
fp4https://patch-diff.githubusercontent.com/topics/fp4
mxformathttps://patch-diff.githubusercontent.com/topics/mxformat
Curate this topic https://github.com/github/explore/tree/master/CONTRIBUTING.md?source=add-description-quantization
Learn more https://docs.github.com/en/articles/classifying-your-repository-with-topics
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.