Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rs/docs update #1008

Open
wants to merge 155 commits into
base: main
Choose a base branch
from
Open
Changes from 1 commit
Commits
Show all changes
155 commits
Select commit Hold shift + click to select a range
7cd8dbf
added user guide
Mar 16, 2023
a818bce
Delete qa_server_config.yaml
robertgshaw2-redhat Mar 16, 2023
0dfb6e1
removed gatsby headers
Mar 16, 2023
b0d3454
update benchmarking
Mar 16, 2023
6124994
Update benchmarking.md
robertgshaw2-redhat Mar 16, 2023
a18a79b
Update and rename benchmarking.md to deepsparse-benchmarking.md
robertgshaw2-redhat Mar 16, 2023
639b8c1
Update deepsparse-pipelines.md
robertgshaw2-redhat Mar 16, 2023
5b5c23a
Update deepsparse-server.md
robertgshaw2-redhat Mar 16, 2023
15624f6
Update scheduler.md
robertgshaw2-redhat Mar 16, 2023
17a2e68
Update user-guide/scheduler.md
robertgshaw2-redhat Mar 16, 2023
8053834
Update user-guide/scheduler.md
robertgshaw2-redhat Mar 16, 2023
eb57109
Update user-guide/scheduler.md
robertgshaw2-redhat Mar 16, 2023
3946aca
Update user-guide/deepsparse-pipelines.md
robertgshaw2-redhat Mar 16, 2023
d9803bb
Update user-guide/deepsparse-pipelines.md
robertgshaw2-redhat Mar 16, 2023
7e0a97b
Update user-guide/deepsparse-pipelines.md
robertgshaw2-redhat Mar 16, 2023
ca0f27d
added README
Mar 16, 2023
61d1126
Merge branch 'rs/docs-update-user-guide' of github.com:neuralmagic/de…
Mar 16, 2023
7a55eec
Update README.md
robertgshaw2-redhat Mar 16, 2023
f18acb3
Update README.md
robertgshaw2-redhat Mar 16, 2023
9e5f05f
Update README.md
robertgshaw2-redhat Mar 16, 2023
8a2c666
Update README.md
robertgshaw2-redhat Mar 16, 2023
8ff12f5
Update README.md
robertgshaw2-redhat Mar 16, 2023
19804f2
Update README.md
robertgshaw2-redhat Mar 16, 2023
53e9a93
Update README.md
robertgshaw2-redhat Mar 16, 2023
d8972f2
added sentiment-analysis
Mar 16, 2023
e9e2685
Update sentiment-analysis.md
robertgshaw2-redhat Mar 16, 2023
dd5bdfd
added installation
Mar 16, 2023
b84e01e
Update installation.md
robertgshaw2-redhat Mar 16, 2023
c145ac4
Update installation.md
robertgshaw2-redhat Mar 16, 2023
f978ba4
Update README.md
robertgshaw2-redhat Mar 16, 2023
380cf6e
Update deepsparse-pipelines.md
robertgshaw2-redhat Mar 16, 2023
643af49
Update deepsparse-pipelines.md
robertgshaw2-redhat Mar 16, 2023
12092ee
add text classification doc
mwitiderrick Mar 20, 2023
13ac283
add text classification doc
mwitiderrick Mar 21, 2023
3c256d2
add text classification doc
mwitiderrick Mar 21, 2023
eaaca7a
Use Engine
mwitiderrick Mar 21, 2023
1b5cf02
add question answering document
mwitiderrick Mar 21, 2023
dd2bed8
add token classification document
mwitiderrick Mar 22, 2023
2108ed6
update benchmarks
mwitiderrick Mar 22, 2023
d057130
add transformers extraction embedding doc
mwitiderrick Mar 22, 2023
ddb3091
add general embedding doc
mwitiderrick Mar 22, 2023
e99a196
add image classification doc
mwitiderrick Mar 23, 2023
a7acf3c
add image classification doc
mwitiderrick Mar 23, 2023
2b74cf1
add yolo document
mwitiderrick Mar 23, 2023
9fed9c8
add YOLACT doc
mwitiderrick Mar 24, 2023
bd227f0
update yolov5 doc
mwitiderrick Mar 24, 2023
1a7334d
update yolov5 doc
mwitiderrick Mar 24, 2023
46e78c2
Update yolov5-object-detection.md
mwitiderrick Apr 3, 2023
92e4dc8
Update image-classification.md
mwitiderrick Apr 3, 2023
d259fbc
Update image-segmentation-yolact.md
mwitiderrick Apr 3, 2023
59c3efe
Apply suggestions from code review
mgoin Apr 12, 2023
0412a98
Merge branch 'main' into rs/docs-update-user-guide
mgoin Apr 12, 2023
5f77059
Merge branch 'main' into rs/docs-update-use-cases
mgoin Apr 13, 2023
6a1a7d9
RS Edits to CV
Apr 17, 2023
ae25136
updated embedding extraction example
Apr 17, 2023
94b2f04
updated sentiment analysis and text classification examples
Apr 17, 2023
4ebdc59
added zero shot text classification
Apr 17, 2023
0a56876
RS edited token classification
Apr 18, 2023
e31c22e
updated question answering example
Apr 18, 2023
f260be7
updated embedding extraction case
Apr 18, 2023
558773c
Merge pull request #1003 from neuralmagic/rs/docs-update-user-guide
robertgshaw2-redhat Apr 18, 2023
05a7f07
updated directory structure
Apr 18, 2023
588b658
updated dir structure
Apr 18, 2023
7cb5671
updated dir structure
Apr 18, 2023
b483960
Update image-classification.md
robertgshaw2-redhat Apr 18, 2023
0fd2eb1
Update image-classification.md
robertgshaw2-redhat Apr 18, 2023
d89739b
Update image-classification.md
robertgshaw2-redhat Apr 18, 2023
6942c5c
Update object-detection-yolov5.md
robertgshaw2-redhat Apr 18, 2023
bc28cf1
Update object-detection-yolov5.md
robertgshaw2-redhat Apr 18, 2023
2297bec
Update object-detection-yolov5.md
robertgshaw2-redhat Apr 18, 2023
ef32b3a
Update image-segmentation-yolact.md
robertgshaw2-redhat Apr 18, 2023
5390079
Update image-segmentation-yolact.md
robertgshaw2-redhat Apr 18, 2023
0755b3a
Update embedding-extraction.md
robertgshaw2-redhat Apr 18, 2023
739ba15
Update sentiment-analysis.md
mwitiderrick Apr 19, 2023
dc724e1
Update question-answering.md
mwitiderrick Apr 19, 2023
59790f9
Update text-classification.md
mwitiderrick Apr 19, 2023
8fbee7a
Update embedding-extraction.md
robertgshaw2-redhat Apr 19, 2023
c7287c5
Create README.md
robertgshaw2-redhat Apr 19, 2023
b00ea3c
Update README.md
robertgshaw2-redhat Apr 19, 2023
cab0f2c
Update README.md
robertgshaw2-redhat Apr 19, 2023
c02a9a4
Update README.md
robertgshaw2-redhat Apr 19, 2023
4e9a61c
Update README.md
robertgshaw2-redhat Apr 19, 2023
e0b3fc6
Update README.md
robertgshaw2-redhat Apr 19, 2023
ea2915c
Update README.md
robertgshaw2-redhat Apr 19, 2023
f05ba01
Update README.md
robertgshaw2-redhat Apr 19, 2023
847621b
Update deepsparse-pipelines.md
robertgshaw2-redhat Apr 19, 2023
1b672c9
Update deepsparse-pipelines.md
robertgshaw2-redhat Apr 19, 2023
0482754
Update deepsparse-pipelines.md
robertgshaw2-redhat Apr 19, 2023
a78f494
Update deepsparse-server.md
robertgshaw2-redhat Apr 19, 2023
f9914bb
Update deepsparse-pipelines.md
robertgshaw2-redhat Apr 19, 2023
e6420d3
Update deepsparse-server.md
robertgshaw2-redhat Apr 19, 2023
66e264b
Update README.md
robertgshaw2-redhat Apr 19, 2023
3633383
Update README.md
robertgshaw2-redhat Apr 19, 2023
981774a
Update README.md
robertgshaw2-redhat Apr 19, 2023
a2ecfa7
Update README.md
robertgshaw2-redhat Apr 19, 2023
b8af6ba
Update README.md
robertgshaw2-redhat Apr 19, 2023
0e123ef
Update image-segmentation-yolact.md
robertgshaw2-redhat Apr 19, 2023
c2aa202
Update image-classification.md
robertgshaw2-redhat Apr 19, 2023
2e267cd
Update object-detection-yolov5.md
robertgshaw2-redhat Apr 19, 2023
bc12263
Update question-answering.md
robertgshaw2-redhat Apr 19, 2023
d18f1eb
Update sentiment-analysis.md
robertgshaw2-redhat Apr 19, 2023
c55572c
Update text-classification.md
robertgshaw2-redhat Apr 19, 2023
a444ee3
Update token-classification.md
robertgshaw2-redhat Apr 19, 2023
94f2644
Update transformers-embedding-extraction.md
robertgshaw2-redhat Apr 19, 2023
cb8b5dd
Update zero-shot-text-classification.md
robertgshaw2-redhat Apr 19, 2023
8bca17d
Update question-answering.md
robertgshaw2-redhat Apr 19, 2023
248ea13
Update question-answering.md
robertgshaw2-redhat Apr 19, 2023
c6dda09
Update question-answering.md
robertgshaw2-redhat Apr 19, 2023
3f1a30d
Update sentiment-analysis.md
robertgshaw2-redhat Apr 19, 2023
2d3a89e
Update sentiment-analysis.md
robertgshaw2-redhat Apr 19, 2023
c90cb3e
Update sentiment-analysis.md
robertgshaw2-redhat Apr 19, 2023
a519ba0
Update text-classification.md
robertgshaw2-redhat Apr 19, 2023
417cf3a
Update text-classification.md
robertgshaw2-redhat Apr 19, 2023
b13e28e
Update text-classification.md
robertgshaw2-redhat Apr 19, 2023
f5a535c
Update text-classification.md
robertgshaw2-redhat Apr 19, 2023
bae0836
Update token-classification.md
robertgshaw2-redhat Apr 19, 2023
98ec61e
Update token-classification.md
robertgshaw2-redhat Apr 19, 2023
8d1c257
Update zero-shot-text-classification.md
robertgshaw2-redhat Apr 19, 2023
c3403d5
Update zero-shot-text-classification.md
robertgshaw2-redhat Apr 19, 2023
f86a976
Update zero-shot-text-classification.md
robertgshaw2-redhat Apr 19, 2023
ffc4911
Update transformers-embedding-extraction.md
robertgshaw2-redhat Apr 19, 2023
645fd24
Update embedding-extraction.md
robertgshaw2-redhat Apr 19, 2023
f48b58d
Update image-classification.md
robertgshaw2-redhat Apr 19, 2023
aa59bc6
Update image-segmentation-yolact.md
robertgshaw2-redhat Apr 19, 2023
a62be31
Update object-detection-yolov5.md
robertgshaw2-redhat Apr 19, 2023
cb7d614
Update README.md
robertgshaw2-redhat Apr 19, 2023
352f7e3
Update README.md
robertgshaw2-redhat Apr 19, 2023
d215b47
Update README.md
robertgshaw2-redhat Apr 19, 2023
f365fba
Update README.md
robertgshaw2-redhat Apr 19, 2023
73ec549
Update README.md
robertgshaw2-redhat Apr 19, 2023
08b5c69
Update README.md
robertgshaw2-redhat Apr 19, 2023
4581506
Update README.md
robertgshaw2-redhat Apr 19, 2023
3e73375
Update README.md
robertgshaw2-redhat Apr 19, 2023
c560c09
Update README.md
robertgshaw2-redhat Apr 19, 2023
2435705
Update README.md
robertgshaw2-redhat Apr 19, 2023
8675523
Add files via upload
robertgshaw2-redhat Apr 19, 2023
eef8116
Update README.md
robertgshaw2-redhat Apr 19, 2023
0bc84a8
Update README.md
robertgshaw2-redhat Apr 19, 2023
a31969a
Update README.md
robertgshaw2-redhat Apr 19, 2023
d1fa169
added copyrights
Apr 19, 2023
b99fd9f
Merge branch 'main' into rs/docs-update-use-cases
robertgshaw2-redhat Apr 19, 2023
593e130
Update README.md
robertgshaw2-redhat Apr 19, 2023
60b8dbd
Update README.md
robertgshaw2-redhat Apr 19, 2023
9be3f50
Update README.md
robertgshaw2-redhat Apr 19, 2023
7ec9e9e
Update README.md
robertgshaw2-redhat Apr 19, 2023
6be665e
Update README.md
robertgshaw2-redhat Apr 19, 2023
1e85839
Update README.md
robertgshaw2-redhat Apr 19, 2023
517d5d8
Update README.md
robertgshaw2-redhat Apr 19, 2023
a84edb0
Update README.md
robertgshaw2-redhat Apr 19, 2023
89714eb
Update README.md
robertgshaw2-redhat Apr 19, 2023
bc4590b
Update README.md
robertgshaw2-redhat Apr 19, 2023
f8893ed
Update README.md
robertgshaw2-redhat Apr 19, 2023
0d799b7
reset to qaed state
Apr 24, 2023
3b800a8
Merge branch 'main' into rs/docs-update
robertgshaw2-redhat Apr 24, 2023
4a15621
restored src/*
Apr 24, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Update README.md
robertgshaw2-redhat authored Mar 16, 2023

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
commit 9e5f05fa1c63f4bdfe7aeee403f9781e10ef714d
9 changes: 4 additions & 5 deletions user-guide/README.md
Original file line number Diff line number Diff line change
@@ -20,10 +20,9 @@ See the [DeepSparse Installation page](https://docs.neuralmagic.com/get-started/

DeepSparse's key feature is its performance on commodity CPUs. DeepSparse is competitive with other CPU runtimes
like ONNX Runtime for unoptimized dense models. However, when optimization techniques like pruning and quantization
are applied to a model, DeepSparse can achieve an order-of-magnitude speedup.

As an example, let's compare DeepSparse and ORT's performance on BERT. In SparseZoo, there is [90% pruned-quantized
BERT]() which retains >99% of the accuracy of the baseline dense model. Running this model on a `c6i.16xlarge` instance, DeepSparse achieves a ***12x speedup*** over ORT!
are applied to a model, DeepSparse can achieve an order-of-magnitude speedup. As an example, let's compare DeepSparse and ORT's performance on BERT. In SparseZoo, there is [90% pruned-quantized BERT](https://sparsezoo.neuralmagic.com/models/nlp%2Fsentiment_analysis%2Fobert-base%2Fpytorch%2Fhuggingface%2Fsst2%2Fpruned90_quant-none).

Running this model on a `c6i.16xlarge` instance, DeepSparse achieves a ***12x speedup*** over ORT!

ORT achieves 18.5 items/second running BERT (make sure you have ORT installed `pip install onnxruntime`):
```bash
@@ -38,7 +37,7 @@ deepsparse.benchmark zoo:nlp/text_classification/obert-base/pytorch/huggingface/
DeepSparse achieves 226 items/second running the pruned-quantized version of BERT:

```bash
deepsparse.benchmark zoo:nlp/sentiment_analysis/obert-base/pytorch/huggingface/sst2/pruned90_quant-none -b 32 -s sync -nstreams 1 -e onnxruntime
deepsparse.benchmark zoo:nlp/sentiment_analysis/obert-base/pytorch/huggingface/sst2/pruned90_quant-none -b 64 -s sync -nstreams 1 -e onnxruntime

>> Original Model Path: zoo:nlp/text_classification/obert-base/pytorch/huggingface/mnli/pruned90_quant-none
>> Batch Size: 64