onnxruntime_transformers

transformers for production runtime, 3x faster on cpu, no pytorch nor tensorflow included

convert models to onnx

install converter pip install optimum[exporters]

convert embedding model to onnx

optimum-cli export onnx --task sentence-similarity --model "infgrad/stella-base-zh-v3-1792d" bert_embed

convert sentence correction model to onnx

optimum-cli export onnx --task fill-mask --model "shibing624/macbert4csc-base-chinese" bert_csc

convert ner model to onnx

optimum-cli export onnx --task token-classification --model "shibing624/bert4ner-base-chinese" bert_ner

inference with onnx

generate embeddings

from onnxruntime_transformers import OnnxruntimeTransformers
encoder = OnnxruntimeTransformers("./bert_embed/tokenizer.json", "./bert_embed/model.onnx")
embeddings = encoder.encode([
    "how are you",
    "I'm fine thank you, and you?",
])

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
onnxruntime_transformers		onnxruntime_transformers
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

onnxruntime_transformers

convert models to onnx

inference with onnx

About

Releases

Packages

Languages

License

billju/onnxruntime_transformers

Folders and files

Latest commit

History

Repository files navigation

onnxruntime_transformers

convert models to onnx

inference with onnx

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages