

Apache TVM


Apache TVM (Tensor Virtual Machine) is an open-source machine learning compiler framework for the optimization and execution of machine learning models across a variety of computing platforms, including central processing units (CPUs), graphics processing units (GPUs), and specialized accelerators such as field-programmable gate arrays (FPGAs). It provides an end-to-end compilation stack that lowers high-level computational graphs from frameworks such as TensorFlow, PyTorch, and ONNX into optimized machine code, using a modular and extensible architecture.
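A typical end-to-end flow imports a trained model, compiles it for a target, and runs it through TVM's runtime. A minimal sketch, assuming an ONNX model and the Relay ONNX frontend (the file name and input shape are illustrative):

import onnx
import tvm
from tvm import relay
from tvm.contrib import graph_executor

# Load a trained model and convert it into a Relay module
onnx_model = onnx.load("model.onnx")  # hypothetical model file
mod, params = relay.frontend.from_onnx(onnx_model, shape={"input": (1, 3, 224, 224)})

# Compile the module into machine code for a CPU target
with tvm.transform.PassContext(opt_level=3):
    lib = relay.build(mod, target="llvm", params=params)

# Execute the compiled model with the graph runtime
dev = tvm.cpu()
module = graph_executor.GraphModule(lib["default"](dev))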

History


TVM originated in 2017 as an academic project of the SAMPL group at the University of Washington's Paul G. Allen School of Computer Science and Engineering, led by Tianqi Chen. The software was formally introduced at OSDI 2018.[1] In March 2019, the project joined the Apache Incubator.

Architecture


TVM is composed of several major components:

Relay IR: Relay IR is a high-level functional intermediate representation (IR) for representing and transforming neural networks prior to low-level optimization and code generation. Introduced as the successor to NNVM IR, Relay encodes computation graphs as abstract syntax trees (ASTs) and extends them with language features such as first-class functions, recursion, and a dependent-like type system that supports shape-aware tensor types. Relay represents a neural network as a program composed of nested expressions: each operator invocation is a CallNode, and the entire graph is structured as a series of expressions and bindings. Relay uses an SSA-like structure in which temporary identifiers (e.g., %1, %2) correspond to let-bound expressions.

Relay's core modules include:

  • A Python interface that enables users to interact with the compiler system. The frontend includes a Python library containing standard deep learning operators and Relay-specific functions, as in the example below:
import tvm
from tvm import relay

# Define a simple function using the Relay operator library
def simple_addition(x, y):
    return relay.add(x, y)

# Create Relay variables typed as 3x3 float32 tensors
x = relay.var("x", relay.TensorType((3, 3), dtype="float32"))
y = relay.var("y", relay.TensorType((3, 3), dtype="float32"))

# Build the call expression and wrap it in a Relay function
add_expr = simple_addition(x, y)
add_fn = relay.Function([x, y], add_expr)

# Printing the module shows the textual form of the Relay IR
print(tvm.IRModule.from_expr(add_fn))
  • Relay IR also supports reverse-mode automatic differentiation by transforming functions so that they compute both their values and the corresponding partial derivatives, using a functional-programming approach based on dual numbers and dynamic closures for backpropagation. Each function is transformed to calculate its result alongside the partial derivatives, which are propagated upstream via references. Relay can therefore support higher-order functions and closures, enabling efficient differentiation even for programs with complex control flow and higher-order constructs (see the sketch following this list).[2]
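
A minimal sketch of this transformation, assuming a TVM release that exposes the relay.transform.gradient pass (the workload, shapes, and mode argument here are illustrative):

import tvm
from tvm import relay

# f(x) = x * x, elementwise on a 3x3 tensor
x = relay.var("x", shape=(3, 3), dtype="float32")
fn = relay.Function([x], relay.multiply(x, x))

# Type inference is required before differentiation
mod = tvm.IRModule.from_expr(fn)
mod = relay.transform.InferType()(mod)

# Produce a function returning the value alongside its gradients via reverse mode
grad_fn = relay.transform.gradient(mod["main"], mode="higher_order")
print(tvm.IRModule.from_expr(grad_fn))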

TE (Tensor Expression) language: A domain-specific language for defining tensor computations and applying optimizations such as loop transformations, memory layout adjustments, and parallel execution.
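A minimal sketch of the TE workflow, assuming a classic TVM release that exposes the te scheduling API (the vector-add workload and split factor are illustrative):

import tvm
from tvm import te

# Declare a vector-add computation over tensors of length n
n = te.var("n")
A = te.placeholder((n,), name="A")
B = te.placeholder((n,), name="B")
C = te.compute(A.shape, lambda i: A[i] + B[i], name="C")

# Create a schedule, then apply loop splitting and parallelization
s = te.create_schedule(C.op)
outer, inner = s[C].split(C.op.axis[0], factor=32)
s[C].parallel(outer)

# Lower to inspect the transformed loop nest
print(tvm.lower(s, [A, B, C], simple_mode=True))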

AutoTVM and Ansor: AutoTVM and Ansor are TVM's auto-tuning engines, which search for the most efficient scheduling parameters for tensor computations. AutoTVM combines machine learning with statistical cost models to explore a range of possible optimizations, evaluating the performance of candidate schedules across various hardware targets. Ansor, a more recent addition, builds on AutoTVM's capabilities by using search algorithms and model-based techniques to explore the configuration space automatically (see the sketch below).[3][4]
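A minimal auto-scheduling sketch using Ansor's tvm.auto_scheduler interface, assuming a classic TVM release (the matmul workload, sizes, trial budget, and log-file name are illustrative):

import tvm
from tvm import te, auto_scheduler

# Register the workload so the auto-scheduler can rebuild it from its arguments
@auto_scheduler.register_workload
def matmul(N, M, K):
    A = te.placeholder((N, K), name="A")
    B = te.placeholder((K, M), name="B")
    k = te.reduce_axis((0, K), name="k")
    C = te.compute((N, M), lambda i, j: te.sum(A[i, k] * B[k, j], axis=k), name="C")
    return [A, B, C]

# Define the search task and a small tuning budget
task = auto_scheduler.SearchTask(func=matmul, args=(128, 128, 128), target=tvm.target.Target("llvm"))
options = auto_scheduler.TuningOptions(
    num_measure_trials=64,  # illustrative; real searches use many more trials
    measure_callbacks=[auto_scheduler.RecordToFile("matmul_tuning.json")],
)

# Search for a schedule, then apply the best record found
task.tune(options)
sch, args = task.apply_best("matmul_tuning.json")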

BYOC (Bring Your Own Codegen): A plugin mechanism allowing hardware vendors to integrate their own code generation backends or libraries. This enables the use of hardware-specific instruction sets, specialized libraries, and custom optimization routines that are tailored to the needs of proprietary or non-standard hardware architectures.
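A sketch of the typical BYOC partitioning flow, assuming an existing Relay module mod and an external codegen registered under the name "dnnl" (both illustrative):

import tvm
from tvm import relay

# mod: an existing Relay IRModule (assumed to be built earlier)
seq = tvm.transform.Sequential([
    relay.transform.AnnotateTarget("dnnl"),   # tag operators the external codegen supports
    relay.transform.MergeCompilerRegions(),   # merge adjacent tagged regions
    relay.transform.PartitionGraph(),         # split regions into external functions
])
partitioned = seq(mod)

# The partitioned module compiles as usual; external regions go to the vendor codegen
lib = relay.build(partitioned, target="llvm")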

Applications


Apache TVM has been applied in embedded systems, data center inference workloads, and edge computing devices. Cloud providers and hardware vendors including AWS, AMD, ARM, and Qualcomm have contributed to or adopted TVM for compiling deep learning workloads to run efficiently on their hardware.[5][6][7] Research applications of TVM include automatic scheduling, hardware-aware neural architecture search, and integration with compiler infrastructures such as LLVM and MLIR.

References

  1. ^ Chen, Tianqi, et al. "TVM: An Automated End-to-End Optimizing Compiler for Deep Learning." OSDI '18: Proceedings of the 13th USENIX Conference on Operating Systems Design and Implementation, 2018.
  2. ^ Roesch, Jared, et al. "Relay: A New IR for Machine Learning Frameworks." Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages (MAPL '18), 2018.
  3. ^ Zheng, Lianmin, et al. "Ansor: Generating High-Performance Tensor Programs for Deep Learning." Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI '20), 2020.
  4. ^ Kwon, Donggyu, et al. "Learning to Optimize Tensor Programs with a Graph-Based Approach." ICLR 2021.
  5. ^ AWS Labs. "AWS Neuron SDK and Apache TVM." GitHub repository.
  6. ^ AMD. "Accelerate PyTorch Models using torch.compile on AMD GPUs with ROCm." AMD Developer Blog, 2023.
  7. ^ Arm Developer. "Resources for Ethos-U."