Movatterモバイル変換

Theano

維基百科，自由的百科全書

Theano

原作者	蒙特婁大學的蒙特婁學習算法研究所（英語：Montreal Institute for Learning Algorithms）（MILA）
開發者	PyMC開發團隊
首次發布	2007年，18年前（2007）
當前版本	2.28.1（2025年2月24日；穩定版本）^[1]
原始碼庫	github.com/pymc-devs/pytensor
程式語言	Python,CUDA
平台	Linux,macOS,Windows
類型	機器學習,函式庫
許可協議	3條款BSD許可證
網站	pytensor.readthedocs.io/en/latest/

Theano及其分叉PyTensor，是一個Python庫和優化的編譯器，用來操縱和求值數學表達式特別是矩陣值表達式^[2]。在其中，計算使用NumPy風格的語法來表達並被編譯，用來在CPU或者GPU架構上高效的運行。

歷史

[編輯]

Theano是開源項目^[3]，主要由蒙特婁大學的蒙特婁學習算法研究所（英語：Montreal Institute for Learning Algorithms）（MILA）開發^[4]。軟體名字取自古代哲學家Theano（英語：Theano (philosopher)）。在2017年9月28日，Pascal Lamblin發布了來自約書亞·本希奧的一則信息，MILA負責人說：由於更強大的工業參與者的競爭，主要的開發在1.0發行之後將會停止^[5]。Theano 1.0.0隨後在2017年11月15日發行^[6]。

在2018年5月17日，Chris Fonnesbeck代表PyMC開發團隊寫道：PyMC開發者將在他們退場後取得對Theano維護的控制權^[7]。在2021年1月絕大部份的Theano代碼基被重新建造，並增加了通過JAX和Numba的編譯，修訂後的這個計算後端以新名字Aesara發行。2022年11月28日，PyMC團隊宣布採用從Aesara計劃分叉出PyTensor^[8]。

樣例代碼

[編輯]

下列代碼以PyTensor用作介紹的例子：

importpytensorfrompytensorimporttensoraspt# 声明2个符号浮点标量a=pt.dscalar("a")b=pt.dscalar("b")# 建立一个简单的表达式c=a+b# 将这个表达式转换成一个可调用对象，# 它接收'(a, b)'值作为输入并计算出一个值给'c'f_c=pytensor.function([a,b],c)assertf_c(1.5,2.5)==4.0# 计算样例表达式关于'a'的梯度dc=pytensor.grad(c,a)f_dc=pytensor.function([a,b],dc)assertf_dc(1.5,2.5)==1.0

>>>importpytensor>>>frompytensorimporttensoraspt>>>>>># 通过'pytensor.function'编译函数还能优化表达式图>>># 它会移除不必要的运算并将特定运算替代为更有效的运算>>>>>>v=pt.vector("v")>>>M=pt.matrix("M")>>>>>>d=a/a+(M+a).dot(v)>>>>>>pytensor.dprint(d)Add [id A] ├─ ExpandDims{axis=0} [id B] │  └─ True_div [id C] │     ├─ a [id D] │     └─ a [id D] └─ dot [id E]    ├─ Add [id F]    │  ├─ M [id G]    │  └─ ExpandDims{axes=[0, 1]} [id H]    │     └─ a [id D]    └─ v [id I]<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>>>>>>>f_d=pytensor.function([a,v,M],d)>>>>>># 'a/a' -> '1'而点积被替代为BLAS函数(i.e. CGemv)>>>pytensor.dprint(f_d)Add [id A] 5 ├─ [1.] [id B] └─ CGemv{inplace} [id C] 4    ├─ AllocEmpty{dtype='float64'} [id D] 3    │  └─ Shape_i{0} [id E] 2    │     └─ M [id F]    ├─ 1.0 [id G]    ├─ Add [id H] 1    │  ├─ M [id F]    │  └─ ExpandDims{axes=[0, 1]} [id I] 0    │     └─ a [id J]    ├─ v [id K]    └─ 0.0 [id L]<_io.TextIOWrapper name='<stdout>' mode='w' encoding='utf-8'>

參見

[編輯]

深度學習軟體比較（英語：Comparison of deep learning software）
可微分編程

引用

[編輯]

^Release 2.28.1. 2025年2月24日 [2025年3月1日].
^Bergstra, J.; O. Breuleux; F. Bastien; P. Lamblin; R. Pascanu; G. Desjardins; J. Turian; D. Warde-Farley; Y. Bengio.Theano: A CPU and GPU Math Expression Compiler(PDF). Proceedings of the Python for Scientific Computing Conference (SciPy) 2010. 30 June 2010 [2020-11-06]. （原始內容存檔(PDF)於2020-11-01）.
^Github Repository. [2020-11-06]. （原始內容存檔於2020-11-16）.
^deeplearning.net. [2020-11-06]. （原始內容存檔於2017-12-13）.
^Lamblin, Pascal.MILA and the future of Theano. theano-users (郵件列表). 28 September 2017 [28 September 2017]. （原始內容存檔於2011-01-22）.
^Release Notes – Theano 1.0.0 documentation. [2020-11-06]. （原始內容存檔於2020-09-14）.
^Developers, PyMC.Theano, TensorFlow and the Future of PyMC. Medium. 2019-06-01 [2019-08-27]. （原始內容存檔於2020-08-06）（英語）.
^PyMC forked Aesara to PyTensor. [2023-08-17]. （原始內容存檔於2023-07-18）.

外部連結

[編輯]

官方網站 (GitHub)
Theano（頁面存檔備份，存於網際網路檔案館） at Deep Learning, Université de Montréal

閱論編深度學習軟體（英語：Comparison of deep learning software）
開源軟體	Apache Singa（英語：Apache Singa） Blocks（英語：Blocks） Caffe Deeplearning4j Dlib（英語：Dlib） Microsoft Cognitive Toolkit MXNet OpenNN ONNX Runtime PyTorch scikit-learn LangChain Gradio RETURNN（英語：RETURNN） TensorFlow Keras Theano Torch（英語：Torch (machine learning)）
專有	蘋果公司 Core ML IBM 沃森 Neural Designer（英語：Neural Designer） Wolfram Mathematica MATLAB Deep Learning Toolbox
分類比較

閱論編可微分計算
概論	可微分編程自動微分張量微積分信息幾何統計流形神經形態工程（英語：Neuromorphic engineering）模式識別運算學習理論（英語：Computational learning theory）歸納偏置
概念	梯度下降 SGD（英語：Stochastic gradient descent）聚類回歸過適幻覺對抗（英語：Adversarial machine learning）注意力卷積損失函數反向傳播激勵函數 softmax sigmoid ReLU 正則化資料集擴散（英語：Diffusion process）自回歸
應用	機器學習人工神經網絡深度學習科學計算人工智慧語言模型大型語言模型
硬體	TPU VPU IPU（英語：Graphcore）憶阻器 SpiNNaker（英語：SpiNNaker）
軟體庫	Theano TensorFlow Keras PyTorch JAX Flux.jl（英語：Flux (machine-learning framework)）
架構	多層感知器（MLP）循環神經網絡（RNN）長短期記憶（LSTM）門控循環單元（英語：Gated recurrent unit）（GRU）卷積神經網絡（CNN）殘差神經網絡（ResNet）變換器自編碼器變分自編碼器（VAE）生成對抗網絡（GAN）圖神經網絡（英語：Graph neural network）（GNN）迴響狀態網絡（英語：Echo state network）（ESN）神經圖靈機（NTM）可微分神經計算機（英語：Differentiable neural computer）（DNC）
主題計算機編程技術分類人工神經網絡機器學習

取自「https://zh.wikipedia.org/w/index.php?title=Theano&oldid=79763221」

分類：

隱藏分類：

[8]ページ先頭