Movatterモバイル変換

Jan Verschelde¹¹

Part of the book series:Lecture Notes in Computer Science ((LNCS,volume 14938))

Included in the following conference series:

International Workshop on Computer Algebra in Scientific Computing

254Accesses

Abstract

A polynomial homotopy is a family of polynomial systems, typically in one parametert. Our problem is to compute power series expansions of the coordinates of the solutions in the parametert, accurately, using multiple double arithmetic. One application of this problem is the location of the nearest singular solution in a polynomial homotopy, via the theorem of Fabry. Power series serve as input to construct Padé approximations.

Exploiting the massive parallelism of Graphics Processing Units (GPUs) capable of performing several trillions floating-point operations per second, the objective is to compensate for the cost overhead caused by arithmetic with power series in multiple double precision. The application of Newton’s method for this problem requires the evaluation and differentiation of polynomials, followed by solving a blocked lower triangular linear system. Experimental results are obtained on NVIDIA GPUs, in particular the RTX 2080, RTX 4080, P100, V100, and A100.

Code generated by the CAMPARY software is used to obtain results in double double, quad double, and octo double precision. The programs in this study are self contained, available in a public github repository under the GPL-v3.0 License.

Supported by the National Science Foundation under grant DMS 1854513.

This is a preview of subscription content,log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: JPY 3498; Price includes VAT (Japan)

eBook: JPY 14871; Price includes VAT (Japan)

Softcover Book: JPY 18589; Price includes VAT (Japan)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Computing matrix trigonometric functions with GPUs through Matlab

Article07 April 2018

An Efficient Quasi-Newton Method with Tensor Product Implementation for Solving Quasi-Linear Elliptic Equations and Systems

ArticleOpen access30 April 2025

Paramotopy: Parameter Homotopies in Parallel

References

Abdelfattah, A., et al.: A survey of numerical linear algebra methods utilizing mixed-precision arithmetic. Int. J. High Perform. Comput. Appl.35(4), 344–369 (2021)
Article Google Scholar
Agullo, E., Augonnet, C., Dongarra, J., Faverge, M.: QR factorization on a multicore node enhanced with multiple GPU accelerators. In: 2011 IEEE International Parallel and Distributed Processing Symposium, pp. 932–943. IEEE (2011)
Google Scholar
Alexander, J.C., Yorke, J.A.: The homotopy continuation method: numerically implementable topological procedures. Trans. Am. Math. Soc.242, 271–284 (1978)
Article MathSciNet Google Scholar
Allgower, E.L., Georg, K.: Introduction to Numerical Continuation Methods, Classics in Applied Mathematics, vol. 45. SIAM (2003)
Google Scholar
Anderson, M., Ballard, G., Demmel, J., Kreutzer, K.: Communication-avoiding QR decomposition for GPUs. In: 2011 IEEE International Parallel and Distributed Processing Symposium, pp. 48–58. IEEE (2011)
Google Scholar
Baboulin, M., Dongarra, J., Tomov, S.: Some issues in dense linear algebra for multicore and special purpose architectures. Technical Report UT-CS-08-200, University of Tennessee (2008)
Google Scholar
Baker, Jr, G. A., Graves-Morris, P.: Padé Approximants. Cambridge University Press, Cambridge (1996)
Google Scholar
Bischof, C., Van Loan, C.F.: The WY representation for products of Householder matrices. SIAM J. Sci. Stat. Comput.8(1), s1–s13 (1987)
Article MathSciNet Google Scholar
Bliss, N., Verschelde, J.: The method of Gauss-Newton to compute power series solutions of polynomial homotopies. Linear Algebra Appl.542, 569–588 (2018)
Article MathSciNet Google Scholar
Brezinski, C., Redivo Zaglia, M.: Extrapolation Methods. Studies in Computational Mathematics, vol. 2. North-Holland (1991)
Google Scholar
Dronamraju, A., et al.: Implications of Stahl’s theorems to holomorphic embedding part II: numerical convergence. CSEE J. Power Energy Syst.7(4), 773–784 (2021)
Google Scholar
Fabry, E.: Sur les points singuliers d’une fonction donnée par son développement en série et l’impossibilité du prolongement analytique dans des cas très généraux. Annales scientifiques de l’École Normale Supérieure13, 367–399 (1896)
Article MathSciNet Google Scholar
Fornberg, B.: Numerical differentiation of analytic functions. ACM Trans. Math. Softw.7(4), 512–526 (1981)
Article MathSciNet Google Scholar
Griewank, A., Walther, A.: Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation, 2 edn. SIAM (2008)
Google Scholar
Heller, D.H.: A survey of parallel algorithms in numerical linear algebra. SIAM Rev.20(4), 740–777 (1978)
Article MathSciNet Google Scholar
Henrici, P.: An algorithm for analytic continuation. SIAM J. Numer. Anal.3(1), 67–78 (1966)
Article MathSciNet Google Scholar
Hida, Y., Li, X.S., Bailey, D.H.: Algorithms for quad-double precision floating point arithmetic. In: 15th IEEE Symposium on Computer Arithmetic (Arith-15 2001), pp. 155–162. IEEE Computer Society (2001)
Google Scholar
Higham, N.J., Mary, T.: Mixed precision algorithms in numerical linear algebra. Acta Numerica 347–414 (2022)
Google Scholar
Isupov, K., Knyazkov, V.: Multiple-precision matrix-vector multiplication on graphics processing units. Program Syst. Theory Appl.11(3), 62–84 (2020)
Google Scholar
Joldes, M., Muller, J.-M., Popescu, V., Tucker, W.: CAMPARY: Cuda multiple precision arithmetic library and applications. In: Greuel, G.-M., Koch, T., Paule, P., Sommese, A. (eds.) ICMS 2016. LNCS, vol. 9725, pp. 232–240. Springer, Cham (2016).https://doi.org/10.1007/978-3-319-42432-3_29
Chapter Google Scholar
Kelley, C.T.: Newton’s method in mixed precision. SIAM Rev.64(1), 191–211 (2022)
Article MathSciNet Google Scholar
Kerr, A., Campbell, D., Richards, M.: QR decomposition on GPUs. In: Kaeli, D., Leeser, M. (eds.) Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units (GPGPU 2009), pp. 71–78. ACM (2009)
Google Scholar
Kirk, D.B., Hwu, W.W.: Programming Massively Parallel Processors. A Hands-on Approach. Morgan Kaufmann (2010)
Google Scholar
Kouya, T.: Acceleration of complex matrix multiplication using arbitrary precision floating-point arithmetic,arXiv:2307.06072v1 (2023)
Kurzak, J., Nath, R., Du, P., Dongarra, J.: An implementation of the tile QR factorization for a GPU and multiple CPUs. In: Jónasson, K. (ed.) PARA 2010. LNCS, vol. 7134, pp. 248–257. Springer, Heidelberg (2012).https://doi.org/10.1007/978-3-642-28145-7_25
Chapter Google Scholar
LLC: Multiprecision computing toolbox for MATLAB.www.advanpix.com
Lu, M., He, B., Luo, Q.: Supporting extended precision on graphics processors. In: Proceedings of the Sixth International Workshop on Data Management on New Hardware (DaMoN 2010), pp. 19–26 (2010)
Google Scholar
Maho, N.: MPLAPACK version 2.0.1. user manual,arXiv:2109.13406v2 (2022)
Morgan, A.: Solving Polynomial Systems Using Continuation for Engineering and Scientific Problems, Classics in Applied Mathematics, vol. 57. SIAM (2009)
Google Scholar
Muller, J.M., et al.: Handbook of Floating-Point Arithmetic, 2nd edn. Springer, Cham (2018).https://doi.org/10.1007/978-3-319-76526-6
Book Google Scholar
Nasri, W., Mahjoub, Z.: Optimal parallelization of a recursive algorithm for triangular matrix inversion on MIMD computers. Parallel Comput.27, 1767–1782 (2001)
Article MathSciNet Google Scholar
Sidi, A.: Practical Extrapolation Methods. Theory and Applications. Cambridge Monographs on Applied and Computational Mathematics, vol. 10. Cambridge University Press, Cambridge (2003)
Google Scholar
Telen, S., Van Barel, M., Verschelde, J.: A robust numerical path tracking algorithm for polynomial homotopy continuation. SIAM J. Sci. Comput.42(6), 3610-A3637 (2020)
Article MathSciNet Google Scholar
Telen, S., Van Barel, M., Verschelde, J.: Robust numerical tracking of one path of a polynomial homotopy on parallel shared memory computers. In: Boulier, F., England, M., Sadykov, T.M., Vorozhtsov, E.V. (eds.) CASC 2020. LNCS, vol. 12291, pp. 563–582. Springer, Cham (2020).https://doi.org/10.1007/978-3-030-60026-6_33
Chapter Google Scholar
Tomov, S., Dongarra, J., Baboulin, M.: Towards dense linear algebra for hybrid GPU accelerated manycore systems. Parallel Comput.36(5), 232–240 (2010)
Article Google Scholar
Tomov, S., Nath, R., Ltaief, H., Dongarra, J.: Dense linear algebra solvers for multicore with GPU accelerators. In: The 2010 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp. 1–8. IEEE (2010)
Google Scholar
Trefethen, L.N.: Approximation Theory and Approximation Practice. SIAM (2013)
Google Scholar
Trefethen, L.N.: Quantifying the ill-conditioning of analytic continuation. BIT Numer. Math.60(4), 901–915 (2020)
Article MathSciNet Google Scholar
Utsugiri, T., Kouya, T.: Acceleration of multiple precision matrix multiplication using Ozaki scheme,arXiv:2301.09960v2 (2023)
Verschelde, J.: Algorithm 795: PHCpack: a general-purpose solver for polynomial systems by homotopy continuation. ACM Trans. Math. Softw.25(2), 251–276 (1999)
Article Google Scholar
Verschelde, J.: Parallel software to offset the cost of higher precision. ACM SIGAda Ada Lett.40(2), 59–64 (2020)
Article Google Scholar
Verschelde, J.: Accelerated polynomial evaluation and differentiation at power series in multiple double precision. In: The 2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp. 740–749. IEEE (2021)
Google Scholar
Verschelde, J.: Least squares on GPUs in multiple double precision. In: The 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp. 828–837. IEEE (2022)
Google Scholar
Verschelde, J., Viswanathan, K.: Locating the closest singularity in a polynomial homotopy. In: Boulier, F., England, M., Sadykov, T.M., Vorozhtsov, E.V. (eds.) CASC 2022. LNCS, vol. 13366, pp. 333–352. Springer, Cham (2022).https://doi.org/10.1007/978-3-031-14788-3_19
Chapter Google Scholar
Volkov, V., Demmel, J.: Benchmarking GPUs to tune dense linear algebra. In: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing. IEEE Press (2008). Article No. 31
Google Scholar
Williams, S., Waterman, A., Patterson, D.: Roofline: an insightful visual performance model for multicore architectures. Commun. ACM52(4), 65–76 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, Statistics, and Computer Science, University of Illinois at Chicago, 851 S. Morgan St. (m/c 249), Chicago, IL, 60607-7045, USA
Jan Verschelde

Authors

Jan Verschelde
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toJan Verschelde.

Editor information

Editors and Affiliations

Université de Lille, Villeneuve d’Ascq, France
François Boulier
School of Mathematical Sciences, Beihang University, Beijing, China
Chenqi Mou
Plekhanov Russian University of Economics, Moscow, Russia
Timur M. Sadykov
Khristianovich Institute of Theoretical and Applied Mechanics, Novosibirsk, Russia
Evgenii V. Vorozhtsov

Ethics declarations

Disclosure of Interests

The author has no competing interests to declare that are relevant to the content of this article.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Verschelde, J. (2024). GPU Accelerated Newton for Taylor Series Solutions of Polynomial Homotopies in Multiple Double Precision. In: Boulier, F., Mou, C., Sadykov, T.M., Vorozhtsov, E.V. (eds) Computer Algebra in Scientific Computing. CASC 2024. Lecture Notes in Computer Science, vol 14938. Springer, Cham. https://doi.org/10.1007/978-3-031-69070-9_19

Download citation

DOI:https://doi.org/10.1007/978-3-031-69070-9_19
Published:21 August 2024
Publisher Name:Springer, Cham
Print ISBN:978-3-031-69069-3
Online ISBN:978-3-031-69070-9
eBook Packages:Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Movatterモバイル変換

GPU Accelerated Newton for Taylor Series Solutions of Polynomial Homotopies in Multiple Double Precision

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Computing matrix trigonometric functions with GPUs through Matlab

An Efficient Quasi-Newton Method with Tensor Product Implementation for Solving Quasi-Linear Elliptic Equations and Systems

Paramotopy: Parameter Homotopies in Parallel

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Ethics declarations

Disclosure of Interests

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Access this chapter

Subscribe and save

Buy Now