Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Hensel's lemma

From Wikipedia, the free encyclopedia
Result in modular arithmetic

Inmathematics,Hensel's lemma, also known asHensel's lifting lemma, named afterKurt Hensel, is a result inmodular arithmetic, stating that if aunivariate polynomial has asimple root modulo aprime numberp, then this root can belifted to a unique root modulo any higher power ofp. More generally, if a polynomial factors modulop into twocoprime polynomials, this factorization can be lifted to a factorization modulo any higher power ofp (the case of roots corresponds to the case of degree1 for one of the factors).

By passing to the "limit" (in fact this is aninverse limit) when the power ofp tends to infinity, it follows that a root or a factorization modulop can be lifted to a root or a factorization over thep-adic integers.

These results have been widely generalized, under the same name, to the case of polynomials over an arbitrarycommutative ring, wherep is replaced by anideal, and "coprime polynomials" means "polynomials that generate an ideal containing1".

Hensel's lemma is fundamental inp-adic analysis, a branch ofanalytic number theory.

The proof of Hensel's lemma isconstructive, and leads to an efficient algorithm forHensel lifting, which is fundamental forfactoring polynomials, and gives the most efficient known algorithm for exactlinear algebra over therational numbers.

Modular reduction and lifting

[edit]

Hensel's original lemma concerns the relation betweenpolynomial factorization over the integers and over the integersmodulo aprime numberp and its powers. It can be straightforwardly extended to the case where the integers are replaced by anycommutative ring, andp is replaced by anymaximal ideal (indeed, the maximal ideals ofZ{\displaystyle \mathbb {Z} } have the formpZ,{\displaystyle p\mathbb {Z} ,} wherep is a prime number).

Making this precise requires a generalization of the usualmodular arithmetic, and so it is useful to define accurately the terminology that is commonly used in this context.

LetR be a commutative ring, andI anideal ofR.Reduction moduloI refers to the replacement of every element ofR by its image under thecanonical mapRR/I.{\displaystyle R\to R/I.} For example, iffR[X]{\displaystyle f\in R[X]} is apolynomial with coefficients inR, its reduction moduloI, denotedfmodI,{\displaystyle f{\bmod {I}},} is the polynomial in(R/I)[X]=R[X]/IR[X]{\displaystyle (R/I)[X]=R[X]/IR[X]} obtained by replacing the coefficients off by their image inR/I.{\displaystyle R/I.} Two polynomialsf andg inR[X]{\displaystyle R[X]} arecongruent moduloI, denotedfg(modI){\textstyle f\equiv g{\pmod {I}}} if they have the same coefficients moduloI, that is iffgIR[X].{\displaystyle f-g\in IR[X].} IfhR[X],{\displaystyle h\in R[X],} a factorization ofh moduloI consists in two (or more) polynomialsf, g inR[X]{\displaystyle R[X]} such thathfg(modI).{\textstyle h\equiv fg{\pmod {I}}.}

Thelifting process is the inverse of reduction. That is, givenobjects depending on elements ofR/I,{\displaystyle R/I,} the lifting process replaces these elements by elements ofR{\displaystyle R} (or ofR/Ik{\displaystyle R/I^{k}} for somek > 1) that maps to them in a way that keeps the properties of the objects.

For example, given a polynomialhR[X]{\displaystyle h\in R[X]} and a factorization moduloI expressed ashfg(modI),{\textstyle h\equiv fg{\pmod {I}},} lifting this factorization moduloIk{\displaystyle I^{k}} consists of finding polynomialsf,gR[X]{\displaystyle f',g'\in R[X]} such thatff(modI),{\textstyle f'\equiv f{\pmod {I}},}gg(modI),{\textstyle g'\equiv g{\pmod {I}},} andhfg(modIk).{\textstyle h\equiv f'g'{\pmod {I^{k}}}.} Hensel's lemma asserts that such a lifting is always possible under mild conditions; see next section.

Statement

[edit]

Originally, Hensel's lemma was stated (and proved) for lifting afactorization modulo aprime numberp of a polynomial over the integers to a factorization modulo any power ofp and to a factorization over thep-adic integers. This can be generalized easily, with the same proof to the case where the integers are replaced by anycommutative ring, the prime number is replaced by amaximal ideal, and thep-adic integers are replaced by thecompletion with respect to the maximal ideal. It is this generalization, which is also widely used, that is presented here.

Letm{\displaystyle {\mathfrak {m}}} be a maximal ideal of a commutative ringR, and

h=α0Xn++αn1X+αn{\displaystyle h=\alpha _{0}X^{n}+\cdots +\alpha _{n-1}X+\alpha _{n}}

be apolynomial inR[X]{\displaystyle R[X]} with aleading coefficientα0{\displaystyle \alpha _{0}} not inm.{\displaystyle {\mathfrak {m}}.}

Sincem{\displaystyle {\mathfrak {m}}} is a maximal ideal, thequotient ringR/m{\displaystyle R/{\mathfrak {m}}} is afield, and(R/m)[X]{\displaystyle (R/{\mathfrak {m}})[X]} is aprincipal ideal domain, and, in particular, aunique factorization domain, which means that every nonzero polynomial in(R/m)[X]{\displaystyle (R/{\mathfrak {m}})[X]} can be factorized in a unique way as the product of a nonzero element of(R/m){\displaystyle (R/{\mathfrak {m}})} andirreducible polynomials that aremonic (that is, their leading coefficients are 1).

Hensel's lemma asserts that every factorization ofh modulom{\displaystyle {\mathfrak {m}}} into coprime polynomials can be lifted in a unique way into a factorization modulomk{\displaystyle {\mathfrak {m}}^{k}} for everyk.

More precisely, with the above hypotheses, ifhα0fg(modm),{\textstyle h\equiv \alpha _{0}fg{\pmod {\mathfrak {m}}},} wheref andg are monic andcoprime modulom,{\displaystyle {\mathfrak {m}},} then, for every positive integerk there are monic polynomialsfk{\displaystyle f_{k}} andgk{\displaystyle g_{k}} such that

hα0fkgk(modmk),fkf(modm),gkg(modm),{\displaystyle {\begin{aligned}h&\equiv \alpha _{0}f_{k}g_{k}{\pmod {{\mathfrak {m}}^{k}}},\\f_{k}&\equiv f{\pmod {\mathfrak {m}}},\\g_{k}&\equiv g{\pmod {\mathfrak {m}}},\end{aligned}}}

andfk{\displaystyle f_{k}} andgk{\displaystyle g_{k}} are unique (with these properties) modulomk.{\displaystyle {\mathfrak {m}}^{k}.}

Lifting simple roots

[edit]

An important special case is whenf=Xr.{\displaystyle f=X-r.} In this case the coprimality hypothesis means thatr is asimple root ofhmodm.{\displaystyle h{\bmod {\mathfrak {m}}}.} This gives the following special case of Hensel's lemma, which is often also called Hensel's lemma.

With above hypotheses and notations, ifr is a simple root ofhmodm,{\displaystyle h{\bmod {\mathfrak {m}}},} thenr can be lifted in a unique way to a simple root ofhmodmn{\displaystyle h{\bmod {{\mathfrak {m}}^{n}}}} for every positive integern. Explicitly, for every positive integern, there is a uniquernR/mn{\displaystyle r_{n}\in R/{\mathfrak {m}}^{n}} such thatrnr(modm){\textstyle r_{n}\equiv r{\pmod {\mathfrak {m}}}} andrn{\displaystyle r_{n}} is a simple root ofhmodmn.{\displaystyle h{\bmod {\mathfrak {m}}}^{n}.}

Lifting to adic completion

[edit]

The fact that one can lift toR/mn{\displaystyle R/{\mathfrak {m}}^{n}} for every positive integern suggests to "pass to the limit" whenn tends to the infinity. This was one of the main motivations for introducingp-adic integers.

Given a maximal idealm{\displaystyle {\mathfrak {m}}} of a commutative ringR, the powers ofm{\displaystyle {\mathfrak {m}}} form a basis ofopen neighborhoods for atopology onR, which is called them{\displaystyle {\mathfrak {m}}}-adic topology. Thecompletion of this topology can be identified with thecompletion of the local ringRm,{\displaystyle R_{\mathfrak {m}},} and with theinverse limitlimR/mn.{\displaystyle \lim _{\leftarrow }R/{\mathfrak {m}}^{n}.} This completion is acomplete local ring, generally denotedR^m.{\displaystyle {\widehat {R}}_{\mathfrak {m}}.} WhenR is the ring of the integers, andm=pZ,{\displaystyle {\mathfrak {m}}=p\mathbb {Z} ,} wherep is a prime number, this completion is the ring ofp-adic integersZp.{\displaystyle \mathbb {Z} _{p}.}

The definition of the completion as an inverse limit, and the above statement of Hensel's lemma imply that every factorization into pairwise coprime polynomials modulom{\displaystyle {\mathfrak {m}}} of a polynomialhR[X]{\displaystyle h\in R[X]} can be uniquely lifted to a factorization of the image ofh inR^m[X].{\displaystyle {\widehat {R}}_{\mathfrak {m}}[X].} Similarly, every simple root ofh modulom{\displaystyle {\mathfrak {m}}} can be lifted to a simple root of the image ofh inR^m[X].{\displaystyle {\widehat {R}}_{\mathfrak {m}}[X].}

Proof

[edit]

Hensel's lemma is generally proved incrementally by lifting a factorization overR/mn{\displaystyle R/{\mathfrak {m}}^{n}} to either a factorization overR/mn+1{\displaystyle R/{\mathfrak {m}}^{n+1}} (Linear lifting), or a factorization overR/m2n{\displaystyle R/{\mathfrak {m}}^{2n}} (Quadratic lifting).

The main ingredient of the proof is thatcoprime polynomials over a field satisfyBézout's identity. That is, iff andg are coprimeunivariate polynomials over afield (hereR/m{\displaystyle R/{\mathfrak {m}}}), there are polynomialsa andb such thatdega<degg,{\displaystyle \deg a<\deg g,}degb<degf,{\displaystyle \deg b<\deg f,} and

af+bg=1.{\displaystyle af+bg=1.}

Bézout's identity allows defining coprime polynomials and proving Hensel's lemma, even if the idealm{\displaystyle {\mathfrak {m}}} is not maximal. Therefore, in the following proofs, one starts from a commutative ringR, anidealI, a polynomialhR[X]{\displaystyle h\in R[X]} that has a leading coefficient that is invertible moduloI (that is its image inR/I{\displaystyle R/I} is aunit inR/I{\displaystyle R/I}), andfactorization ofh moduloI or modulo a power ofI, such that the factors satisfy a Bézout's identity moduloI. In these proofs,AB(modI){\textstyle A\equiv B{\pmod {I}}} meansABIR[X].{\displaystyle A-B\in IR[X].}

Linear lifting

[edit]

LetI be anideal of acommutative ringR, andhR[X]{\displaystyle h\in R[X]} be aunivariate polynomial with coefficients inR that has aleading coefficientα{\displaystyle \alpha } that is invertible moduloI (that is, the image ofα{\displaystyle \alpha } inR/I{\displaystyle R/I} is aunit inR/I{\displaystyle R/I}).

Suppose that for some positive integerk there is a factorization

hαfg(modIk),{\displaystyle h\equiv \alpha fg{\pmod {I^{k}}},}

such thatf andg aremonic polynomials that are coprime moduloI, in the sense that there exista,bR[X],{\displaystyle a,b\in R[X],} such thataf+bg1(modI).{\textstyle af+bg\equiv 1{\pmod {I}}.} Then, there are polynomialsδf,δgIkR[X],{\displaystyle \delta _{f},\delta _{g}\in I^{k}R[X],} such thatdegδf<degf,{\displaystyle \deg \delta _{f}<\deg f,}degδg<degg,{\displaystyle \deg \delta _{g}<\deg g,} and

hα(f+δf)(g+δg)(modIk+1).{\displaystyle h\equiv \alpha (f+\delta _{f})(g+\delta _{g}){\pmod {I^{k+1}}}.}

Under these conditions,δf{\displaystyle \delta _{f}} andδg{\displaystyle \delta _{g}} are unique moduloIk+1R[X].{\displaystyle I^{k+1}R[X].}

Moreover,f+δf{\displaystyle f+\delta _{f}} andg+δg{\displaystyle g+\delta _{g}} satisfy the same Bézout's identity asf andg, that is,a(f+δf)+b(g+δg)1(modI).{\displaystyle a(f+\delta _{f})+b(g+\delta _{g})\equiv 1{\pmod {I}}.} This follows immediately from the preceding assertions, but is needed to apply iteratively the result with increasing values ofk.

The proof that follows is written for computingδf{\displaystyle \delta _{f}} andδg{\displaystyle \delta _{g}} by using only polynomials with coefficients inR/I{\displaystyle R/I} orIk/Ik+1.{\displaystyle I^{k}/I^{k+1}.} WhenR=Z{\displaystyle R=\mathbb {Z} } andI=pZ,{\displaystyle I=p\mathbb {Z} ,} this allows manipulating only integers modulop.

Proof:By hypothesis,α{\displaystyle \alpha } is invertible moduloI. This means that there existsβR{\displaystyle \beta \in R} andγIR[X]{\displaystyle \gamma \in IR[X]} such thatαβ=1γ.{\displaystyle \alpha \beta =1-\gamma .}

LetδhIkR[X],{\displaystyle \delta _{h}\in I^{k}R[X],} of degree less thandegh,{\displaystyle \deg h,} such that

δhhαfg(modIk+1).{\displaystyle \delta _{h}\equiv h-\alpha fg{\pmod {I^{k+1}}}.}

(One may chooseδh=hαfg,{\displaystyle \delta _{h}=h-\alpha fg,} but other choices may lead to simpler computations. For example, ifR=Z{\displaystyle R=\mathbb {Z} } andI=pZ,{\displaystyle I=p\mathbb {Z} ,} it is possible and better to chooseδh=pkδh{\displaystyle \delta _{h}=p^{k}\delta '_{h}} where the coefficients ofδh{\displaystyle \delta '_{h}} are integers in the interval[0,p1].{\displaystyle [0,p-1].})

Asg is monic, theEuclidean division ofaδh{\displaystyle a\delta _{h}} byg is defined, and providesq andc such thataδh=qg+c,{\displaystyle a\delta _{h}=qg+c,} anddegc<degg.{\displaystyle \deg c<\deg g.} Moreover, bothq andc are inIkR[X].{\displaystyle I^{k}R[X].} Similarly, letbδh=qf+d,{\displaystyle b\delta _{h}=q'f+d,} withdegd<degf,{\displaystyle \deg d<\deg f,} andq,dIkR[X].{\displaystyle q',d\in I^{k}R[X].}

One hasq+qIk+1R[X].{\displaystyle q+q'\in I^{k+1}R[X].} Indeed, one has

fc+gd=afδh+bgδhfg(q+q)δhfg(q+q)(modIk+1).{\displaystyle fc+gd=af\delta _{h}+bg\delta _{h}-fg(q+q')\equiv \delta _{h}-fg(q+q'){\pmod {I^{k+1}}}.}

Asfg{\displaystyle fg} is monic, the degree moduloIk+1{\displaystyle I^{k+1}} offg(q+q){\displaystyle fg(q+q')} can be less thandegfg{\displaystyle \deg fg} only ifq+qIk+1R[X].{\displaystyle q+q'\in I^{k+1}R[X].}

Thus, considering congruences moduloIk+1,{\displaystyle I^{k+1},} one has

α(f+βd)(g+βc)hαfgh+αβ(f(aδhqg)+g(bδhqf))δh(1+αβ(af+bg))αβfg(q+q)0(modIk+1).{\displaystyle {\begin{aligned}\alpha (f+\beta d)&(g+\beta c)-h\\&\equiv \alpha fg-h+\alpha \beta (f(a\delta _{h}-qg)+g(b\delta _{h}-q'f))\\&\equiv \delta _{h}(-1+\alpha \beta (af+bg))-\alpha \beta fg(q+q')\\&\equiv 0{\pmod {I^{k+1}}}.\end{aligned}}}

So, the existence assertion is verified with

δf=βd,δg=βc.{\displaystyle \delta _{f}=\beta d,\qquad \delta _{g}=\beta c.}

Uniqueness

[edit]

LetR,I,h andα{\displaystyle \alpha } as a in the preceding section. Let

hαfg(modI){\displaystyle h\equiv \alpha fg{\pmod {I}}}

be a factorization into coprime polynomials (in the above sense), suchdegf0+degg0=degh.{\displaystyle \deg f_{0}+\deg g_{0}=\deg h.} The application of linear lifting fork=1,2,,n1,{\displaystyle k=1,2,\ldots ,n-1\ldots ,} shows the existence ofδf{\displaystyle \delta _{f}} andδg{\displaystyle \delta _{g}} such thatdegδf<degf,{\displaystyle \deg \delta _{f}<\deg f,}degδg<degg,{\displaystyle \deg \delta _{g}<\deg g,} and

hα(f+δf)(g+δg)(modIn).{\displaystyle h\equiv \alpha (f+\delta _{f})(g+\delta _{g}){\pmod {I^{n}}}.}

The polynomialsδf{\displaystyle \delta _{f}} andδg{\displaystyle \delta _{g}} are uniquely defined moduloIn.{\displaystyle I^{n}.} This means that, if another pair(δf,δg){\displaystyle (\delta '_{f},\delta '_{g})} satisfies the same conditions, then one has

δfδf(modIn)andδgδg(modIn).{\displaystyle \delta '_{f}\equiv \delta _{f}{\pmod {I^{n}}}\qquad {\text{and}}\qquad \delta '_{g}\equiv \delta _{g}{\pmod {I^{n}}}.}

Proof: Since a congruence moduloIn{\displaystyle I^{n}} implies the same concruence moduloIn1,{\displaystyle I^{n-1},} one can proceed byinduction and suppose that the uniqueness has been proved forn − 1, the casen = 0 being trivial. That is, one can suppose that

δfδfIn1R[X]andδgδgIn1R[X].{\displaystyle \delta _{f}-\delta '_{f}\in I^{n-1}R[X]\qquad {\text{and}}\qquad \delta _{g}-\delta '_{g}\in I^{n-1}R[X].}

By hypothesis, has

hα(f+δf)(g+δg)α(f+δf)(g+δg)(modIn),{\displaystyle h\equiv \alpha (f+\delta _{f})(g+\delta _{g})\equiv \alpha (f+\delta '_{f})(g+\delta '_{g}){\pmod {I^{n}}},}

and thus

α(f+δf)(g+δg)α(f+δf)(g+δg)=α(f(δgδg)+g(δfδf))+α(δf(δgδg)δg(δfδf))InR[X].{\displaystyle {\begin{aligned}\alpha (f+\delta _{f})(g+\delta _{g})&-\alpha (f+\delta '_{f})(g+\delta '_{g})\\&=\alpha (f(\delta _{g}-\delta '_{g})+g(\delta _{f}-\delta '_{f}))+\alpha (\delta _{f}(\delta _{g}-\delta '_{g})-\delta _{g}(\delta _{f}-\delta '_{f}))\in I^{n}R[X].\end{aligned}}}

By induction hypothesis, the second term of the latter sum belongs toIn,{\displaystyle I^{n},} and the same is thus true for the first term. Asα{\displaystyle \alpha } is invertible moduloI, there existβR{\displaystyle \beta \in R} andγI{\displaystyle \gamma \in I} such thatαβ=1+γ.{\displaystyle \alpha \beta =1+\gamma .} Thus

f(δgδg)+g(δfδf)=αβ(f(δgδg)+g(δfδf))γ(f(δgδg)+g(δfδf))InR[X],{\displaystyle {\begin{aligned}f(\delta _{g}-\delta '_{g})&+g(\delta _{f}-\delta '_{f})\\&=\alpha \beta (f(\delta _{g}-\delta '_{g})+g(\delta _{f}-\delta '_{f}))-\gamma (f(\delta _{g}-\delta '_{g})+g(\delta _{f}-\delta '_{f}))\in I^{n}R[X],\end{aligned}}}

using the induction hypothesis again.

The coprimality moduloI implies the existence ofa,bR[X]{\displaystyle a,b\in R[X]} such that1af+bg(modI).{\textstyle 1\equiv af+bg{\pmod {I}}.} Using the induction hypothesis once more, one gets

δgδg(af+bg)(δgδg)g(b(δgδg)a(δfδf))(modIn).{\displaystyle {\begin{aligned}\delta _{g}-\delta '_{g}&\equiv (af+bg)(\delta _{g}-\delta '_{g})\\&\equiv g(b(\delta _{g}-\delta '_{g})-a(\delta _{f}-\delta '_{f})){\pmod {I^{n}}}.\end{aligned}}}

Thus one has a polynomial of degree less thandegg{\displaystyle \deg g} that is congruent moduloIn{\displaystyle I^{n}} to the product of themonic polynomialg and another polynomialw. This is possible only ifwInR[X],{\displaystyle w\in I^{n}R[X],} and impliesδgδgInR[X].{\displaystyle \delta _{g}-\delta '_{g}\in I^{n}R[X].} Similarly,δfδf{\displaystyle \delta _{f}-\delta '_{f}} is also inInR[X],{\displaystyle I^{n}R[X],} and this proves the uniqueness.

Quadratic lifting

[edit]

Linear lifting allows lifting a factorization moduloIn{\displaystyle I^{n}} to a factorization moduloIn+1.{\displaystyle I^{n+1}.} Quadratic lifting allows lifting directly to a factorization moduloI2n,{\displaystyle I^{2n},} at the cost of lifting also theBézout's identity and of computing moduloIn{\displaystyle I^{n}} instead of moduloI (if one uses the above description of linear lifting).

For lifting up to moduloIN{\displaystyle I^{N}} for largeN one can use either method. If, say,N=2k,{\displaystyle N=2^{k},} a factorization moduloIN{\displaystyle I^{N}} requiresN − 1 steps of linear lifting or onlyk − 1 steps of quadratic lifting. However, in the latter case the size of the coefficients that have to be manipulated increase during the computation. This implies that the best lifting method depends on the context (value ofN, nature ofR, multiplication algorithm that is used,hardware specificities, etc.).[citation needed]

Quadratic lifting is based on the following property.

Suppose that for some positive integerk there is a factorization

hαfg(modIk),{\displaystyle h\equiv \alpha fg{\pmod {I^{k}}},}

such thatf andg aremonic polynomials that are coprime moduloI, in the sense that there exista,bR[X],{\displaystyle a,b\in R[X],} such thataf+bg1(modIk).{\textstyle af+bg\equiv 1{\pmod {I^{k}}}.} Then, there are polynomialsδf,δgIkR[X],{\displaystyle \delta _{f},\delta _{g}\in I^{k}R[X],} such thatdegδf<degf,{\displaystyle \deg \delta _{f}<\deg f,}degδg<degg,{\displaystyle \deg \delta _{g}<\deg g,} and

hα(f+δf)(g+δg)(modI2k).{\displaystyle h\equiv \alpha (f+\delta _{f})(g+\delta _{g}){\pmod {I^{2k}}}.}

Moreover,f+δf{\displaystyle f+\delta _{f}} andg+δg{\displaystyle g+\delta _{g}} satisfy a Bézout's identity of the form

(a+δa)(f+δf)+(b+δb)(g+δg)1(modI2k).{\displaystyle (a+\delta _{a})(f+\delta _{f})+(b+\delta _{b})(g+\delta _{g})\equiv 1{\pmod {I^{2k}}}.}

(This is required for allowing iterations of quadratic lifting.)

Proof: The first assertion is exactly that of linear lifting applied withk = 1 to the idealIk{\displaystyle I^{k}} instead ofI.{\displaystyle I.}

Letα=af+bg1IkR[X].{\displaystyle \alpha =af+bg-1\in I^{k}R[X].} One has

a(f+δf)+b(g+δg)=1+Δ,{\displaystyle a(f+\delta _{f})+b(g+\delta _{g})=1+\Delta ,}

where

Δ=α+aδf+bδgIkR[X].{\displaystyle \Delta =\alpha +a\delta _{f}+b\delta _{g}\in I^{k}R[X].}

Settingδa=aΔ{\displaystyle \delta _{a}=-a\Delta } andδb=bΔ,{\displaystyle \delta _{b}=-b\Delta ,} one gets

(a+δa)(f+δf)+(b+δb)(g+δg)=1Δ2I2kR[X],{\displaystyle (a+\delta _{a})(f+\delta _{f})+(b+\delta _{b})(g+\delta _{g})=1-\Delta ^{2}\in I^{2k}R[X],}

which proves the second assertion.

Explicit example

[edit]

Letf(X)=X62Q[X].{\displaystyle f(X)=X^{6}-2\in \mathbb {Q} [X].}

Modulo 2, Hensel's lemma cannot be applied since the reduction off(X){\displaystyle f(X)} modulo 2 is simply[1]pg 15-16

f¯(X)=X62¯=X6{\displaystyle {\bar {f}}(X)=X^{6}-{\overline {2}}=X^{6}}

with 6 factorsX{\displaystyle X} not being relatively prime to each other. ByEisenstein's criterion, however, one can conclude that the polynomialf(X){\displaystyle f(X)} is irreducible inQ2[X].{\displaystyle \mathbb {Q} _{2}[X].}
Overk=F7{\displaystyle k=\mathbb {F} _{7}}, on the other hand, one has

f¯(X)=X62¯=X616¯=(X34¯)(X3+4¯){\displaystyle {\bar {f}}(X)=X^{6}-{\overline {2}}=X^{6}-{\overline {16}}=(X^{3}-{\overline {4}})\;(X^{3}+{\overline {4}})}

where4{\displaystyle 4} is the square root of 2 inF7{\displaystyle \mathbb {F} _{7}}. As 4 is not a cube inF7,{\displaystyle \mathbb {F} _{7},} these two factors are irreducible overF7{\displaystyle \mathbb {F} _{7}}. Hence the complete factorization ofX62{\displaystyle X^{6}-2} inZ7[X]{\displaystyle \mathbb {Z} _{7}[X]} andQ7[X]{\displaystyle \mathbb {Q} _{7}[X]} is

f(X)=X62=(X3α)(X3+α),{\displaystyle f(X)=X^{6}-2=(X^{3}-\alpha )\;(X^{3}+\alpha ),}

whereα=4504547{\displaystyle \alpha =\ldots 450\,454_{7}} is a square root of 2 inZ7{\displaystyle \mathbb {Z} _{7}} that can be obtained by lifting the above factorization.
Finally, inF727[X]{\displaystyle \mathbb {F} _{727}[X]} the polynomial splits into

f¯(X)=X62¯=(X3¯)(X116¯)(X119¯)(X608¯)(X611¯)(X724¯){\displaystyle {\bar {f}}(X)=X^{6}-{\overline {2}}=(X-{\overline {3}})\;(X-{\overline {116}})\;(X-{\overline {119}})\;(X-{\overline {608}})\;(X-{\overline {611}})\;(X-{\overline {724}})}

with all factors relatively prime to each other, so that inZ727[X]{\displaystyle \mathbb {Z} _{727}[X]} andQ727[X]{\displaystyle \mathbb {Q} _{727}[X]} there are 6 factorsXβ{\displaystyle X-\beta } with the (non-rational) 727-adic integers

β={3+545727+5377272+1617273+116+48727+1307272+4987273+119+593727+6677272+6597273+608+133727+597272+677273+611+678727+5967272+2287273+724+181727+1897272+5657273+{\displaystyle \beta =\left\{{\begin{array}{rrr}3\;+&\!\!\!545\cdot 727\;+&\!\!\!537\cdot 727^{2}\,+&\!\!\!161\cdot 727^{3}+\ldots \\116\;+&\!\!\!48\cdot 727\;+&\!\!\!130\cdot 727^{2}\,+&\!\!\!498\cdot 727^{3}+\ldots \\119\;+&\!\!\!593\cdot 727\;+&\!\!\!667\cdot 727^{2}\,+&\!\!\!659\cdot 727^{3}+\ldots \\608\;+&\!\!\!133\cdot 727\;+&\!\!\!59\cdot 727^{2}\,+&\!\!\!67\cdot 727^{3}+\ldots \\611\;+&\!\!\!678\cdot 727\;+&\!\!\!596\cdot 727^{2}\,+&\!\!\!228\cdot 727^{3}+\ldots \\724\;+&\!\!\!181\cdot 727\;+&\!\!\!189\cdot 727^{2}\,+&\!\!\!565\cdot 727^{3}+\ldots \end{array}}\right.}

Using derivatives for lifting roots

[edit]

Letf(x){\displaystyle f(x)} be apolynomial withinteger (orp-adic integer) coefficients, and letm,k be positive integers such thatmk. Ifr is an integer such that

f(r)0modpkandf(r)0modp{\displaystyle f(r)\equiv 0{\bmod {p}}^{k}\quad {\text{and}}\quad f'(r)\not \equiv 0{\bmod {p}}}

then, for everym>0{\displaystyle m>0} there exists an integers such that

f(s)0modpk+mandrsmodpk.{\displaystyle f(s)\equiv 0{\bmod {p}}^{k+m}\quad {\text{and}}\quad r\equiv s{\bmod {p}}^{k}.}

Furthermore, thiss is unique modulopk+m, and can be computed explicitly as the integer such that

s=rf(r)a,{\displaystyle s=r-f(r)\cdot a,}

wherea{\displaystyle a} is an integer satisfying

a[f(r)]1modpm.{\displaystyle a\equiv [f'(r)]^{-1}{\bmod {p}}^{m}.}

Note thatf(r)0modpk{\displaystyle f(r)\equiv 0{\bmod {p}}^{k}} so that the conditionsrmodpk{\displaystyle s\equiv r{\bmod {p}}^{k}} is met. As an aside, iff(r)0modp{\displaystyle f'(r)\equiv 0{\bmod {p}}}, then 0, 1, or severals may exist (see Hensel Lifting below).

Derivation

[edit]

We use the Taylor expansion off aroundr to write:

f(s)=n=0Ncn(sr)n,cn=f(n)(r)/n!.{\displaystyle f(s)=\sum _{n=0}^{N}c_{n}(s-r)^{n},\qquad c_{n}=f^{(n)}(r)/n!.}

Fromrsmodpk,{\displaystyle r\equiv s{\bmod {p}}^{k},} we see thatsr =tpk for some integert. Let

f(s)=n=0Ncn(tpk)n=f(r)+tpkf(r)+n=2Ncntnpkn=f(r)+tpkf(r)+p2kt2g(t)g(t)Z[t]=zpk+tpkf(r)+p2kt2g(t)f(r)0modpk=(z+tf(r))pk+p2kt2g(t){\displaystyle {\begin{aligned}f(s)&=\sum _{n=0}^{N}c_{n}\left(tp^{k}\right)^{n}\\&=f(r)+tp^{k}f'(r)+\sum _{n=2}^{N}c_{n}t^{n}p^{kn}\\&=f(r)+tp^{k}f'(r)+p^{2k}t^{2}g(t)&&g(t)\in \mathbb {Z} [t]\\&=zp^{k}+tp^{k}f'(r)+p^{2k}t^{2}g(t)&&f(r)\equiv 0{\bmod {p}}^{k}\\&=(z+tf'(r))p^{k}+p^{2k}t^{2}g(t)\end{aligned}}}

Formk,{\displaystyle m\leqslant k,} we have:

f(s)0modpk+m(z+tf(r))pk0modpk+mz+tf(r)0modpmtf(r)zmodpmtz[f(r)]1modpmpf(r){\displaystyle {\begin{aligned}f(s)\equiv 0{\bmod {p}}^{k+m}&\Longleftrightarrow (z+tf'(r))p^{k}\equiv 0{\bmod {p}}^{k+m}\\&\Longleftrightarrow z+tf'(r)\equiv 0{\bmod {p}}^{m}\\&\Longleftrightarrow tf'(r)\equiv -z{\bmod {p}}^{m}\\&\Longleftrightarrow t\equiv -z[f'(r)]^{-1}{\bmod {p}}^{m}&&p\nmid f'(r)\end{aligned}}}

The assumption thatf(r){\displaystyle f'(r)} is not divisible byp ensures thatf(r){\displaystyle f'(r)} has an inverse modpm{\displaystyle p^{m}} which is necessarily unique. Hence a solution fort exists uniquely modulopm,{\displaystyle p^{m},} ands exists uniquely modulopk+m.{\displaystyle p^{k+m}.}

Observations

[edit]

Criterion for irreducible polynomials

[edit]

Using the above hypotheses, if we consider an irreducible polynomial

f(x)=a0+a1x++anxnK[X]{\displaystyle f(x)=a_{0}+a_{1}x+\cdots +a_{n}x^{n}\in K[X]}

such thata0,an0{\displaystyle a_{0},a_{n}\neq 0}, then

|f|=max{|a0|,|an|}{\displaystyle |f|=\max\{|a_{0}|,|a_{n}|\}}

In particular, forf(X)=X6+10X1{\displaystyle f(X)=X^{6}+10X-1}, we find inQ2[X]{\displaystyle \mathbb {Q} _{2}[X]}

|f(X)|=max{|a0|,,|an|}=max{0,1,0}=1{\displaystyle {\begin{aligned}|f(X)|&=\max\{|a_{0}|,\ldots ,|a_{n}|\}\\&=\max\{0,1,0\}=1\end{aligned}}}

butmax{|a0|,|an|}=0{\displaystyle \max\{|a_{0}|,|a_{n}|\}=0}, hence the polynomial cannot be irreducible. Whereas inQ7[X]{\displaystyle \mathbb {Q} _{7}[X]} we have both values agreeing, meaning the polynomialcould be irreducible. In order to determine irreducibility, the Newton polygon must be employed.[2]: 144 

Frobenius

[edit]

Note that given anaFp{\displaystyle a\in \mathbb {F} _{p}} theFrobenius endomorphismyyp{\displaystyle y\mapsto y^{p}} gives a nonzero polynomialxpa{\displaystyle x^{p}-a} that has zero derivative

ddx(xpa)=pxp10xp1modp0modp{\displaystyle {\begin{aligned}{\frac {d}{dx}}(x^{p}-a)&=p\cdot x^{p-1}\\&\equiv 0\cdot x^{p-1}{\bmod {p}}\\&\equiv 0{\bmod {p}}\end{aligned}}}

hence thepth roots ofa{\displaystyle a} do not exist inZp{\displaystyle \mathbb {Z} _{p}}. Fora=1{\displaystyle a=1}, this implies thatZp{\displaystyle \mathbb {Z} _{p}} cannot contain theroot of unityμp{\displaystyle \mu _{p}}.

Roots of unity

[edit]

Although thepth roots of unity are not contained inFp{\displaystyle \mathbb {F} _{p}}, there are solutions ofxpx=x(xp11){\displaystyle x^{p}-x=x(x^{p-1}-1)}. Note that

ddx(xpx)=pxp111modp{\displaystyle {\begin{aligned}{\frac {d}{dx}}(x^{p}-x)&=px^{p-1}-1\\&\equiv -1{\bmod {p}}\end{aligned}}}

is never zero, so if there exists a solution, it necessarily lifts toZp{\displaystyle \mathbb {Z} _{p}}. Because the Frobenius givesap=a,{\displaystyle a^{p}=a,} all of the non-zero elementsFp×{\displaystyle \mathbb {F} _{p}^{\times }} are solutions. In fact, these are the only roots of unity contained inQp{\displaystyle \mathbb {Q} _{p}}.[3]

Hensel lifting

[edit]

Using the lemma, one can "lift" a rootr of the polynomialf modulopk to a new roots modulopk+1 such thatrs modpk (by takingm = 1; taking largerm follows by induction). In fact, a root modulopk+1 is also a root modulopk, so the roots modulopk+1 are precisely the liftings of roots modulopk. The new roots is congruent tor modulop, so the new root also satisfiesf(s)f(r)0modp.{\displaystyle f'(s)\equiv f'(r)\not \equiv 0{\bmod {p}}.} So the lifting can be repeated, and starting from a solutionrk off(x)0modpk{\displaystyle f(x)\equiv 0{\bmod {p}}^{k}} we can derive a sequence of solutionsrk+1,rk+2, ... of the same congruence for successively higher powers ofp, provided thatf(rk)0modp{\displaystyle f'(r_{k})\not \equiv 0{\bmod {p}}} for the initial rootrk. This also shows thatf has the same number of roots modpk as modpk+1, modpk+2, or any other higher power ofp, provided that the roots off modpk are all simple.

What happens to this process ifr is not a simple root modp? Suppose that

f(r)0modpkandf(r)0modp.{\displaystyle f(r)\equiv 0{\bmod {p}}^{k}\quad {\text{and}}\quad f'(r)\equiv 0{\bmod {p}}.}

Thensrmodpk{\displaystyle s\equiv r{\bmod {p}}^{k}} impliesf(s)f(r)modpk+1.{\displaystyle f(s)\equiv f(r){\bmod {p}}^{k+1}.} That is,f(r+tpk)f(r)modpk+1{\displaystyle f(r+tp^{k})\equiv f(r){\bmod {p}}^{k+1}} for all integerst. Therefore, we have two cases:

Example. To see both cases we examine two different polynomials withp = 2:

f(x)=x2+1{\displaystyle f(x)=x^{2}+1} andr = 1. Thenf(1)0mod2{\displaystyle f(1)\equiv 0{\bmod {2}}} andf(1)0mod2.{\displaystyle f'(1)\equiv 0{\bmod {2}}.} We havef(1)0mod4{\displaystyle f(1)\not \equiv 0{\bmod {4}}} which means that no lifting of 1 to modulus 4 is a root off(x) modulo 4.

g(x)=x217{\displaystyle g(x)=x^{2}-17} andr = 1. Theng(1)0mod2{\displaystyle g(1)\equiv 0{\bmod {2}}} andg(1)0mod2.{\displaystyle g'(1)\equiv 0{\bmod {2}}.} However, sinceg(1)0mod4,{\displaystyle g(1)\equiv 0{\bmod {4}},} we can lift our solution to modulus 4 and both lifts (i.e. 1, 3) are solutions. The derivative is still 0 modulo 2, soa priori we don't know whether we can lift them to modulo 8, but in fact we can, sinceg(1) is 0 mod 8 andg(3) is 0 mod 8, giving solutions at 1, 3, 5, and 7 mod 8. Since of these onlyg(1) andg(7) are 0 mod 16 we can lift only 1 and 7 to modulo 16, giving 1, 7, 9, and 15 mod 16. Of these, only 7 and 9 giveg(x) = 0 mod 32, so these can be raised giving 7, 9, 23, and 25 mod 32. It turns out that for every integerk ≥ 3, there are four liftings of 1 mod 2 to a root ofg(x) mod 2k.

Hensel's lemma forp-adic numbers

[edit]

In thep-adic numbers, where we can make sense of rational numbers modulo powers ofp as long as the denominator is not a multiple ofp, the recursion fromrk (roots modpk) tork+1 (roots modpk+1) can be expressed in a much more intuitive way. Instead of choosingt to be an(y) integer which solves the congruence

tf(rk)(f(rk)/pk)modpm,{\displaystyle tf'(r_{k})\equiv -(f(r_{k})/p^{k}){\bmod {p}}^{m},}

lett be the rational number (thepk here is not really a denominator sincef(rk) is divisible bypk):

(f(rk)/pk)/f(rk).{\displaystyle -(f(r_{k})/p^{k})/f'(r_{k}).}

Then set

rk+1=rk+tpk=rkf(rk)f(rk).{\displaystyle r_{k+1}=r_{k}+tp^{k}=r_{k}-{\frac {f(r_{k})}{f'(r_{k})}}.}

This fraction may not be an integer, but it is ap-adic integer, and the sequence of numbersrk converges in thep-adic integers to a root off(x) = 0. Moreover, the displayed recursive formula for the (new) numberrk+1 in terms ofrk is preciselyNewton's method for finding roots to equations in the real numbers.

By working directly in thep-adics and using thep-adic absolute value, there is a version of Hensel's lemma which can be applied even if we start with a solution off(a) ≡ 0 modp such thatf(a)0modp.{\displaystyle f'(a)\equiv 0{\bmod {p}}.} We just need to make sure the numberf(a){\displaystyle f'(a)} is not exactly 0. This more general version is as follows: if there is an integera which satisfies:

|f(a)|p<|f(a)|p2,{\displaystyle |f(a)|_{p}<|f'(a)|_{p}^{2},}

then there is a uniquep-adic integerb suchf(b) = 0 and|ba|p<|f(a)|p.{\displaystyle |b-a|_{p}<|f'(a)|_{p}.} The construction ofb amounts to showing that the recursion from Newton's method with initial valuea converges in thep-adics and we letb be the limit. The uniqueness ofb as a root fitting the condition|ba|p<|f(a)|p{\displaystyle |b-a|_{p}<|f'(a)|_{p}} needs additional work.

The statement of Hensel's lemma given above (takingm=1{\displaystyle m=1}) is a special case of this more general version, since the conditions thatf(a) ≡ 0 modp andf(a)0modp{\displaystyle f'(a)\not \equiv 0{\bmod {p}}} say that|f(a)|p<1{\displaystyle |f(a)|_{p}<1} and|f(a)|p=1.{\displaystyle |f'(a)|_{p}=1.}

Examples

[edit]

Suppose thatp is an odd prime anda is a non-zeroquadratic residue modulop. Then Hensel's lemma implies thata has a square root in the ring ofp-adic integersZp.{\displaystyle \mathbb {Z} _{p}.} Indeed, letf(x)=x2a.{\displaystyle f(x)=x^{2}-a.} Ifr is a square root ofa modulop then:

f(r)=r2a0modpandf(r)=2r0modp,{\displaystyle f(r)=r^{2}-a\equiv 0{\bmod {p}}\quad {\text{and}}\quad f'(r)=2r\not \equiv 0{\bmod {p}},}

where the second condition is dependent on the fact thatp is odd. The basic version of Hensel's lemma tells us that starting fromr1 =r we can recursively construct a sequence of integers{rk}{\displaystyle \{r_{k}\}} such that:

rk+1rkmodpk,rk2amodpk.{\displaystyle r_{k+1}\equiv r_{k}{\bmod {p}}^{k},\quad r_{k}^{2}\equiv a{\bmod {p}}^{k}.}

This sequence converges to somep-adic integerb which satisfiesb2 =a. In fact,b is the unique square root ofa inZp{\displaystyle \mathbb {Z} _{p}} congruent tor1 modulop. Conversely, ifa is a perfect square inZp{\displaystyle \mathbb {Z} _{p}} and it is not divisible byp then it is a nonzero quadratic residue modp. Note that thequadratic reciprocity law allows one to easily test whethera is a nonzero quadratic residue modp, thus we get a practical way to determine whichp-adic numbers (forp odd) have ap-adic square root, and it can be extended to cover the casep = 2 using the more general version of Hensel's lemma (an example with 2-adic square roots of 17 is given later).

To make the discussion above more explicit, let us find a "square root of 2" (the solution tox22=0{\displaystyle x^{2}-2=0}) in the 7-adic integers. Modulo 7 one solution is 3 (we could also take 4), so we setr1=3{\displaystyle r_{1}=3}. Hensel's lemma then allows us to findr2{\displaystyle r_{2}} as follows:

f(r1)=322=7f(r1)/p1=7/7=1f(r1)=2r1=6{\displaystyle {\begin{aligned}f(r_{1})&=3^{2}-2=7\\f(r_{1})/p^{1}&=7/7=1\\f'(r_{1})&=2r_{1}=6\end{aligned}}}

Based on which the expression

tf(r1)(f(r1)/pk)modp,{\displaystyle tf'(r_{1})\equiv -(f(r_{1})/p^{k}){\bmod {p}},}

turns into:

t61mod7{\displaystyle t\cdot 6\equiv -1{\bmod {7}}}

which impliest=1.{\displaystyle t=1.} Now:

r2=r1+tp1=3+17=10=137.{\displaystyle r_{2}=r_{1}+tp^{1}=3+1\cdot 7=10=13_{7}.}

And sure enough,1022mod72.{\displaystyle 10^{2}\equiv 2{\bmod {7}}^{2}.} (If we had used the Newton method recursion directly in the 7-adics, thenr2=r1f(r1)/f(r1)=37/6=11/6,{\displaystyle r_{2}=r_{1}-f(r_{1})/f'(r_{1})=3-7/6=11/6,} and11/610mod72.{\displaystyle 11/6\equiv 10{\bmod {7}}^{2}.})

We can continue and findr3=108=3+7+272=2137{\displaystyle r_{3}=108=3+7+2\cdot 7^{2}=213_{7}}. Each time we carry out the calculation (that is, for each successive value ofk), one more base 7 digit is added for the next higher power of 7. In the 7-adic integers this sequence converges, and the limit is a square root of 2 inZ7{\displaystyle \mathbb {Z} _{7}} which has initial 7-adic expansion

3+7+272+673+74+275+76+277+478+.{\displaystyle 3+7+2\cdot 7^{2}+6\cdot 7^{3}+7^{4}+2\cdot 7^{5}+7^{6}+2\cdot 7^{7}+4\cdot 7^{8}+\cdots .}

If we started with the initial choicer1=4{\displaystyle r_{1}=4} then Hensel's lemma would produce a square root of 2 inZ7{\displaystyle \mathbb {Z} _{7}} which is congruent to 4 (mod 7) instead of 3 (mod 7) and in fact this second square root would be the negative of the first square root (which is consistent with 4 = −3 mod 7).

As an example where the original version of Hensel's lemma is not valid but the more general one is, letf(x)=x217{\displaystyle f(x)=x^{2}-17} anda=1.{\displaystyle a=1.} Thenf(a)=16{\displaystyle f(a)=-16} andf(a)=2,{\displaystyle f'(a)=2,} so

|f(a)|2<|f(a)|22,{\displaystyle |f(a)|_{2}<|f'(a)|_{2}^{2},}

which implies there is a unique 2-adic integerb satisfying

b2=17and|ba|2<|f(a)|2=12,{\displaystyle b^{2}=17\quad {\text{and}}\quad |b-a|_{2}<|f'(a)|_{2}={\frac {1}{2}},}

i.e.,b ≡ 1 mod 4. There are two square roots of 17 in the 2-adic integers, differing by a sign, and although they are congruent mod 2 they are not congruent mod 4. This is consistent with the general version of Hensel's lemma only giving us a unique 2-adic square root of 17 that is congruent to 1 mod 4 rather than mod 2. If we had started with the initial approximate roota = 3 then we could apply the more general Hensel's lemma again to find a unique 2-adic square root of 17 which is congruent to 3 mod 4. This is the other 2-adic square root of 17.

In terms of lifting the roots ofx217{\displaystyle x^{2}-17} from modulus 2k to 2k+1, the lifts starting with the root 1 mod 2 are as follows:

1 mod 2 → 1, 3 mod 4
1 mod 4 → 1, 5 mod 8 and 3 mod 4 → 3, 7 mod 8
1 mod 8 → 1, 9 mod 16 and 7 mod 8 → 7, 15 mod 16, while 3 mod 8 and 5 mod 8 don't lift to roots mod 16
9 mod 16 → 9, 25 mod 32 and 7 mod 16 → 7, 23 mod 16, while 1 mod 16 and 15 mod 16 don't lift to roots mod 32.

For everyk at least 3, there arefour roots ofx2 − 17 mod 2k, but if we look at their 2-adic expansions we can see that in pairs they are converging to justtwo 2-adic limits. For instance, the four roots mod 32 break up into two pairs of roots which each look the same mod 16:

9 = 1 + 23 and 25 = 1 + 23 + 24.
7 = 1 + 2 + 22 and 23 = 1 + 2 + 22 + 24.

The 2-adic square roots of 17 have expansions

1+23+25+26+27+29+210+{\displaystyle 1+2^{3}+2^{5}+2^{6}+2^{7}+2^{9}+2^{10}+\cdots }
1+2+22+24+28+211+{\displaystyle 1+2+2^{2}+2^{4}+2^{8}+2^{11}+\cdots }

Another example where we can use the more general version of Hensel's lemma but not the basic version is a proof that any 3-adic integerc ≡ 1 mod 9 is a cube inZ3.{\displaystyle \mathbb {Z} _{3}.} Letf(x)=x3c{\displaystyle f(x)=x^{3}-c} and take initial approximationa = 1. The basic Hensel's lemma cannot be used to find roots off(x) sincef(r)0mod3{\displaystyle f'(r)\equiv 0{\bmod {3}}} for everyr. To apply the general version of Hensel's lemma we want|f(1)|3<|f(1)|32,{\displaystyle |f(1)|_{3}<|f'(1)|_{3}^{2},} which meansc1mod27.{\displaystyle c\equiv 1{\bmod {2}}7.} That is, ifc ≡ 1 mod 27 then the general Hensel's lemma tells usf(x) has a 3-adic root, soc is a 3-adic cube. However, we wanted to have this result under the weaker condition thatc ≡ 1 mod 9. Ifc ≡ 1 mod 9 thenc ≡ 1, 10, or 19 mod 27. We can apply the general Hensel's lemma three times depending on the value ofc mod 27: ifc ≡ 1 mod 27 then usea = 1, ifc ≡ 10 mod 27 then usea = 4 (since 4 is a root off(x) mod 27), and ifc ≡ 19 mod 27 then usea = 7. (It is not true that everyc ≡ 1 mod 3 is a 3-adic cube, e.g., 4 is not a 3-adic cube since it is not a cube mod 9.)

In a similar way, after some preliminary work, Hensel's lemma can be used to show that for anyodd prime numberp, anyp-adic integerc congruent to 1 modulop2 is ap-th power inZp.{\displaystyle \mathbb {Z} _{p}.} (This is false forp = 2.)

Generalizations

[edit]

SupposeA is acommutative ring,complete with respect to anidealm,{\displaystyle {\mathfrak {m}},} and letf(x)A[x].{\displaystyle f(x)\in A[x].}aA is called an "approximate root" off, if

f(a)0modf(a)2m.{\displaystyle f(a)\equiv 0{\bmod {f}}'(a)^{2}{\mathfrak {m}}.}

Iff has an approximate root then it has an exact rootbA "close to"a; that is,

f(b)=0andbamodm.{\displaystyle f(b)=0\quad {\text{and}}\quad b\equiv a{\bmod {\mathfrak {m}}}.}

Furthermore, iff(a){\displaystyle f'(a)} is not a zero-divisor thenb is unique.

This result can be generalized to several variables as follows:

Theorem. LetA be a commutative ring that is complete with respect to idealmA.{\displaystyle {\mathfrak {m}}\subset A.} Letf1,,fnA[x1,,xn]{\displaystyle f_{1},\ldots ,f_{n}\in A[x_{1},\ldots ,x_{n}]} be a system ofn polynomials inn variables overA. Viewf=(f1,,fn),{\displaystyle \mathbf {f} =(f_{1},\ldots ,f_{n}),} as a mapping fromAn to itself, and letJf(x){\displaystyle J_{\mathbf {f} }(\mathbf {x} )} denote itsJacobian matrix. Supposea = (a1, ...,an) ∈An is an approximate solution tof =0 in the sense that
fi(a)0mod(detJf(a))2m,1in.{\displaystyle f_{i}(\mathbf {a} )\equiv 0{\bmod {(}}{\det J_{\mathbf {f} }(a)})^{2}{\mathfrak {m}},\qquad 1\leqslant i\leqslant n.}
Then there is someb = (b1, ...,bn) ∈An satisfyingf(b) =0, i.e.,
fi(b)=0,1in.{\displaystyle f_{i}(\mathbf {b} )=0,\qquad 1\leqslant i\leqslant n.}
Furthermore this solution is "close" toa in the sense that
biaimoddetJf(a)m,1in.{\displaystyle b_{i}\equiv a_{i}{\bmod {\det }}J_{\mathbf {f} }(a){\mathfrak {m}},\qquad 1\leqslant i\leqslant n.}

As a special case, iffi(a)0modm{\displaystyle f_{i}(\mathbf {a} )\equiv 0{\bmod {\mathfrak {m}}}} for alli anddetJf(a){\displaystyle \det J_{\mathbf {f} }(\mathbf {a} )} is a unit inA then there is a solution tof(b) =0 withbiaimodm{\displaystyle b_{i}\equiv a_{i}{\bmod {\mathfrak {m}}}} for alli.

Whenn = 1,a =a is an element ofA andJf(a)=Jf(a)=f(a).{\displaystyle J_{\mathbf {f} }(\mathbf {a} )=J_{f}(a)=f'(a).} The hypotheses of this multivariable Hensel's lemma reduce to the ones which were stated in the one-variable Hensel's lemma.

Related concepts

[edit]

Completeness of a ring is not a necessary condition for the ring to have the Henselian property:Goro Azumaya in 1950 defined a commutativelocal ring satisfying the Henselian property for themaximal idealm to be aHenselian ring.

Masayoshi Nagata proved in the 1950s that for any commutative local ringA with maximal idealm there always exists a smallest ringAh containingA such thatAh is Henselian with respect tomAh. ThisAh is called theHenselization ofA. IfA isnoetherian,Ah will also be noetherian, andAh is manifestly algebraic as it is constructed as a limit ofétale neighbourhoods. This means thatAh is usually much smaller than the completion while still retaining the Henselian property and remaining in the samecategory[clarification needed].

See also

[edit]

References

[edit]
  1. ^Gras, Georges (2003).Class field theory : from theory to practice. Berlin.ISBN 978-3-662-11323-3.OCLC 883382066.{{cite book}}: CS1 maint: location missing publisher (link)
  2. ^Neukirch, Jürgen (1999).Algebraic Number Theory. Berlin, Heidelberg: Springer Berlin Heidelberg.ISBN 978-3-662-03983-0.OCLC 851391469.
  3. ^Conrad, Keith."Hensel's Lemma"(PDF). p. 4.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Hensel%27s_lemma&oldid=1275481796"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp