Movatterモバイル変換

RFC 8767	DNS Serve-Stale	March 2020
Lawrence, et al.	Standards Track	[Page]

3.Background

There are a number of reasons why an authoritative server may becomeunreachable, including Denial-of-Service (DoS) attacks, networkissues, and so on. If a recursive server is unable to contact theauthoritative servers for a query but still has relevant data that hasaged past its TTL, that information can still be useful for generatingan answer under the metaphorical assumption that "stale bread isbetter than no bread."¶

[RFC1035],Section 3.2.1 says that the TTL "specifies the timeinterval that the resource record may be cached before the source ofthe information should again be consulted."[RFC1035],Section 4.1.3 furthersays that the TTL "specifies the time interval (in seconds) that theresource record may be cached before it should be discarded."¶

A natural English interpretation of these remarks would seem to beclear enough that records past their TTL expiration must not be used.However,[RFC1035] predates the more rigorous terminology of[RFC2119], which softened the interpretation of "may" and "should".¶

[RFC2181] aimed to provide "the precise definition of the Time toLive," butSection 8 of [RFC2181]was mostly concerned with the numeric range ofvalues rather than data expiration behavior. It does, however, closethat section by noting, "The TTL specifies a maximum time to live, nota mandatory time to live." This wording again does not contain BCP 14key words[RFC2119], but it does convey the natural languageconnotation that data becomes unusable past TTL expiry.¶

As of the time of this writing, several large-scale operators use staledata for answers in some way. A number of recursive resolver packages,including BIND, Knot Resolver, OpenDNS, and Unbound, provide options to use stale data.Apple macOS can also use stale data as part of the Happy Eyeballs algorithms inmDNSResponder. The collective operational experience is that using stale datacan provide significant benefit with minimal downside.¶

4.Standards Action

The definition of TTL in Sections 3.2.1 and4.1.3 of[RFC1035] isamended to read:¶

TTL: a 32-bit unsigned integer number of seconds that specifies theduration that the resource recordMAY be cached before the source ofthe informationMUST again be consulted. Zero values are interpretedto mean that the RR can only be used for the transaction in progress,and should not be cached. ValuesSHOULD be capped on the order ofdays to weeks, with a recommended cap of 604,800 seconds (7 days). If thedata is unable to be authoritatively refreshed when the TTL expires,the recordMAY be used as though it is unexpired. See Sections 5and6 of[RFC8767] for details.¶

Interpreting values that have the high-order bit set as beingpositive, rather than 0, is a change from[RFC2181], the rationalefor which is explained inSection 6.Suggesting a cap of 7 days, rather than the 68 years allowed by the full31 bits ofSection 8 of [RFC2181], reflects the current practice of major modern DNSresolvers.¶

When returning a response containing stale records, a recursiveresolverMUST set the TTL of each expired record in the message to avalue greater than 0, with aRECOMMENDED value of 30 seconds. SeeSection 6 for explanation.¶

Answers from authoritative servers that have a DNS response code ofeither 0 (NoError) or 3 (NXDomain) and the Authoritative Answer (AA)bit setMUST be considered to have refreshed the data at the resolver.Answers from authoritative servers that have any other response codeSHOULD be considered a failure to refresh the data and therefore leaveany previous state intact. SeeSection 6 fora discussion.¶

5.Example Method

There is more than one way a recursive resolver couldresponsibly implement this resiliency feature while still respectingthe intent of the TTL as a signal for when data is to be refreshed.¶

In this example method, four notable timers drive considerations forthe use of stale data:¶

A client response timer, which is the maximum amount of time arecursive resolver should allow between the receipt of a resolutionrequest and sending its response.¶
A query resolution timer, which caps the total amount of time arecursive resolver spends processing the query.¶
A failure recheck timer, which limits the frequency at which afailed lookup will be attempted again.¶
A maximum stale timer, which caps the amount of timethat records will be kept past their expiration.¶

Most recursive resolvers already have the query resolution timer and,effectively, some kind of failure recheck timer. The clientresponse timer and maximum stale timer are new concepts for thismechanism.¶

When a recursive resolver receives a request, it should startthe client response timer. This timer is used to avoid clienttimeouts. It should be configurable, with a recommended value of 1.8seconds as being just under a common timeout value of 2 seconds whilestill giving the resolver a fair shot at resolving the name.¶

The resolver then checks its cache for any unexpired records thatsatisfy the request and returns them if available. If itfinds no relevant unexpired data and the Recursion Desired flag is notset in the request, it should immediately return the response withoutconsulting the cache for expired records. Typically, this responsewould be a referral to authoritative nameservers covering the zone,but the specifics are implementation dependent.¶

If iterative lookups will be done, then the failure recheck timer isconsulted. Attempts to refresh from non-responsive or otherwisefailing authoritative nameservers are recommended to be done no morefrequently than every 30 seconds. If this request was received withinthis period, the cache may be immediately consulted for stale data tosatisfy the request.¶

Outside the period of the failure recheck timer, the resolvershould start the query resolution timer and begin the iterativeresolution process. This timer bounds the work done by the resolverwhen contacting external authorities and is commonly around 10 to 30seconds. If this timer expires on an attempted lookup that is stillbeing processed, the resolution effort is abandoned.¶

If the answer has not been completely determined by the time theclient response timer has elapsed, the resolver should then check itscache to see whether there is expired data that would satisfy therequest. If so, it adds that data to the response message with a TTLgreater than 0 (as specified inSection 4). The response is then sent tothe client while the resolver continues its attempt to refresh thedata.¶

When no authorities are able to be reached during a resolutionattempt, the resolver should attempt to refresh the delegation andrestart the iterative lookup process with the remaining time on thequery resolution timer. This resumption should be done only onceper resolution effort.¶

Outside the resolution process, the maximum stale timer is used forcache management and is independent of the query resolutionprocess. This timer is conceptually different from the maximum cacheTTL that exists in many resolvers, the latter being a clamp on thevalue of TTLs as received from authoritative servers and recommendedto be 7 days in the TTL definition inSection 4.The maximum stale timershould be configurable. It defines the length of time after a recordexpires that it should be retained in the cache. The suggested valueis between 1 and 3 days.¶

6.Implementation Considerations

This document mainly describes the issues behind serving stale dataand intentionally does not provide a formal algorithm. The concept isnot overly complex, and the details are best left to resolver authorsto implement in their codebases. The processing of serve-stale is alocal operation, and consistent variables between deployments are notneeded for interoperability. However, we would like to highlight theimpact of various implementation choices, starting with the timersinvolved.¶

The most obvious of these is the maximum stale timer. If this variableis too large, it could cause excessive cache memory usage, but if it istoo small, the serve-stale technique becomes less effective, as therecord may not be in the cache to be used if needed. Shorter values,even less than a day, can effectively handle the vast majority ofoutages. Longer values, as much as a week, give time for monitoringsystems to notice a resolution problem and for human intervention tofix it; operational experience has been that sometimes the rightpeople can be hard to track down and unfortunately slow to remedy thesituation.¶

Increased memory consumption could be mitigated by prioritizing removal of stale records over non-expired records during cache exhaustion. Eviction strategies could consider additional factors, including the last time of use or the popularity of a record, to retain active but stale records. A feature to manually flush only stale records could also be useful.¶

The client response timer is another variable that deservesconsideration. If this value is too short, there exists the risk thatstale answers may be used even when the authoritative server isactually reachable but slow; this may result in undesirable answersbeing returned. Conversely, waiting too long will negatively impactuser experience.¶

The balance for the failure recheck timer is responsiveness indetecting the renewed availability of authorities versus the extraresource use for resolution. If this variable is set too large, staleanswers may continue to be returned even after the authoritativeserver is reachable; per[RFC2308],Section 7, this should be nomore than 5 minutes. If this variable is too small, authoritativeservers may be targeted with a significant amount of excess traffic.¶

Regarding the TTL to set on stale records in the response,historically TTLs of 0 seconds have been problematic for someimplementations, and negative values can't effectively be communicatedto existing software. Other very short TTLs could lead to congestivecollapse as TTL-respecting clients rapidly try to refresh. Therecommended value of 30 seconds not only sidesteps those potential problemswith no practical negative consequences, it also rate-limitsfurther queries from any client that honors the TTL, such as aforwarding resolver.¶

As for the change to treat a TTL with the high-order bit set aspositive and then clamping it, as opposed to[RFC2181] treating itas zero, the rationale here is basically one of engineering simplicityversus an inconsequential operational history. Negative TTLs had norational intentional meaning that wouldn't have been satisfied by justsending 0 instead, and similarly there was realistically no practicalpurpose for sending TTLs of 2^25 seconds (1 year) or more. There'salso no record of TTLs in the wild having the most significant bit setin the DNS Operations, Analysis, and Research Center's (DNS-OARC's) "Day in the Life" samples[DITL]. With no apparentreason foroperators to use them intentionally, that leaves either errors ornon-standard experiments as explanations as to why such TTLs might beencountered, with neither providing an obviously compelling reason asto why having the leading bit set should be treated differently fromhaving any of the next eleven bits set and then capped perSection 4.¶

Another implementation consideration is the use ofstale nameserver addresses for lookups. This is mentioned explicitlybecause, in some resolvers, getting the addresses for nameservers isa separate path from a normal cache lookup. If authoritative serveraddresses are not able to be refreshed, resolution can possibly stillbe successful if the authoritative servers themselves are up. Forinstance, consider an attack on a top-level domain that takes itsnameservers offline; serve-stale resolvers that had expired glueaddresses for subdomains within that top-level domain would still be able toresolve names within those subdomains, even those it had notpreviously looked up.¶

The directive inSection 4 that only NoError and NXDomainresponses should invalidate any previously associated answer stemsfrom the fact that no other RCODEs that a resolver normallyencounters make any assertions regarding the name in the question orany data associated with it. This comports with existing resolverbehavior where a failed lookup (say, during prefetching) doesn'timpact the existing cache state. Some authoritative server operatorshave said that they would prefer stale answers to be used in the eventthat their servers are responding with errors like ServFail instead ofgiving true authoritative answers. ImplementersMAY decide to returnstale answers in this situation.¶

Since the goal of serve-stale is to provide resiliency for all obviouserrors to refresh data, these other RCODEs are treated as though theyare equivalent to not getting an authoritative response. AlthoughNXDomain for a previously existing name might well be an error, it isnot handled that way because there is no effective way to distinguishoperator intent for legitimate cases versus error cases.¶

During discussion in the IETF, it was suggested that,if all authorities return responses with an RCODE of Refused,it may be an explicit signal to take down the zone fromservers that still have the zone's delegation pointed to them.Refused, however, is alsooverloaded to mean multiple possible failures that could representtransient configuration failures. Operational experience has shownthat purposely returning Refused is a poor way to achieve anexplicit takedown of a zone compared to either updating the delegationor returning NXDomain with a suitable SOA for extended negativecaching. ImplementersMAY nonetheless consider whether totreat all authorities returning Refused as preempting the use of staledata.¶

7.Implementation Caveats

Stale data is used only when refreshing has failed in order to adhereto the original intent of the design of the DNS and the behaviorexpected by operators. If stale data were to always be usedimmediately and then a cache refresh attempted after the clientresponse has been sent, the resolver would frequently be sending datathat it would have had no trouble refreshing. Because modern resolvers usetechniques like prefetching and request coalescing for efficiency, itis not necessary that every client request needs to trigger a newlookup flow in the presence of stale data, but rather that agood-faith effort has been recently made to refresh the stale databefore it is delivered to any client.¶

It is important to continue the resolution attempt after the staleresponse has been sent, until the query resolution timeout, becausesome pathological resolutions can take many seconds to succeed as theycope with unavailable servers, bad networks, and other problems.Stopping the resolution attempt when the response with expired datahas been sent would mean that answers in these pathological caseswould never be refreshed.¶

The continuing prohibition against using data with a 0-second TTLbeyond the current transaction explicitly extends to it being unusableeven for stale fallback, as it is not to be cached at all.¶

Be aware that Canonical Name (CNAME) and DNAME records[RFC6672] mingled in the expiredcache with other records at the same owner name can cause surprisingresults. This was observed with an initial implementation in BINDwhen a hostname changed from having an IPv4 Address (A) record to aCNAME. The version of BIND being used did not evict other types inthe cache when a CNAME was received, which in normal operations is nota significant issue. However, after both records expired and theauthorities became unavailable, the fallback to stale answers returnedthe older A instead of the newer CNAME.¶

8.Implementation Status

The algorithm described inSection 5 wasoriginally implemented as a patch to BIND 9.7.0. It has been in useon Akamai's production network since 2011; it effectivelysmoothed over transient failures and longer outages that would haveresulted in major incidents. The patch was contributed to the InternetSystems Consortium, and the functionality is now available in BIND 9.12and later via the options stale-answer-enable, stale-answer-ttl, andmax-stale-ttl.¶

Unbound has a similar feature for serving stale answers and willrespond with stale data immediately if it has recently tried andfailed to refresh the answer by prefetching. Starting fromversion 1.10.0, Unbound can also be configured to follow thealgorithm described inSection 5. Both behaviors can beconfigured and fine-tuned with the available serve-expired-*options.¶

Knot Resolver has a demo module here:<https://knot-resolver.readthedocs.io/en/stable/modules-serve_stale.html>.¶

Apple's system resolvers are also known to use stale answers, but thedetails are not readily available.¶

In the research paper "When the Dike Breaks: Dissecting DNS DefensesDuring DDoS"[DikeBreaks], the authors detected some use ofstale answers by resolvers when authorities came under attack. Theirresearch results suggest that more widespread adoption of the techniquewould significantly improve resiliency for the large number of requeststhat fail or experience abnormally long resolution times during an attack.¶

10.Security Considerations

The most obvious security issue is the increased likelihood of DNSSECvalidation failures when using stale data because signatures could bereturned outside their validity period. Stale negative records can increasethe time window where newly published TLSA or DS RRs may not be used dueto cached NSEC or NSEC3 records. These scenarios would only be an issue ifthe authoritative servers are unreachable (the only time the techniques inthis document are used), and thus serve-stale does not introduce a newfailure in place of what would have otherwise been success.¶

Additionally, bad actors have been known to use DNS caches to keeprecords alive even after their authorities have gone away. The serve-stalefeature potentially makes the attack easier, although without introducinga new risk. In addition, attackers could combine this with a DDoS attack onauthoritative servers with the explicit intent of having stale informationcached for a longer period of time. But if attackers have this capacity, they probably coulddo much worse than prolonging the life of old data.¶

In[CloudStrife], it was demonstrated how stale DNS data, namelyhostnames pointing to addresses that are no longer in use by the ownerof the name, can be used to co-opt security -- for example, to getdomain-validated certificates fraudulently issued to an attacker.While this document does not create a new vulnerability in this area, itdoes potentially enlarge the window in which such an attack could bemade. A proposed mitigation is that certificate authorities should fullylook up each name starting at the DNS root for every name lookup.Alternatively, certificate authorities should use a resolver that is not serving stale data.¶

14.References

14.1.Normative References

[RFC1034]: Mockapetris, P.,"Domain names - concepts and facilities",STD 13,RFC 1034,DOI 10.17487/RFC1034,November 1987,<https://www.rfc-editor.org/info/rfc1034>.
[RFC1035]: Mockapetris, P.,"Domain names - implementation and specification",STD 13,RFC 1035,DOI 10.17487/RFC1035,November 1987,<https://www.rfc-editor.org/info/rfc1035>.
[RFC2119]: Bradner, S.,"Key words for use in RFCs to Indicate Requirement Levels",BCP 14,RFC 2119,DOI 10.17487/RFC2119,March 1997,<https://www.rfc-editor.org/info/rfc2119>.
[RFC2181]: Elz, R. and R. Bush,"Clarifications to the DNS Specification",RFC 2181,DOI 10.17487/RFC2181,July 1997,<https://www.rfc-editor.org/info/rfc2181>.
[RFC2308]: Andrews, M.,"Negative Caching of DNS Queries (DNS NCACHE)",RFC 2308,DOI 10.17487/RFC2308,March 1998,<https://www.rfc-editor.org/info/rfc2308>.
[RFC8174]: Leiba, B.,"Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words",BCP 14,RFC 8174,DOI 10.17487/RFC8174,May 2017,<https://www.rfc-editor.org/info/rfc8174>.

14.2.Informative References

[CloudStrife]: Borgolte, K., Fiebig, T., Hao, S., Kruegel, C., and G. Vigna,"Cloud Strife: Mitigating the Security Risks of Domain-Validated Certificates",DOI 10.1145/3232755.3232859,ACM 2018 Applied Networking Research Workshop,July 2018,<https://www.ndss-symposium.org/wp-content/uploads/2018/02/ndss2018_06A-4_Borgolte_paper.pdf>.
[DikeBreaks]: Moura, G.C.M., Heidemann, J., Müller, M., Schmidt, R. de O., and M. Davids,"When the Dike Breaks: Dissecting DNS Defenses During DDoS",DOI 10.1145/3278532.3278534,ACM 2018 Internet Measurement Conference,October 2018,<https://www.isi.edu/~johnh/PAPERS/Moura18b.pdf>.
[DITL]: DNS-OARC,"DITL Traces and Analysis",January 2018,<https://www.dns-oarc.net/oarc/data/ditl>.
[RFC6672]: Rose, S. and W. Wijngaards,"DNAME Redirection in the DNS",RFC 6672,DOI 10.17487/RFC6672,June 2012,<https://www.rfc-editor.org/info/rfc6672>.
[RFC6891]: Damas, J., Graff, M., and P. Vixie,"Extension Mechanisms for DNS (EDNS(0))",STD 75,RFC 6891,DOI 10.17487/RFC6891,April 2013,<https://www.rfc-editor.org/info/rfc6891>.
[RFC8499]: Hoffman, P., Sullivan, A., and K. Fujiwara,"DNS Terminology",BCP 219,RFC 8499,DOI 10.17487/RFC8499,January 2019,<https://www.rfc-editor.org/info/rfc8499>.

Movatterモバイル変換

RFC 8767

Serving Stale Data to Improve DNS Resiliency

Abstract

Status of This Memo

Copyright Notice

Table of Contents

1.Introduction

2.Terminology

3.Background

4.Standards Action

5.Example Method

6.Implementation Considerations

7.Implementation Caveats

8.Implementation Status

9.EDNS Option

10.Security Considerations

11.Privacy Considerations

12.NAT Considerations

13.IANA Considerations

14.References

14.1.Normative References

14.2.Informative References

Acknowledgements

Authors' Addresses