draft-ietf-ospf-scalability-05.txt  -->   draft-ietf-ospf-scalability-06.txt

view Side-By-Side changes




   Internet Engineering Task Force           Gagan L. Choudhury Choudhury, Editor
   Internet Draft                                  Vera D. Sapozhnikov                                                  AT&T
   Expires in November, 2003                       Gerald R. Ash February, 2004                      
   Category: Best Current Practice                 AT&T
   draft-ietf-ospf-scalability-05.txt              
                                                   Anurag S. Maunder
                                                   Erlang Technology

                                                   Vishwas Manral
                                                   Motorola Inc.

                                                   May,                         August, 2003    
   draft-ietf-ospf-scalability-06.txt              
                                                 
                                                

           Prioritized Treatment of Specific OSPF 
           Packets and Congestion Avoidance



Status of this Memo

   This document is an Internet-Draft and is in full conformance
   with all provisions of Section 10 of RFC2026.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six
   months and may be updated, replaced, or obsoleted by other documents
   at any time.  It is inappropriate to use Internet-Drafts as
   reference material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
        http://www.ietf.org/ietf/1id-abstracts.html
   The list of Internet-Draft Shadow Directories can be accessed at
        http://www.ietf.org/shadow.html.
   Distribution of this memo is unlimited.


IPR Notices

   (A)   The IETF takes no position regarding the validity or scope of
         any intellectual property or other rights that might be claimed
         to  pertain to the implementation or use of the technology
         described in this


Abstract

   This document or the extent to which any license
         under such rights might or might not be available; neither does
         it represent recommends methods that it has made any effort are intended to identify any such
         rights.  Information on improve the IETF's procedures with respect 
   scalability and stability of large networks using OSPF (Open Shortest
   Path First) protocol.  The methods include processing OSPF Hellos and
   LSA (Link State Advertisement) Acknowledgments at a higher priority
   compared to
         rights in standards-track other OSPF packets, and standards-related documentation other congestion avoidance 
   procedures.








   Choudhury et. al.     Best Current Practice               [Page 1]

   Internet Draft        Prioritized Treatment         November, 2003

         can be found in BCP-11.  Copies of claims         February, 2004

Table of rights made
         available for publication Contents

   1. Introduction...................................................2
   2. Recommendations................................................3
   3. Security Considerations........................................6
   4. Acknowledgments................................................6
   5. Normative Reference............................................6
   6. Informative References.........................................6
   7. Contributing Authors and any assurances their Addresses.......................8
   Appendix A. LSA Storm: Causes and Impact..........................8
   Appendix B. List of licenses to
         be made available, or Variables and Values.........................11
   Appendix C. Other Recommendations................................12

1. Introduction

   A large network running OSPF [Ref1] protocol may occasionally 
   experience the result simultaneous or near-simultaneous update of an attempt made
         to obtain a general license or permission for the use large
   number of such
         proprietary rights by implementors link-state-advertisements, or users LSAs.  This is particularly 
   true if OSPF traffic engineering extension [Ref2] is used which 
   may significantly increase the number of this
         specification can be obtained from LSAs in the IETF Secretariat.

   (B)   The IETF invites any interested party to bring to its
         attention any copyrights, patents or patent applications, or
         other proprietary rights which may cover technology that may be
         required to practice network.  
   We call this standard.  Please address the
         information to the IETF Executive Director.


Copyright Notice

   (C)   Copyright (C) The Internet Society (date). All Rights
         Reserved.

         This document event, an LSA storm and translations of it may be copied and
         furnished to others, and derivative works that comment on or
         otherwise explain it initiated by an
   unscheduled failure or assist in its implementation a scheduled maintenance event.
   The failure may be
         prepared, copied, published and distributed, in whole hardware, software, or procedural in
         part, without restriction of any kind, provided that the above
         copyright notice and this paragraph are included on all such
         copies nature.
 
   The LSA storm causes high CPU and derivative works.  However, this document itself may
         not be modified in any way, such as by removing memory utilization at the copyright
         notice or references router
   causing incoming packets to the Internet Society be delayed or other Internet
         organizations, except as needed for dropped.  
   Delayed acknowledgments (beyond the  purpose of developing
         Internet standards retransmission timer value) 
   result in which case retransmissions, and delayed Hello packets (beyond the procedures for copyrights
         defined 
   router-dead interval) result in the Internet Standards process must be followed, or
         as required to translate it into languages other than English. neighbor adjacencies being declared 
   down. The limited permissions granted above are perpetual retransmissions and will
         not be revoked by the Internet Society or its successors or
         assigns.

         This document additional LSA originations result in 
   further CPU and the information contained herein is provided
         on memory usage, essentially causing a positive feedback
   loop, which, in the extreme case, may drive the network to an "AS IS" basis 
   unstable state.

   The default value of retransmission timer is 5 seconds and THE INTERNET SOCIETY AND THE INTERNET
         ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR
         IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE
         OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY
         IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A
         PARTICULAR PURPOSE.

Abstract

   This document recommends methods that are intended of 
   the router-dead interval is 40 seconds.  However, recently there 
   has been a lot of interest in significantly reducing OSPF convergence
   time. As part of that plan much shorter (sub-second) Hello and
   router-dead intervals have been proposed [Ref3].  In such a scenario
   it will be more likely for Hello packets to be delayed beyond
   the router-dead interval during network congestion
   caused by an LSA storm.

   In order to improve the scalability and stability of large networks using we
   recommend steps for prioritizing critical OSPF (Open Shortest
   Path First) protocol. packets and avoiding
   congestion. The methods include processing OSPF Hellos details of the recommendations are given in Section
   2.  A simulation study is reported in [Ref12] that quantifies the
   congestion phenomenon and
   LSA (Link State Advertisement) Acknowledgments at a higher priority its impact.  It also studies several of the
   recommendations and shows that they indeed improve the scalability 
   and stability of networks using OSPF protocol.  [Ref12] is available 

  
   Choudhury et. al.     Best Current Practice               [Page 2]

   Internet Draft        Prioritized Treatment         November, 2003

   compared to other OSPF packets, and other congestion avoidance 
   procedures. Simulation results in support of some         February, 2004

   on request by contacting the editor or one of the 
   recommendations are given in the appendix sections.

Table of Contents

   1. Introduction...................................................3
   2. Recommendations................................................4
   3. Security Considerations........................................6
   4. Acknowledgments................................................6
   5. Normative References...........................................6
   6. Informative References.........................................6
   7. Authors' Addresses.............................................7
   Appendix A. LSA Storm: Causes and Impact..........................8
   Appendix B. Simulation Study.....................................10
   Appendix B.1. The Network Under Simulation.......................10
   Appendix B.2. Simulation Results.................................13
   Appendix B.3. Observations on Simulation Results.................17
   Appendix C. Other Recommendations................................17

1. Introduction

   A large network running OSPF [Ref1] or OSPF-TE [Ref2] protocol may 
   occasionally experience the simultaneous or near-simultaneous update 
   of a large number of link-state-advertisements, or LSAs.  
   We call this event, an LSA storm and it may be initiated by an
   unscheduled failure or a scheduled maintenance event.
   The failure may be hardware, software, or procedural in nature.
 
   The LSA storm causes high CPU and memory utilization at the router
   causing incoming packets to be delayed or dropped.  
   Delayed acknowledgments (beyond the retransmission timer value) 
   result in retransmissions, and delayed Hello packets (beyond the 
   router-dead interval) result in neighbor adjacencies being declared 
   down. The retransmissions and additional LSA originations result in 
   further CPU and memory usage, essentially causing a positive feedback
   loop, which, in the extreme case, may drive the network to an 
   unstable state.

   The default value of retransmission timer is 5 seconds and that of 
   the router-dead interval is 40 seconds.  However, recently there 
   has been a lot of interest in significantly reducing OSPF convergence
   time. As part of that plan much shorter (subsecond) Hello and
   router-dead intervals have been proposed [Ref3].  In such a scenario
   it will be more likely for Hello packets to be delayed beyond
   the router-dead interval during network congestion
   caused by an LSA storm.

   Appendix A explains authors.

   Appendix A explains in more detail LSA storm scenarios,
   their impact, and points out a few real-life examples of 
   control-message storms.  Appendix B presents provides a simulation study 
  
   Choudhury et. al.     Best Current Practice               [Page 3]

   Internet Draft        Prioritized Treatment         November, 2003

   on this phenomenon.

   In order to improve the scalability and stability of networks we
   recommend steps for prioritizing critical OSPF packets and avoiding
   congestion. The details list of the recommendations are given variables
   used in Section 2.  We also do a simulation study on a subset of the recommendations in Appendix B and show that they indeed improve the 
   scalability and stability of networks using OSPF protocol. their example values.
   Appendix C provides some further recommendations with similar goals.

2. Recommendations

   The Recommendations below are intended to improve the scalability
   and stability of large networks using OSPF protocol.  During
   periods of network congestion they would reduce retransmissions,
   avoid an adjacency to be declared down due to Hello packets 
   being delayed beyond the RouterDeadInterval, and take other
   congestion avoidance steps.  The recommendations are unordered
   except that Recommendation 2 is to be implemented only if
   Recommendation 1 is not implemented.

   (1) Classify all OSPF packets in two classes: a "high priority"
       class comprising of OSPF Hello packets and Link State 
       Acknowledgment packets, and a "low priority" class
       comprising of all other packets. The classification is
       accomplished by examining the OSPF packet header. While
       receiving a packet from a neighbor and while transmitting
       a packet to a neighbor, try to process a "high priority" 
       packet ahead of a "low priority" packet.

       The prioritized processing may cause OSPF packets from a neighbor
       to be received out of sequence. If
       Cryptographic Authentication (AuType = 2) is used (as specified
       in [Ref1]) then successive received valid OSPF packets from a 
       neighbor need to have a non-decreasing "Cryptographic sequence 
       number".  To comply with this requirement we recommend that
       the receiver maintain two separate sequence numbers for OSPF
       packets belonging to the two priority classes.  This will work
       since within the same priority class, OSPF packets will be
       received in sequence.
       
   (2) If the Recommendation 1 cannot be implemented then reset the
       inactivity timer for an adjacency whenever any OSPF unicast
       packet or any OSPF packet sent to 224.0.0.5 AllSPFRouters over a 
       point-to-point link is received over that adjacency instead of
       resetting the inactivity timer only on receipt of the
       Hello packet.  So OSPF would declare the adjacency to be down 
       only if no OSPF unicast packets and or no OSPF packets sent to 
       224.0.0.5 
       AllSPFRouters over a point-to-point link are received over 
       that adjacency for a period equaling or exceeding the 
       RouterDeadInterval.  The reason for not recommending this
       proposal in conjunction with Recommendation 1 is to avoid 


   Choudhury et. al.     Best Current Practice               [Page 3]

   Internet Draft        Prioritized Treatment         February, 2004

       potential undesirable side effects.  One such effect is the 
       delay in discovering the down status of
       an adjacency in a case where no high priority Hello packets are
       being received but the inactivity timer is being reset by other
       stale packets in the low priority queue. 

   (3) Use an Exponential Backoff algorithm for determining the value
       of the LSA retransmission interval (RxmtInterval).  Let R(i) 
       represent the RxmtInterval value used during the i-th 

   Choudhury et. al.     Best Current Practice               [Page 4]

   Internet Draft        Prioritized Treatment         November, 2003 
       retransmission of an LSA.  Use the following algorithm to 
       compute R(i)

                    R(1) = Rmin
                    R(i+1) = Min(KR(i),Rmax)  for i>=1

       where K, Rmin and Rmax are constants and the function
       Min(.,.) represents the minimum value of its two arguments.
       Example values for K, Rmin and Rmax may be 2, 5 seconds 
       and 40 seconds respectively.  Note that the example value for
       Rmin, the initial retransmission interval, is the same as the
       sample value of RxmtInterval in [Ref1].

       This recommendation is motivated by the observation that during
       a network congestion event caused by control messages, a major 
       source for sustaining the congestion is the repeated
       retransmission of LSAs.  The use of an Exponential Backoff 
       algorithm for the LSA retransmission interval reduces the rate 
       of LSA retransmissions while the network experiences 
       congestion (during which it is more likely that multiple 
       retransmissions of the same LSA would happen).  This in turn 
       helps the network get out of the congested state.

   (4) Implicit Congestion Detection and Action Based on That:
       If there is control message congestion at a router, its 
       neighbors do not know about that explicitly.  However, they 
       can implicitly detect it based on the number of unacknowledged
       LSAs to this router.  If this number exceeds a certain "high
       water mark" then the rate at which LSAs are sent to this router
       should be reduced. reduced progressively using an exponential backoff
       mechanism but not below a certain minimum rate.  At a future 
       time, if the number of unacknowledged LSAs to this router falls 
       below a certain "low water mark" then the normal rate of sending
       LSAs to this router should be resumed.  An example value for the "high
       water mark" may increased progressively, again 
       using an exponential backoff mechanism but not above a certain 
       maximum rate.  The whole algorithm is given below.  It is to be 20 unacknowledged LSAs and 
       noted that for the "low
       water mark" may be 10 unacknowledged LSAs.  An example
       value for the rate on exceeding the "high water mark" may be
       50% the normal rate.  This recommendation this algorithm is to be implemented applied independently to each
       neighbor and only for unicast LSAs sent to a neighbor or LSAs 
       sent to 224.0.0.5 AllSPFRouters over a point-to-point link.

       Recommendations 3 and 4 both slow down

       Let,
       U(t) = Number of unacknowledged LSAs to congested 
       neighbors neighbor at time t.

   Choudhury et. al.     Best Current Practice               [Page 4]

   Internet Draft        Prioritized Treatment         February, 2004


       H = A high water mark (in units of number of unacknowledged LSAs)
       L = A low water mark (in units of number of unacknowledged LSAs)
       G(t) = Gap between sending successive LSAs to neighbor at time t.
       F = The factor by which the above gap is to be increased during
           congestion and decreased after coming out of congestion.
       T = Minimum time that has to elapse before the existing gap
           is considered for change.
       Gmin = Minimum allowed value of gap.
       Gmax = Maximum allowed value of gap.

       The equation below shows how the gap is to be changed after a 
       time T has elapsed since the last change:
                 _
                |
                | Min(FG(t),Gmax) if U(t+T) > H
       G(t+T) = | G(t) if H >= U(t+T) >= L
                | Max(G(t)/F,Gmin) if U(t+T) < L 
                |_

       Min(.,.) and Max(.,.) represent the minimum and maximum values
       of the two arguments respectively. 
       Example values for the various parameters of the algorithm are
       as follows: H = 20, L = 10, F = 2, T = 1 second, Gmin = 20 ms, 
       Gmax = 1 second.

       Recommendations 3 and 4 both slow down LSAs to congested 
       neighbors based on implicitly detecting the congestion but
       they have important differences. Recommendation 3 progressively
       slows down successive retransmissions of the same LSA whereas
       Recommendation 3 progressively slows down all LSAs (new or 
       retransmission) to a congested neighbor.

   (5) Throttling Adjacencies to be Brought Up Simultaneously:
       If a router tries to bring up a large number of adjacencies to
       its neighbors simultaneously then that may cause severe
       congestion due to database synchronization and LSA flooding

   Choudhury et. al.     Best Current Practice               [Page 5]

   Internet Draft        Prioritized Treatment         November, 2003
       activities.  It is recommended that during such a situation
       no more than "n" adjacencies should be brought up
       simultaneously.  Once a subset of adjacencies have been brought 
       up successfully, newer adjacencies may be brought up as long as        
       the number of simultaneous adjacencies being brought up does not
       exceed "n". The appropriate value of "n" would depend on the 
       router processing power, link bandwidth and propagation delay. 
       An example 
       The value may of "n" should be 4. configurable.

       In the presence of throttling, an important issue is the order 
       in which adjacencies are to be formed.  We recommend a First 
       Come First Served (FCFS) policy based on the order in which the
       request for adjacency formation arrives.  Requests may either be


   Choudhury et. al.     Best Current Practice               [Page 5]

   Internet Draft        Prioritized Treatment         February, 2004
                 
       from neighbors or self-generated. Among the self-generated 
       requests a priority list may be used to decide the order in which
       the requests are to be made.  However, once an adjacency
       formation process starts it is not to be preempted except
       for unusual circumstances such as errors or time-outs.

3. Security Considerations
   
   This memo does not create any creates one new security issues issue for the OSPF protocol.  Security considerations for the base OSPF protocol are
   covered  
   Recommendation 1 in [Ref1].

4. Acknowledgments

   We would like Section 2 and Recommendation 2 in Appendix C 
   proposes prioritized processing of OSPF packets which may cause
   packets from a neighbor to acknowledge the support be received out of sequence.
   If Cryptographic Authentication (AuType = 2) is used (as specified
   in [Ref1]) then successive received valid OSPF WG chairs
   Rohit Dube, Acee Lindem, and John Moy.  We acknowledge packets from a 
   neighbor need to have a non-decreasing "Cryptographic sequence 
   number".  To comply with this requirement we recommend that
   the receiver maintain separate sequence numbers for OSPF
   packets belonging to different priority classes.  This will work
   since within the same priority class, OSPF packets will be
   received in sequence.

   Security considerations for the base OSPF protocol are
   covered in [Ref1].

4. Acknowledgments

   We would like to acknowledge the support and helpful comments from
   OSPF WG chairs Rohit Dube, Acee Lindem, John Moy and Routing Area 
   directors Alex Zinin and Bill Fenner.  We acknowledge Vivek Dube, 
   Mitchell Erblich, Mike Fox, Tony Przygienda, and Krishna Rao for 
   comments on previous versions of the draft.  We also acknowledge 
   Margaret Chiosi, Elie Francis, Jeff Han, Beth Munson,
   Roshan Rao, Moshe Segal, Mike Wardlow, and Pat Wirth for 
   collaboration and encouragement in our scalability
   improvement efforts for Link-State-Protocol based networks.

5. Normative References Reference

   [Ref1] J. Moy, "OSPF Version 2", RFC 2328, April, 1998.

6. Informative References

   [Ref2] D. Katz, D. Yeung, K. Kompella, "Traffic Engineering
   Extension to OSPF Version 2," Work in Progress.

   [Ref3] C. Alaettinoglu, V. Jacobson and H. Yu, "Towards Milli-
   second IGP Convergence," Work in Progress.

   [Ref4] Pappalardo, D., "AT&T, customers grapple with ATM net 
   outage," Network World, February 26, 2001.


   Choudhury et. al.     Best Current Practice               [Page 6]

   Internet Draft        Prioritized Treatment         November, 2003         February, 2004


   [Ref5] "AT&T announces cause of frame-relay network outage," AT&T 
   Press Release, April 22, 1998.

   [Ref6] Cholewka, K., "MCI Outage Has Domino Effect," Inter@ctive 
   Week, August 20, 1999.

   [Ref7] Jander, M., "In Qwest Outage, ATM Takes Some Heat," Light
   Reading, April 6, 2001.

   [Ref8] A. Zinin and M. Shand, "Flooding Optimizations in Link-State
   Routing Protocols," Work in Progress.

   [Ref9] P. Pillay-Esnault, "OSPF Refresh and flooding reduction in  
   stable topologies," Work in progress.

   [Ref10] G. Ash, G. Choudhury, V. Sapozhnikova, M. Sherif, A.  
   Maunder, V. Manral, "Congestion Avoidance & Control for OSPF 
   Networks", Work in Progress.

   [Ref11] B. M. Waxman, "Routing of Multipoint Connections," IEEE
   Journal on Selected Areas in Communications, 6(9):1617-1622, 1988.

   [Ref12] G. Choudhury, G. Ash, V. Manral, A. Maunder and V.
   Sapozhnikova, "Prioritized Treatment of Specific OSPF Packets
   and Congestion Avoidance: Algorithms and Simulations," AT&T 
   Technical Report, August, 2003.

   [Ref13] K. Nichols, S. Blake, F. Baker and D. Black, "Definition of
   the Differentiated Services Field (DS Field) in the IPV4 and IPV6
   Headers", RFC 2474, December, 1998.





















   Choudhury et. al.     Best Current Practice               [Page 7]

   Internet Draft        Prioritized Treatment         February, 2004

   
7. Authors' Contributing Authors and their Addresses

   In addition to the Editor, several people contributed to this 
   document.  The names and contact information of all authors
   are given below.
  

   Gagan L. Choudhury                     Anurag S. Maunder 
   AT&T                                   Erlang Technology
   Room D5-3C21                           2880 Scott Boulevard
   200 Laurel Avenue                      Santa Clara, CA 95052
   Middletown, NJ, 07748                  USA
   USA                                    Phone: (408)420-7617
   Phone: (732)420-3721                   email: anuragm@erlangtech.com
   email: gchoudhury@att.com              


   Gerald R. Ash                          Vera D. Sapozhnikova
   AT&T                                   AT&T
   Room D5-2A01                           Room C5-2C29
   200 Laurel Avenue
   Middletown, NJ, 07748
   USA
   Phone: (732)420-2653
   email: sapozhnikova@att.com

   Gerald R. Ash
   AT&T
   Room D5-2A01                      200 Laurel Avenue
   Middletown, NJ, 07748                  Middletown, NJ, 07748 
   USA                                    USA
   Phone: (732)420-4578                   Phone: (732)420-2653 
   email: gash@att.com

   Choudhury et. al.     Best Current Practice               [Page 7]

   Internet Draft        Prioritized Treatment         November, 2003

   Anurag S. Maunder
   Erlang Technology
   2880 Scott Boulevard
   Santa Clara, CA 95052
   Phone: (408)420-7617                    email: anuragm@erlangtech.com sapozhnikova@att.com

                                          
   Vishwas Manral                         
   Motorola Inc.                          
   189, Prashasan Nagar,                  
   Road Number 72                         
   Jubilee Hills, Hyderabad               
   India                                  
   email: vishwas@motorola.com            

   

Appendix A. LSA Storm: Causes and Impact

   An LSA storm may be initiated due to many reasons.  Here
   are some examples:  

   (a) one or more link failures due to fiber cuts,

   (b) one or more router failures for some reason, e.g., software
       crash or some type of disaster (including power outage)
       in an office complex hosting many routers,

   (c) Link/router flapping,


   Choudhury et. al.     Best Current Practice               [Page 8]

   Internet Draft        Prioritized Treatment         February, 2004


   (d) requirement of taking down and later bringing back many 
       routers during a software/hardware upgrade, 

   (e) near-synchronization of the periodic 1800 second LSA refreshes
       of a subset of LSAs, 

   (f) refresh of all LSAs in the system during a change in software 
       version,  

   (g) injecting a large number of external routes to OSPF due to
       a procedural error,

   (h) Router ID changes causing a large number of LSA re-originations
       (possibly LSA purges as well depending on the implementation).

   In addition to the LSAs originated as a direct result of link/router 
   failures, there may be other indirect LSAs as well.  One example in
   MPLS networks is traffic engineering LSAs [Ref2] originated at other
   links as a result of significant change in reserved bandwidth 
   resulting from rerouting of Label Switched Paths (LSPs) that went 
   down during the link/router failure.



   Choudhury et. al.     Best Current Practice               [Page 8]

   Internet Draft        Prioritized Treatment         November, 2003
   The LSA storm causes high CPU and memory utilization at the router
   processor causing incoming packets to be delayed or dropped.  
   Delayed acknowledgments (beyond the retransmission timer value) 
   results in retransmissions, and delayed Hello packets (beyond the 
   Router-Dead interval) results in links being declared down.  A
   trunk-down event causes Router LSA origination by its end-point
   routers.  If traffic engineering LSAs are used for each link then
   that type of LSAs would also be originated by the end-point routers
   and potentially elsewhere as well due to significant changes in
   reserved bandwidths at other links caused by the failure and reroute
   of LSPs originally using the failed trunk.  Eventually, when the
   link recovers that would also trigger additional Router LSAs and 
   traffic engineering LSAs.

   The retransmissions and additional LSA originations result in further 
   CPU and memory usage, essentially causing a positive feedback loop.  
   We define the LSA storm size as the number of LSAs in the original 
   storm and not counting any additional LSAs resulting from the  
   feedback loop described above.  If the LSA storm is too large then
   the positive feedback loop mentioned above may be large enough to 
   indefinitely sustain a large CPU and memory utilization at many 
   routers in the network, thereby driving the network to an unstable
   state. In the past, network
   outage events have been reported in IP and ATM networks using 
   link-state protocols such as OSPF, IS-IS, PNNI or some proprietary 
   variants.  See for example [Ref4-Ref7].  In many of these examples,




   Choudhury et. al.     Best Current Practice               [Page 9]

   Internet Draft        Prioritized Treatment         February, 2004

   large scale flooding of LSAs or other similar control messages 
   (either naturally or triggered by some bug or inappropriate 
   procedure) have been partly or fully responsible for network 
   instability and outage.

   In Appendix B, we use [Ref12] a simulation model is used to show that there 
   is a certain LSA storm size threshold above which the 
   network may show unstable behavior caused by large number of 
   retransmissions, link failures due to missed Hello packets and 
   subsequent link recoveries.  We  It is also show shown
   that the LSA storm size causing instability may be substantially
   increased by providing prioritized treatment to Hello and LSA 
   Acknowledgment packets and by using an exponential backoff
   algorithm for determining the LSA retransmission interval.  
   If it is not possible to prioritize Hello packets then resetting
   the inactivity timer on receiving any valid OSPF packets can also
   provide the same benefit. Furthermore, if we prioritize Hello 
   packets then even when the network operates somewhat above the 
   stability threshold, links are not declared down due to missed 
   Hellos.  This implies that even though there is 
   control plane congestion due to many retransmissions, the data plane
   stays up and no new LSAs are originated (besides the ones in the 
   original storm and the refreshes).  These observations are the
   basis of support
   the first three recommendations in Section 2.




   Choudhury et. al.     Best Current Practice               [Page 9]

   Internet Draft        Prioritized Treatment         November, 2003 The authors of this 
   draft have also done simulations to verify that the other 
   recommendations in Section 2 helps avoid congestion and allows a
   graceful exit from a congested state.

   One might argue that the scalability issue of large networks should
   be solved solely by dividing the network hierarchically into 
   multiple areas so that flooding of LSAs remains localized within 
   areas.  However, this approach increases the network management 
   and design complexity and may result in less optimal routing between 
   areas. Also, ASE LSAs are flooded throughout the AS and it may be
   a problem if there are large numbers of them.  Furthermore, 
   a large number of summary LSAs may need to be flooded across
   Areas and their numbers would increase significantly if 
   multiple Area Border Routers are employed for the purpose of
   reliability. Thus it is important to allow the network to grow 
   towards as large a size as possible under a single area.  
   
   Our  
   
   The recommendations in the draft are synergistic with a broader set 
   of scalability and stability improvement proposals. [Ref8] proposes
   flooding overhead reduction in case more than one interface goes to 
   the same neighbor.  [Ref9] proposes a mechanism for 
   greatly reducing LSA refreshes in stable topologies.

   [Ref10] proposes a wide range of congestion control and failure 
   recovery mechanisms (some of those ideas are covered in this
   draft but [Ref10] has other ideas not covered here).   


   Choudhury et. al.     Best Current Practice               [Page 10]

   Internet Draft        Prioritized Treatment          February, 2004


Appendix B. Simulation Study

   The main motivation List of this study is to show the network congestion Variables and instability caused Values

   F    = The factor by large LSA storms and which the improvement in
   stability and scalability that can gap between sending successive LSAs to
          a neighbor is to be achieved by following the
   recommendations increased during congestion and decreased 
          after coming out of congestion (used in this memo.

Appendix B.1. The Network Under Simulation

   We generate a random network over a rectangular grid using Recommendation 4).
          Example value is 2.

   G(t) = Gap between sending successive LSAs to a  
   modified version neighbor at time t
          (used in Recommendation 4).

   Gmax = Maximum allowed value of Waxman's algorithm [Ref11] that ensures that 
   the network is connected and has gap between sending successive LSAs 
          to a pre-specified number neighbor (used in Recommendation 4). Example value is 1
          second.

   Gmin = Minimum allowed value of routers, 
   links, maximum number gap between sending successive LSAs 
          to a neighbor (used in Recommendation 4). Example value is 
          20 ms.

   H    = A high water mark (in units of neighbors per router, and maximum number of adjacencies per router. The rectangular grid resembles unacknowledged LSAs).
          Exceeding this mark would trigger a potential increase in the 
   continental U.S.A. with maximum one-way propagation delay of 30 ms
          gap between sending successive LSAs to a neighbor.
          (used in Recommendation 4). Example value is 20.

   K    = A multiplicative constant used in increasing the East-West direction and maximum one-way propagation delay RxmtInterval
          value used during successive retransmissions of 
   15 ms in the North-South direction.  We consider two different 
   network sizes as explained same LSA
          (used in Section B.2.

   The network has a flat, single-area topology.
 
   Each link Recommendation 3). Example value is 2.

   L    = A low water mark (in units of number of unacknowledged LSAs)
          Dropping below this mark would trigger a point-to-point link connecting two routers.
  

   We assume that router CPU and memory (not potential decrease 
          in the link bandwidth) gap between sending successive LSAs to a neighbor.
          (used in Recommendation 4). Example value is 10.

   n    = Upper limit on the 
   main bottleneck number of adjacencies to be brought up
          simultaneously (used in Recommendation 5).

   R(i) = RxmtInterval value used during the LSA flooding process.  This will typically 
   be true for high speed links (e.g., OC3 or above) and/or links 
   where OSPF traffic gets i-th retransmission of
          an adequate Quality LSA (used in Recommendation 3).

   Rmax = The maximum allowed value of RxmtInterval (used in 
          Recommendation 3). Example value is 40 seconds.

   Rmin = The minimum allowed value of Service (QoS) RxmtInterval (used in 
          Recommendation 3). Example value is 5 seconds.

   T    = Minimum time that has to elapse before the existing gap 
          between sending successive LSAs to a neighbor 
          is considered for change (used in Recommendation 4). Example 
          value is 1 second.



   Choudhury et. al.     Best Current Practice               [Page 10] 11]

   Internet Draft        Prioritized Treatment         November, 2003

   compared to other traffic.

   Different Timers: 
     LSA refresh interval = 1800 seconds, 
     Hello refresh interval = 10 Seconds, 
     Router-Dead interval          February, 2004


   U(t) = 40 seconds, 
     LSA retransmission interval: two values are considered, 10 seconds 
       and 5 Seconds (note that Number of unacknowledged LSAs to a retransmission is disabled neighbor at time t
          (used in Recommendation 4).
         

Appendix C. Other Recommendations
 
   (1) Explicit Marking:  In Section 2 we recommended that OSPF packets
       be classified to "high" and "low" priority classes based on
       examining the 
       receipt of either an explicit acknowledgment or a duplicate LSA
       over OSPF packet header.  In some cases (particularly
       in the same interface that acts as an implicit acknowledgment) 
     Minimum time between successive origination receiver) this examination may be computationally
       costly.  An alternative would be the
       use of different TOS/Precedence field settings for the same LSA = 5 
       seconds, 
     Minimum time between successive Dijkstra SPF calculations 
       is 1 second.

   Packing of LSAs: It is assumed that for any given router, two 
       priority classes.  [Ref1] recommends setting the LSAs 
   originated over a 1-second period are packed together TOS field to form an LSU
   but no more than 3 LSAs are packed in one LSU.



   LSU/Ack/Hello Processing Times: All processing times are expressed  
   in terms of the parameter T.  Two values of T are considered, 1 ms 0 
       and 0.5 ms.

   In the case of a dedicated processor Precedence field to 6 for processing all OSPF packets the 
   processing time reported represents the true processing time. If packets.  We recommend
       this same setting for the 
   processor does other work "low" priority OSPF packets and only a fraction of its capacity can
       different setting for the "high" priority OSPF packets in order 
       to be 
   dedicated able to OSPF processing then we have classify them separately without having to inflate examine
       the processing 
   time appropriately OSPF packet header.  Two examples are given below:

       Example 1: For "low" priority packets set TOS field to get the effective processing time 0 and in that 
   case it is assumed that the inflation factor is already taken into 
   account as part of the reported processing time. 

   The fixed time
                  Precedence field to send or receive any LSU, Ack or Hello packet is T.
   In addition, a variable processing time is used 6, and for LSU "high" priority
                  packets set TOS field to 4 and Ack 
   depending on the number Precedence field to 6.

       Example 2: For "low" priority packets set TOS field to 0 and
                  Precedence field to 6, and types of LSAs packed.  No variable 
   processing time is used for Hello.
   Variable processing time per Router LSA is (0.5 + 0.17L)T where L "high" priority
                  packets set TOS field to 0 and Precedence field to 7.

       It is to be noted that the number of adjacencies advertised TOS/Precedence bits have been 
       redefined by Diffserv (RFC 2474, [Ref13]). It is also to be 
       noted that the Router LSA.  For other 
   LSA types (e.g., ASE LSA or a "Link" LSA carrying traffic 
   engineering information about a link), different TOS/Precedence field settings suggested
       above only need to be agreed among the variable processing time  
   per LSA is 0.5T.

   Variable processing time for an Ack is 25% that of systems on the corresponding
   LSA.

   It link.
       This recommendation is not needed to be noted that followed if multiple LSAs are packed it is easy
       to examine the OSPF packet header and thereby separately
       classify "high" and "low" priority packets.

   (2) Further Prioritization of OSPF Packets: Besides the packets 
       designated as "high" priority in Recommendation 1 of Section 2 
       there may be a single LSU 
   packet then need for further priority separation among the fixed processing time is needed only once but 
       "low" priority OSPF packets.  We recommend the 
   variable processing time is needed for every component use of the 
   packet. three 
       priority classes: "high", "medium" and "low". While
       receiving a packet from a neighbor and while transmitting
       a packet to a neighbor, try to process a "high priority" 
       packet ahead of "medium" and "low" priority packets and 
       a "medium" priority packet ahead of "low priority" packets.
       The "high" priority packets are as designated in Recommendation
       1 of Section 2.  We provide below two candidate examples for
       "medium" priority packets.  All OSPF packets not designated
       as "high" or "medium" priority are "low" priority.
   


   Choudhury et. al.     Best Current Practice               [Page 11] 12]

   Internet Draft        Prioritized Treatment         November, 2003

   The processing time values we use are roughly          February, 2004


       If Cryptographic Authentication (AuType = 2) is used (as 
       specified in [Ref1]) then the same range of 
   what has been observed in an operational network.

   LSU/Ack/Hello Priority: Two non-preemptive priority levels and
   three priority scenarios are considered. Within each priority level 
   processing is FIFO with new receiver should maintain separate 
       sequence numbers for OSPF packets of lower belonging to different 
       priority being
   dropped when the lower classes. 

       One example of "medium" priority queue packet is full.  The higher priority
   packets are never dropped.    
      In Priority scenario 1, all LSUs/Acks/Hellos received at a router 
      are queued at the lower priority.
      In Priority scenario 2, Hellos received at a router are queued at 
      the higher priority but LSUs/Acks are queued at lower priority.
      In Priority scenario 3, Hellos and Acks received at a router are 
      queued at the higher priority but LSUs are queued at lower 
      priority. 
   All packets generated internally to a router (usually triggered by
       Database Description (DBD) packet from a timer) are processed at the higher priority.  This includes slave (during the 
   initial LSA storm, LSA refresh, Hello refresh, LSA retransmission 
   and new LSA origination after detection of a failure or recovery.

   Buffer Size for Incoming LSUs/Acks/Hellos (lower priority): Buffer 
   size
       database synchronization process) that is assumed to be 2000 packets where a packet used as an
       acknowledgment.  

       A second example is either an Ack, 
   LSU, or Hello. 

   LSA Refresh: Each LSA is refreshed once in 1800 seconds carrying
       intra-area topology change information (this may trigger
       SPF calculation and the 
   refresh instants rerouting of various LSAs in the LSDB are assumed to be 
   uniformly distributed over Label Switched paths and so
       fast processing of this packet may improve OSPF/LDP convergence
       times). However, if the 1800 seconds period, i.e., they are 
   completely unsynchronized.  If however, an LSA is originated as part processing cost of identifying and 
       separately queueing the initial LSA storm then it goes on a new refresh schedule of 
   once in 1800 seconds starting from its origination time.   

   LSA Storm Origination: As defined earlier, "LSA storm" this example is deemed to be high 
       then the 
   simultaneous or near simultaneous origination of a large number of 
   LSAs. In implementor may decide not to do it.


Intellectual Property Considerations

   (A)   The IETF takes no position regarding the case validity or scope of only Router and ASE LSAs we normally assume
         any intellectual property or other rights that might be claimed
         to  pertain to the number implementation or use of ASE LSAs in the storm is about 4 times that of technology
         described in this document or the Router LSAs, but the ratio is allowed extent to change if either the  
   Router which any license
         under such rights might or might not be available; neither does
         it represent that it has made any effort to identify any such
         rights.  Information on the ASE LSAs have reached their maximum possible value.   
   In the case IETF's procedures with respect to
         rights in standards-track and standards-related documentation
         can be found in BCP-11.  Copies of only Router claims of rights made
         available for publication and Link LSAs (carrying traffic  
   engineering information) we normally assume that the number any assurances of Link  
   LSAs in licenses to
         be made available, or the storm is about 4 times that result of the Router LSAs, but the  
   ratio is allowed an attempt made
         to change if either the Router obtain a general license or permission for the Link LSAs have
   reached their maximum possible value.  For any given LSA storm we 
   keep originating LSAs starting from router index 1 and moving  
   upwards and stop until the correct number use of LSAs such
         proprietary rights by implementors or users of each type have  
   been originated. this
         specification can be obtained from the IETF Secretariat.

   (B)   The LSAs originated at IETF invites any given router is assumed interested party to start at an instant uniformly distributed between 20 and 30 
   seconds from the start of the simulation.  Successive LSA 
   originations at a router are assumed bring to its
         attention any copyrights, patents or patent applications, or
         other proprietary rights which may cover technology that may be spaced apart by 400 ms. 
   It is
         required to be noted that during the period of observation there are 
   other LSAs originated besides the ones in practice this standard.  Please address the storm.  These include 
   refresh of LSAs that are not part of
         information to the storm and LSAs originated IETF Executive Director.










   Choudhury et. al.     Best Current Practice               [Page 12] 13]

   Internet Draft        Prioritized Treatment         November, 2003

   due to possible link failures and subsequent possible link 
   recoveries.

   Failure/Recovery of Links: If no Hello is received over a link (due 
   to CPU/memory congestion) for longer than Router-Dead Interval then
   the link is declared down.  At a later time, if Hellos are received
   then the link would be declared up.  Whenever a link is declared
   up or down, one Router LSA is originated by each router on the
   two sides of the point-to-point link.  If "Link LSAs" carrying
   traffic engineering information is used then it is assumed that each
   router would also originate a Link LSA.  In this case it is also 
   assumed that due to rerouting of LSPs, three other links in the 
   network (selected randomly in the simulation) would have significant 
   change in reserved bandwidth which would result in one Link LSA 
   being originated by the routers on the two ends of each such link.







Appendix B.2. Simulation Results

   In this section we study the relative performance of the three 
   priority scenarios defined earlier (no priority to Hello or Ack, 
   priority to Hello only, and priority to both Hello and Ack) with a 
   range of Network sizes, LSA retransmission timer values, LSA types, 
   processing time values and Hello/Router-Dead-Interval values:

   Network size: Two networks are considered. Network 1 has 100 routers,
   1200 links, maximum number of neighbors per router is 30 and maximum 
   number of adjacencies per router is 50 (same neighbor may have more 
   than one adjacencies).   Network 2 has 50 routers, 600 links, maximum 
   number of neighbors per router is 25 and maximum number of 
   adjacencies per router is 48. Dijkstra SPF calculation time for 
   Network 1 is assumed to be 100 ms and that for Network 2 is assumed 
   to be 70 ms.

   LSA Type: Each router has 1 Router LSA (Total of 100 for Network 1 
   and 50 for Network 2). There are no Network LSAs since all links are
   point-to-point links and no Summary LSAs since the network has only 
   one area. Regarding other LSA types we consider two situations.  In  
   Situation 1 we assume that there are no ASE LSAs and each link has  
   one "Link" LSA carrying traffic engineering information (Total of  
   2400 for Network 1 and 1200 for Network 2). In Situation 2 we assume
   that there are no "Link" LSAs and half of the routers are ASA-Border  
   routers and each border router has 10 ASE LSAs (Total of 500 for  
   Network 1 and 250 for Network 2).  We identify Situation 1 as "Link  
   LSAs" and Situation 2 as "ASE LSAs".


   Choudhury et. al.     Best Current Practice               [Page 13]

   Internet Draft        Prioritized Treatment         November, 2003

   LSA retransmission timer value: Two values are considered, 10 
   seconds and 5 seconds (default value).
   
   Processing time values: Processing times for LSUs, Acks and Hello 
   packets have been previously expressed in terms of a common  
   parameter T.  Two values are considered for T, which are 1 ms 
   and 0.5 ms respectively.

   Hello/Router-Dead-Interval: It is assumed that Router-Dead interval
   is four times the Hello interval.  In one case it is assumed that
   Hello interval is 10 seconds and Router-Dead-Interval is 40
   seconds (default values), and in the other case it is assumed that 
   Hello interval is 2 seconds and Router-Dead-Interval is 8 seconds. 

   Based on Network size, LSA type and processing time values we 
   develop 6 Test cases as follows:

   Case 1: Network 1, Link LSAs, retransmission timer = 10 sec., 
           T = 1 ms, Hello/Router-Dead-Interval = 10/40 sec.

   Case 2: Network 1, ASE LSAs, retransmission timer = 10 sec., 
           T = 1 ms, Hello/Router-Dead-Interval = 10/40 sec.
   Case 3: Network 1, Link LSAs, retransmission timer = 5 sec., 
           T = 1 ms, Hello/Router-Dead-Interval = 10/40 sec.

   Case 4: Network 1, Link LSAs, retransmission timer = 10 sec., 
           T = 0.5 ms, Hello/Router-Dead-Interval = 10/40 sec.

   Case 5: Network 1, Link LSAs, retransmission timer = 10 sec., 
           T = 1 ms, Hello/Router-Dead-Interval = 2/8 sec.

   Case 6: Network 2, Link LSAs, retransmission timer = 10 sec., 
           T = 1 ms, Hello/Router-Dead-Interval = 10/40 sec.

   For each case and for each Priority scenario we study the network 
   stability as a function of the size of the LSA storm.  The stability
   is determined by looking at the number of non-converged LSUs as a   
   function of time. An example is shown in Table 1 for Case 1 and 
   Priority scenario 1 (No priority to Hellos or Acks).













   Choudhury et. al.     Best Current Practice               [Page 14]

   Internet Draft        Prioritized Treatment         November, 2003

   
=========|==========================================================
         | Number of Non-Converged LSUs in the Network at Time(in sec)
    LSA  |                    
   STORM |====|=====|=====|=====|=====|=====|=====|=====|========|==
   SIZE  |10s | 20s | 30s | 35s | 40s | 50s | 60s | 80s | 100s   |
=========|====|=====|=====|=====|=====|=====|=====|=====|========|==
    100  | 0  |  0  | 24  | 29  | 24  |  1  |  0  |  1  |  1     |
 (Stable)|    |     |     |     |     |     |     |     |        |
---------|----|-----|-----|-----|-----|-----|-----|-----|--------|--
    140  | 0  |  0  | 35  | 48  | 46  | 27  | 14  |  1  |  1     |
 (Stable)|    |     |     |     |     |     |     |     |        |
---------|----|-----|-----|-----|-----|-----|-----|-----|--------|--
    160  | 0  |  0  | 38  | 57  | 55  | 40  | 26  | 65  | 203    |
(Unstable)    |     |     |     |     |     |     |     |        |
=========|==========================================================

           Table 1: Network Stability Vs. LSA Storm 
              (Case 1, No priority to Hello/Ack)

   The LSA storm starts a little after 20 seconds and so for some 
   period of time after that the number of non-converged LSUs should
   stay high and then come down for a stable network. 
   This happens for LSA storms of sizes 100 and 140.  With an LSA storm
   of size 160, the number of non-converged LSUs stay high indefinitely
   due to repeated retransmissions, link failures due to missed Hellos  
   for more than the Router-Dead interval which originates additional 
   LSAs and also due to subsequent link recoveries which again 
   originate additional LSAs.  We define network stability threshold as
   the maximum allowable LSA storm size for which the number of 
   non-converged LSUs come down to a low level after some time. It 
   turns out that for this example the stability threshold is
   150. 

   The network behavior as a function of the LSA storm size can
   be categorized as follows:

   (1) If the LSA storm is well below the stability threshold then
       the CPU/memory congestion lasts only for a short period and
       during this period there are very few retransmissions, very
       few dropped OSPF packets and no link
       failures due to missed Hellos.  This type of LSA storm is
       observed routinely in operational networks and networks
       recover from them easily.

   (2) If the LSA storm is just below the stability threshold then
       the CPU/memory congestion lasts for a longer period and during
       this period there may be considerable amount of retransmissions
       and dropped OSPF packets.  If Hello packets are not given
       priority then there may also be some link failures due to
       missed Hellos.  However, the network does go back to a stable

   Choudhury et. al.     Best Current Practice               [Page 15]

   Internet Draft        Prioritized Treatment         November, 2003

       state eventually. This type of LSA storm may happen rarely in
       operational networks and they recover from it with some
       difficulty. 

   (3) If the LSA storm is above the stability threshold then
       the CPU/memory congestion may last indefinitely unless
       some special procedure for relieving congestion is followed. 
       During this period there are considerable amount of 
       retransmissions and dropped OSPF packets.  If Hello packets are
       not given priority then there would also be link failures due 
       to missed Hellos.  This type of LSA storm may happen very rarely
       in operational networks and usually some manual procedure such as
       taking down adjacencies in heavily congested routers is needed.

   (4) If Hello packets are given priority then the network stability
       threshold increases, i.e., the network can withstand a larger
       LSA storm. Furthermore, even if the network operates at or 
       somewhat above this higher stability threshold, Hellos are 
       still not missed and so there are no link failures.  So even 
       if there is congestion in the control plane due to increased 
       retransmissions requiring some special procedures for congestion
       reduction, the data plane remains unaffected.
        
   (5) If both Hello and Acknowledgement packets are given priority
       then the stability threshold increases even further.       
   
   In Table 2 we show the network stability threshold for the five  
   different cases and for the three different priority scenarios
   defined earlier.  

|===========|========================================================|
|           |    Maximum Allowable LSA Storm Size For                |
|   Case    |=================|==================|===================|
|  Number   | No Priority to  |Priority to Hello | Priority to Hello |
|           |  Hello or Ack   |      Only        |   and Ack         |
|===========|=================|==================|===================|
|   Case 1  |        150      |        190       |        250        |
|___________|_________________|__________________|___________________|
|   Case 2  |        185      |        215       |        285        |
|___________|_________________|__________________|___________________|
|   Case 3  |        115      |        127       |        170        |
|___________|_________________|__________________|___________________|
|   Case 4  |        320      |        375       |        580        |
|___________|_________________|__________________|___________________|
|   Case 5  |        120      |        175       |        225        |
|___________|_________________|__________________|___________________|
|   Case 6  |        185      |        224       |        285        |
|___________|_________________|__________________|___________________|

       Table 2: Maximum Allowable LSA Storm for a Stable Network


   Choudhury et. al.     Best Current Practice               [Page 16]

   Internet Draft        Prioritized Treatment         November, 2003

   We also considered one more scenario with priority to Hello and Ack
   and with a truncated binary exponential backoff of the 
   retransmission interval with an upper limit of 40 seconds (for the 
   same LSA, each successive retransmission interval
   is doubled but not to exceed 40 seconds).  The maximum allowed
   LSA storm size for this scenario significantly exceeded the numbers 
   given in the third column.

Appendix B.3. Observations on Simulation Results

   Table 2 shows that in all cases prioritizing Hello packets increases
   the network stability threshold, and in addition, prioritization of  
   LSA Acknowledgment packets increases the stability threshold even
   further.  The reasons for the above observations are as follows.
   The main sources of sustained CPU/memory congestion (or positive
   feedback loop) following an LSA storm are (1) LSA retransmissions 
   and (2) links being declared down due to missed Hellos which in 
   turn causes further LSA origination and future recovery of the link   
   causing even more LSA originations. 
   Prioritizing Hello packets avoids and practically eliminates the
   second source of congestion.  Prioritizing Acknowledgments 
   significantly reduces the first source of congestion, i.e.,
   LSA retransmissions.  It is to be noted that retransmissions can
   not be completely eliminated due to the following reasons. Firstly,
   only the explicit Acknowledgments are prioritized but duplicate
   LSAs carrying implicit Acknowledgments are still served at the 
   lower priority.  Secondly, LSAs may get greatly delayed or dropped
   at the input queue of receivers and therefore Acknowledgments may
   not even get generated in which case prioritizing Acks would not   
   help. Another factor to keep in mind is that since Hellos and Acks  
   are prioritized, the LSAs see bigger delay and potential for 
   dropping. However, the simulation results show that on the whole 
   prioritizing Hello and LSA Acks are always beneficial and 
   significantly improve the network stability threshold.    

   As stated in Section B.2, exponenetial backoff of LSA retransmission
   interval further increases the network stability threshold. 

   Our simulation study also showed that in each of the cases, instead 
   of prioritizing Hello packets if we treat any packet received over 
   a link as a surrogate for a Hello packet (an implicit Hello) then
   we get about the same stability threshold as obtained with
   prioritizing Hello packets.


Appendix C. Other Recommendations
 
   (1) Explicit Marking:  In Section 2 we recommended that OSPF packets
       be classified to "high" and "low" priority classes based on
       examining the OSPF packet header.  In some cases (particularly
       in the receiver) this examination may be computationally

   Choudhury et. al.     Best Current Practice               [Page 17]

   Internet Draft        Prioritized Treatment         November, 2003

       costly.  An alternative would be the
       use of different TOS/Precedence field settings for the two 
       priority classes.  [Ref1] recommends setting the TOS field to 0 
       and the Precedence field to 6 for all OSPF packets.  We recommend
       this same setting for the "low" priority OSPF packets and a
       different setting for the "high" priority OSPF packets in order 
       to be able to classify them separately without having to examine
       the OSPF packet header.  Two examples are given below:

       Example 1: For "low" priority packets set TOS field to 0 and
                  Precedence field to 6, and for "high" priority
                  packets set TOS field to 4 and Precedence field to 6.

       Example 2: For "low" priority packets set TOS field to 0 and
                  Precedence field to 6, and for "high" priority
                  packets set TOS field to 0 and Precedence field to 7.

       This recommendation is not needed to be followed if it is easy
       to examine the OSPF packet header and thereby separately
       classify "high" and "low" priority packets.

   (2) Further Prioritization of OSPF Packets: Besides the packets 
       designated as "high" priority in Recommendation 1 of Section 2 
       there may be a need for further priority separation among the 
       "low" priority OSPF packets.  We recommend the use of three 
       priority classes: "high", "medium" and "low". While
       receiving a packet from a neighbor and while transmitting
       a packet to a neighbor, try to process a "high priority" 
       packet ahead of "medium" and "low" priority packets and 
       a "medium" priority packet ahead of "low priority" packets.          February, 2004


Copyright Notice

   (C)   Copyright (C) The "high" priority packets are as designated in Recommendation
       1 of Section 2.  We provide below two candidate examples for
       "medium" priority packets.  All OSPF packets not designated
       as "high" or "medium" priority are "low" priority.

       One example of "medium" priority packet is the
       Database Description (DBD) packet from a slave (during the
       database synchronization process) Internet Society (date). All Rights
         Reserved.

         This document and translations of it may be copied and
         furnished to others, and derivative works that is used as an
       acknowledgment.  

       A second example is an LSA carrying
       intra-area topology change information (this comment on or
         otherwise explain it or assist in its implementation may trigger
       SPF calculation be
         prepared, copied, published and rerouting distributed, in whole or in
         part, without restriction of Label Switched paths any kind, provided that the above
         copyright notice and so
       fast processing of this packet may improve OSPF/LDP convergence
       times). paragraph are included on all such
         copies and derivative works.  However, if this document itself may
         not be modified in any way, such as by removing the processing cost copyright
         notice or references to the Internet Society or other Internet
         organizations, except as needed for the  purpose of identifying and 
       separately queueing developing
         Internet standards in which case the LSA procedures for copyrights
         defined in this example is deemed the Internet Standards process must be followed, or
         as required to translate it into languages other than English.

         The limited permissions granted above are perpetual and will
         not be high 
       then revoked by the implementor may decide not to do it. Internet Society or its successors or
         assigns.

         This document and the information contained herein is provided
         on an "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET
         ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR
         IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE
         OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY
         IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A
         PARTICULAR PURPOSE.





















   Choudhury et. al.     Best Current Practice               [Page 18] 14]
----