view Side-By-Side changes
Internet Draft Barbara Fraser Network Working Group SEI/CMU Expires in six monthsJulyDecember 1995 Site Security Handbookfor System and Network Administrators <draft-ietf-ssh-handbook-00.txt><draft-ietf-ssh-handbook-01.txt> Status of this Memo This document is an Internet-Draft. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet- Drafts as reference material or to cite them other than as ``work in progress.'' To learn the current status of any Internet-Draft, please check the ``1id-abstracts.txt'' listing contained in the Internet- Drafts Shadow Directories on ftp.is.co.za (Africa), nic.nordu.net (Europe), munnari.oz.au (Pacific Rim), ds.internic.net (US East Coast), or ftp.isi.edu (US West Coast). Table of Contents 1. Introduction.................................................... 2 1.1 Purpose of this Work............................................ 2 1.2 Audience........................................................ 2 1.3 Definitions..................................................... 3 1.4 Related Work.................................................... 3 2. Security Policies............................................... 3 2.1 What is a Computer Security Policy and Why Have One?............ 3 2.2 What Makes a Good Computer Security Policy?..................... 5 3. Architecture.................................................... 6 3.1 Objectives...................................................... 6 3.2 Network and Service Configurations.............................. 10 3.4 Firewalls....................................................... 13 4. Security Procedures............................................. 13 4.1 Authentication.................................................. 13 4.2 Authorization................................................... 15 4.3 Access.......................................................... 16 4.4 Modems.......................................................... 16 4.5 Cryptography.................................................... 18 4.6 Auditing........................................................ 18 4.7 Backups......................................................... 20 5. Security Incident Handling...................................... 20 5.1 Preparing and Planning for Incident Handling.................... 22 5.2 Notification and Points of Contact.............................. 24 5.3 Identifying an Incident......................................... 31 5.4 Handling an Incident............................................ 33 5.5 Aftermath of an Incident........................................ 38 Site Security Handbook Working Group [Page 1] Internet Draft Site Security Handbook November 1995 5.6 Responsibilities................................................ 39 6. Maintenance and Evaluation...................................... 40 6.1 Risk Assessments................................................ 40 6.2 Notification of Problems and Events............................. 40 Appendix A1 Tools and Locations..................................... 40 Appendix A2 Mailing Lists and Other Resources....................... 41 References........................................................... ? Annotated Bibliography............................................... ? 1. INTRODUCTION This document provides guidance to system and network administrators on how to address security issues within the Internet community. It builds on the foundation provided in RFC 1244 and is the collective work of a number of contributing authors. Those authors include:Philip J. Nesser, Klaus-Peter Kossakowski, Erik Gutmann, Nevil Brownlee,Jules P. Aronson,Edward.P.Lewis 2. SECURITY POLICIES 2.1 What Is a Security PolicyNevil Brownlee, Joao Nuno Ferreira, Erik Gutmann, Klaus-Peter Kossakowski, Edward.P.Lewis, Gary Malkin, Philip J. Nesser, andWhy Have One? The security-related decisionsMichael S. Ramsey. A special thank youmake, or failgoes tomake, as system administrator largely determines how secure or insecure your system is, how much functionality your system offers,Joyce Reynolds, ISI, and Paul Holbrook, CICnet, for their vision, leadership, andhow easy your system is to use. However, you cannot make good decisions about security without first determining what your security goals are. And until you determine what your security goals are, you cannot make effective useeffort in the creation ofany collectionthe first version ofsecurity tools, because you simply won't know what to check for, or what restrictions to impose. For instance, your goalsthis handbook. It is the working group's sincere hope that this version willprobablybevery different fromas helpful to thegoalscommunity as the earlier one was. 1.1 Purpose of this Work This handbook is ahardware or software vendor who ships your system software set up in a default state that allows maximum access (e.g., all services turned on), but provides minimal security. Your goals will be largely determined by the following key tradeoffs: (1) services offered (functionality) vs. security (2) ease of use vs. security (3) cost of security vs. risk of loss Your goals are communicatedguide toall users, operations staff, and managers through a set of security rules, called a "computer security policy". 2.1.1 Definition of a Computer Security Policy Asetting computer securitypolicy is a formal specification ofpolicies and procedures for sites that have systems on therules by which people are given access to an organization's technologyInternet. This guide lists issues andinformation assets. 2.1.2 Purposes offactors that aComputer Security Policy The main purposesite must consider when setting their own policies. It makes some recommendations and provides discussions of relevant areas. This guide is only acomputerframework for setting securitypolicy is to inform users, staffpolicies andmanagersprocedures. In order to have an effective set oftheir obligatory requirements for protecting technologypolicies andinformation assets through a consistent approach to policy matters. Another purpose is to provideprocedures, abaseline from whichsite will have toacquire, configuremake many decisions, gain agreement, andaudit computer systems for compliance with the policy. Therefore an attempt to use a set of security tools inthen communicate and implement theabsence ofpolicies. 1.2 Audience The audience for this document are system administrators and decision makers (who are more traditionally called "administrators" or "middle management") atleast an implied security policysites. This document ismeaningless. 2.2 What Makes a Good Computer Security Policy?not directed at programmers or those trying to create secure programs or systems. Thecharacteristicsfocus ofa good security policy are: (1) Ablethis document is on the policies and procedures that need to beimplemented (e.g. through system administration procedures, and by publishingin place to support any technical security features that a site may be implementing. The primary audience for this work are sites that are members ofacceptable use guidelines, etc.). (2) Canthe Internet community. However, this document should beenforced (e.g., viauseful to any site that allows communication with other sites. As a general guide to securitytools where appropriate, butpolicies, this document may alsofrom a human perspective, via sanctions). (3) Clearly definesbe useful to sites with isolated systems. Site Security Handbook Working Group [Page 2] Internet Draft Site Security Handbook November 1995 1.3 Definitions For theareas of responsibilities. The componentspurposes of this guide, agood security policy include: (1) Computer Technology Purchasing Guidelines - Specifies required"site" is any organization that owns computers orpreferred security featuresnetwork-related resources. These resources may include host computers that users use, routers, terminal servers, PC's or other devices that have access tosupplement existing purchasing policy and guidelines. (2) Privacy Policy - Defines reasonable expectationsthe Internet. A site may be a end user ofprivacy regardingInternet services or a service provider suchissuesasmonitoringa regional network. However, most ofelectronic mail and loggingthe focus ofkeystrokes. (3) Access Policy - Defines access rightsthis guide is on those end users of Internet services. We assume that the site has the ability to set policies andprivileges,procedures for itself with the concurrence andprotects assetssupport fromloss or disclosure by specifying acceptablethose who actually own the resources. The "Internet" is those set of networks and machines that useguidelines for users, operations staff,the TCP/IP protocol suite, connected through gateways, andmanagement. Provides guidelines for external connections, data communications, connecting devices tosharing anetwork,common name andadding new softwareaddress spaces [1]. The term "system administrator" is used toa system. Also provides notification guidelines (e.g., requires a login warning banner stating that the system iscover all those who are responsible forauthorized users only, insteadthe day-to-day operation ofdisplayingresources. This may be awelcome banner.) (4) Accountability Policy - Defines the responsibilitiesnumber ofusers, management, and operations staff. Specifies an audit capability, and provides incident handling guidelines. (5) Authentication Policy - Establishes trust throughindividuals or aneffective password policy, and by setting guidelines for remote location authentication, andorganization. The term "decision maker" refers to those people at a site who set or approve policy. These are often (but not always) theuse of authentication devices. (6) Availability - Defines availability expectations. Addressespeople who own theredundancy ofresources. 1.4 Related Work The IETF Guidelines for Security Incident Response Working Group (GRIP) is developing a document for securityfunctions. Specifies recovery procedures and mechanisms. (7) Violations Reporting - Denotes which types of violation must be reported, andincident response teams. That document provides additional guidance to those organizations planning to develop their own incident response team (IRT), including acentral reporting point. A non- threatening atmosphere and the possibility of anonymous reporting will result in a greater likelihoodtemplate thatif a violationisdetected, it will be reported. (8) Supporting Information - Provides users, staff,useful to IRTs when descibing their policies andmanagement with contact information for each type of policy violation. Gives guidelinesservices. 2. SECURITY POLICIES 2.1 What is a Computer Security Policy and Why Have One? The security-related decisions you make, or fail to make, as network administrator largely determines how secure or insecure yourpublic relations office onnetwork is, howhandle outside queries about a security incident. Provides cross-referencesmuch functionality your network offers, and how easy your network is to use. However, you cannot make good decisions about securityprocedures and related information, such as company policies and regulatory requirements (federal, state, and local). Oncewithout first determining what yourcomputersecuritypolicy has been established it should be clearly communicatedgoals are. Until you determine what your security goals are, you cannot make effective use of any collection of security tools because you simply will not know what tousers, staff, and management. Having all personnel sign a statement indicating that they have read, understood,check for andagreewhat restrictions toabide by the policy is an important part of the process. Finally,impose. For example, yourpolicy shouldgoals will probably bereviewed onvery different from the goals of aregular basisproduct vendor. Vendors are trying tosee if it is successfully supporting your security needs. 3. SECURITY PROCEDURES 3.1 Authentication 3.1.1 One-Time passwords Given today's networked environments, it is recommended that sites concerned about the securitymake configuration andintegrityoperation of theirsystems and networks consider moving away from standard, reusable passwords. There have been many incidents involving Trojan network programs (e.g., telnet and rlogin) and network packet sniffing programs. These programs capture clear-text hostname, account name, password triplets. Intruders can useproducts as simple as possible, which implies that thecaptured information for subsequentdefault configurations will often be as open (i.e., insecure) as possible. While this does make it easier to install new products, it also leaves access to thosehosts and accounts. This is possible because 1) the password is used oversystems, andover (henceother systems Site Security Handbook Working Group [Page 3] Internet Draft Site Security Handbook November 1995 through them, open to any user who wanders by. Your goals will be largely determined by theterm "reusable"), and 2)following key tradeoffs: (1) services offered vs. security provided - Each service offered to users carries its own security risks. For some services thepassword passes acrossrisk outweighs thenetwork in clear text. Several authentication techniques have been developed that address this problem. Among these techniques are challenge-response technologies that provide passwords that are only used once (commonly called one-time passwords). This document provides a listbenefit ofsources for products that provide this capability. The decision to use a product istheresponsibility of each organization, and each organization should perform its own evaluationservice andselection. 3.1.2 Kerberos <insert info about this technology here> 3.1.3 Password Assurance Whiletheneedadministrator may choose to eliminate theuse of standard, reusable passwords cannot be overstated, it is recognized that some organizations may haveservice rather than try totransitionsecure it. (2) ease of use vs. security - The easiest system totheuseof this technology. Givenwould allow acces to any user and require no passwords; thatsituation, we have includedis, there would be no security. Requiring passwords makes thefollowing sectionssystem a little less convenient, but more secure. Requiring device-generated one-time passwords makes the system even more difficult tohelp withuse, but much more secure. (3) cost of security vs. risk of loss - There are many different costs to security: monetary (i.e., theselectioncost of purchasing security hardware andmaintenancesoftware like firewalls and one-time password generators), performance (i.e., encryption and decryption take time), and ease oftraditional passwords. If you have tousethem, here is some pertinent advice. 3.1.3.1 The importance(as mentioned above). There are also many levels ofrobust passwords In almost all casesrisk: loss ofsystem penetration,privacy (i.e., theintruder needs to gain access to an interactive user account on the system. One way that goal is typically accomplished is through guessing the passwordreading ofa legitimate user. The intruder may attempt to guess a passwordinformation byknowing something aboutunauthorized individuals), loss of data (i.e., thelegitimate user,corruption or(more commonly) by trying dictionary wordserasure of information), andother simple guesses. This threat is increased bytheeasy availabilityloss of service (e.g., thepassword file, since an intruder may attempt to guess the encrypted passwords in that file by using another (perhaps fasterfilling of data storage space, usage of computational resources, andmore powerful) system to run automated "cracking" tools on the stolen password file. To guarddenial of network access). Each type of cost must be weighed againstthis threat, the simplest and most straightforward approach iseach type of loss. Your goals should be communicated toassure that the passwords onallaccounts are not easily guessed. Once the systemusers, operations staff, and managers through a set of security rules, called a "computer security policy." 2.1.1 Definition of a Computer Security Policy A computer security policy isguarded against this threat,a formal statement of theintruder would haverules by which people who are given access todevoteaninordinately large amountorganization's technology and information assets must abide. 2.1.2 Purposes ofresources to try to crack your passwords. Ita Computer Security Policy The main purpose of a computer security policy ismore likely that the intruder would trytofind another methodinform users, staff and managers ofpenetration, or would give uptheir obligatory requirements for protecting technology andtry another system. 3.1.3.2 Restricting access to the password fileinformation assets. Thefirst step to password assurancepolicy should specify the mechanisms through which these requirements can be met. Another purpose is tolimit accessprovide a baseline from which to acquire, configure and audit computer systems and networks for compliance with theencrypted portionpolicy. Therefore an attempt to use a set of security tools in theusers' password. If the encrypted portion is unavailable toabsence of at least anattacker, guessing passwords cannotimplied security policy is meaningless. 2.1.3 Who Should bedone off-line atInvolved When Forming Policy? In order for aremote (and possibly more powerful) location. The typical method usedsecurity policy toprotect the encrypted password isbe appropriate and effective, it Site Security Handbook Working Group [Page 4] Internet Draft Site Security Handbook November 1995 needs touse a shadow password file scheme. This requireshave theuseacceptance and support oftwo distinct password files.all levels of employees within the organization. Thefirst containsfollowing is a list of individuals who should be involved in thestandard password file information (username, full-name, login directory,creation andshell) but containsreview of security policy documents: (1) site security administrator (2) legal counsel (3) computing center personnel (4) network administrators of large user groups within the organization (e.g., business divisions, computer science department within adummyuniversity, etc.) (5) local orfalse encrypted password field. This informationnational security incident response team (6) representatives of the user groups affected by the security policy The list of above iskept in another password file thatrepresentative of many organizations, but is not necessarily comprehensive. The idea isavailable onlytothe root user or system. In this case, standard system utilities (such as FTP)bring in representation from key stakeholders, management who have budget and policy authority, technical staff who know what can andusers without elevated privilegescannotgain access tobe supported, and legal counsel who know the legal ramifications of various policy choices. Involving thisinformation. Itgroup is important if resulting policy statements are tonote that use of a shadow password file does not remove the need for robust password selection onreach thepartbroadest possible acceptance. 2.2 What Makes a Good Computer Security Policy? The characteristics ofthe user, but is one additional protection against penetration. 3.1.3.3 Initial password selection One problem often found in systems that have been penetrated is thatauser's initial password, set by thegood security policy are: (1) It must be implementable through systemadministrator, is easily guessed by an intruder. While the settingadministration procedures, publishing ofthis password is usually accompanied by the request that the passwordacceptable use guidelines, or other appropriate methods. (2) It must bechanged as soon as the accountenforcible with security tools, where appropriate, and with sanctions, where actual prevention isused, many users lack the knowledge or motivation to change it. Also, by giving a user a simple password, the system administrator sets up an expectation onnot technically feasible. (3) It must clearly define thepartareas of responsibility for theuser that simple passwords are acceptable. Therefore, a naive user is unlikely to selectusers, staff, and administrators. The components of amore robust password, sincegood security policy include: (1) Computer Technology Purchasing Guidelines which specify required, or preferred, security features. These should supplement existing purchasing policies and guidelines. (2) A Privacy Policy which defines reasonable expectations of privacy regarding suchpasswords tendissues as monitoring of electronic mail, logging of keystrokes, and access tobe harderusers' files. (3) An Access Policy which defines access rights and privileges toremember. Because of this threat, the system administratorprotect assets from loss or disclosure by specifying acceptable use guidelines for users, operations staff, and management. It shouldfollow some sensibleprovide guidelines(such as the criteria shown below)forselecting "good" initial passwords for the users of the system, and for selectingexternal connections, data communications, connecting devices to agood root password as well. The system administratornetwork, and adding new software to systems. It should also specify any required notification messages Site Security Handbook Working Group [Page 5] Internet Draft Site Security Handbook November 1995 (e.g., connect messages should provideusers with guidance on choosing their own robust passwords. 3.1.3.4 Password selection criteria Many users give little thought to what makes a "good" password,warnings about authorized usage andwould be surprised at just how easy some passwords are for intruders to guess. Unfortunately,line monitoring, and not simply say "Welcome"). (4) An Accountability Policy which defines theincreasing powerresponsibilities ofcomputing is making password "cracking" far easier than ever before.users, operations staff, and management. Itis now fairly trivialshould specify an audit capability, and provide incident handling guidelines (i.e., what tolaunchdo and who to contact if abrute-force attack on anypossible intrusion is detected). (5) An Authentication Policy which establishes trust through an effective password policy, and by setting guidelines for remote location authentication and the use offour characters or less. Also,authentication devices (e.g., one-time passwords and the devices that generate them). (6) An Availability statement which sets users' expectations for theincreasingavailability ofdisk space is allowing intruders access to complete dictionaries of not only English, but many foreign languagesresources. It should address redundancy and recovery issues, aswell. Some password selection criteriawell as specify operating hours and maintenance down-time periods. It should also include contact information for reporting system and network failures. (7) A Violations Reporting Policy thatcan reasonablyindicates which types of violations (e.g., privacy and security, internal and external) must beincludedreported and to whom the reports are made. A non-threatening atmosphere and the possibility of anonymous reporting will result in apassword policy are: o The minimum password length should be 7-8 characters . o The password should not begreater probability that apermutationviolation will be reported if it is detected. (8) Supporting Information which provides users, staff, and management with contact information for each type ofany user accountpolicy violation; guidelines how handle outside queries about a security incident orpersonalinformation which may be considered confidential or proprietary; and cross-references to security procedures and related information, such asname, office, telephone number, or socialcompany policies and regulatory requirements (federal, state, and local). There may be regulatory requirements that affect some aspects of your securitynumber. opolicy (e.g., line monitoring). Thepasswordcreators of the security policy shouldnot be contained in any English or foreign dictionaryconsider seeking legal assistance inregular use. This includes real/imaginary character names or terms from folklore. o The password should not consistthe creation ofacronyms or project-specific information relating totheuser's job or function. While these criteria may seem overly restrictive, there are many schemes that will generate hard to guess passwords that are still easy to remember. One example is to takepolicy. At awell known phrase and useminimum, thefirst or second letter of each word to formpolicy should be reviewed by legal counsel. Once yourpassword. Another iscomputer security policy has been established it should be clearly communicated touse two short words separated byusers, staff, and management. Having all personnel sign acontrol or punctuation character. 3.1.3.5 Password aging Whenstatement indicating that they have read, understood, andhowagreed toexpire passwordsabide by the policy isstill a subjectan important part ofcontroversy amongthesecurity community. It is generally accepted that a passwordprocess. Finally, your policy shouldnotbemaintained once an account is no longer in use, butreviewed on a regular basis to see if it ishotly debated whethersuccessfully supporting your security needs. 3. ARCHITECTURE 3.1 Objectives 3.1.1 Completely defined security plans Site Security Handbook Working Group [Page 6] Internet Draft Site Security Handbook November 1995 Defining ausercomprehensive security plan should beforced to changedone by all sites. This plan should be at agood password that's in active use. The arguments for changing passwords relate to the prevention of the continued use of penetrated accounts. However,higher level than theopposition claims that frequent password changes lead to users writing down their passwordsspecific policies discussed invisible areas (suchsection 2. It should be crafted aspasting them to a terminal), or to users selecting very simple passwords that are easy to guess. While there is no definitive answer to this dilemma,apassword policy should directly address the issue and provideframework of broad guidelinesfor how often a user should change the password.into which specific policies will fit. It isrecommendedimportant to have this framework in place so thatpasswordsindividual policies can bechanged whenever root is penetrated, there isconsistant with the overall site security architecture. For example, having acritical change in personnel (especially if itstrong policy with regard to Internet access, but having weak restrictions on modem usage isthe system administrator!), or wheninconsistent with anaccount has been compromised. In particular, if the root password is compromised, all passwordsoverall philosophy of strong security restrictions onthe systemexternal access. A security policy should contain, at a minimum: a list of services which are currently, or will forseably be, provided; who will have access to those services; how access will bechanged. In addition, an annual change in their passwordprovided; who will administer those services; etc. It isusually not difficult for most users, and you should consider requiring it. 3.1.3.6 Machine generated passwords An alternativealso imperative toallowing usersdefine any limitations on which portions of an organization can provide services. Another aspect of theopportunityplan should concern incident handling. Chapter 5 provides an in-depth discussion of responses toselect easily guessed passwordsincidents, but it is important torequire machine generated passwords. These have historically been a problem due to the difficultydefine classes ofremembering such passwords. As a result, they were often written down,incidents andtherefore more easily stolen. Another alternative isdefine responses topresent a listeach class ofseveral machine generated passwords and allow the userincident. For sites with firewalls, how many attempts toselectfoil theone that's easiest for him or her to remember. It is recommended that if machine generated passwords are mandated by your policy, then users should not be requiredfirewall will trigger a response? Are there levels of escallation in both attacks and responses? For sites without firewalls, does a single attempt tochange passwords very frequently. (For example, not more than once every six months.) 3.1.3.8 Multiple accounts It is now commonplace forconnect constitute an incident? How about ausersystematic scan of machines? For sites connected tohave accounts on many different systems. One problem is that ifthesame password is used on all of these systems,Internet, thepenetrationrampant media glorification ofone system will easily spread to the others. It is therefore recommended thatInternet related security incidents can overshadow auser should(potentially) more serious internal security problem. Likewise, companies who havedifferent passwordsnever been oneach system. However, ifthe Internet before, may have strong, well defined, internal policies but fail to adequately address external connection policy. 3.1.2 Separation of Services There are many services which auser planssite may wish touse the same password on more than one system, these systems should all be in the same administrative domain, and should all be running the same operating system. 3.1.3.9 Toolsprovide forpassword assuranceits users, some of which may be external. There aretools to aida variety of securityadministrator in assuring robust passwords. These tools fall generally into two categories: "password cracking" and "password filtering" programs. Password cracking programs take the encrypted password field andreasons to attempt toperform the same task an intruder would try -- guessing the password from dictionaries and common words. This is a lengthy process for anyisolate services onto dedicated machines. There are also performance reasons in most cases, butthe simplest of schemes since the guesses havea detailed discussion is beyond tobe encrypted once for each compare against the encrypted password entry. However, the intruderscope of this document. The services which a site may provide will, in most cases, have different levels of accessto better dictionariesneeds andmore powerful systems than the security administrator. Nonetheless, password cracking programs provide a mechanism for assuring that no user has a password that is contained in your own dictionaries. Examplesmodels oftools that provide password crackingtrust. Services which are"crack" and "cops". Crack is a tool that is completely devotedessential topassword crackingthe security andprovidessmooth operation of amechanism for variationssite would be better off being placed ondictionary words, and also provides the ability to distribute the processing overanetwork of computers to take advantage of parallelism. Cops isdedicated machine with very limited access restrictions (see Section 3.1.3 "deny all" model), rather than amore general security tool thatmachine which provides a service (or services) which haspassword cracking as one of its functions.traditionally been less secure, or requires greater accessability by users who may accidentally suborn security. It isnot as complete as crack, but serves as a first levelalso important to distinguish between machines which operate within different models ofprotection for a system. Password filtering programs are replacements for the system utility that is invoked to change a user's password. In this scheme,trust, say all theuser submitsmachines inside of aplain-text password toSite Security Handbook Working Group [Page 7] Internet Draft Site Security Handbook November 1995 firewall, and any machines on an exposed network. Some of thetool,services whichchecksshould be examined for potential separation are outlined below. More specific information will be presented in section 3.3 (????). It is important to try to understand that vulnerability is only as strong as thepassword against some criteria before changingweakest link in theencrypted password onchain. Several of thesystem. This schememost publicized penetrations in recent years has been through theadvantage that more tests can be run againstelectronic mail systems of machines. The intruders were not trying to steal electronic mail, but they used theplain-text password (such as a check for recurring patterns or reverse words)vulnerability in thatwould be very expensivesystem tocheck for in a password cracking program.gain access to other systems. Ifsuchpossible, each service should be running on atooldifferent machine whose only duty isinstalled, it should completely replace the password changingto provide a specific service. This help isolate intruders andsetting mechanism so that no user can bypasslimit potential harm. 3.1.2.1 Name Servers (DNS and NIS(+)) The Internet uses theprogramDomain Name System (DNS) to perform name to address resolution. The Network Information Service (NIS) andchangeNIS+ are not used on thepasswordglobal Internet but are subject to the same risks internally as aless robust string. An example of a tool that provides password filteringDNS server. Name-to-address resolution is"npasswd". 3.2 Authorization Authorization referscritical to theprocesssecure operation ofgranting privilegesany network. An attacker who can successfully control or impersonate a DNS server can route traffic toprocesses and ultimately users. This differs from authentication in that authentication is what occursquickly subvert numerous security guards. Routine traffic can be diverted toidentifyauser. Once identified (reliably), the privileges, rights, property,compromised system andpermissible actionstraffic can be monitored or users can trivially be tricked into offering authentication secrets. An organization would do well to arrange well known and protected sites to act as secondary name servers, and also protect their own DNS masters from denial ofthe user are determined by authorization. <Should "objects"service attacks using filtering routers. 3.1.2.2 Password/Key Servers (NIS(+) and"entities" be defined here?> Explicity listing the authorized activitiesKDC) Machines used for password or key servers protect some ofeach user (and user process) with respectthe most valuable information toall resources (objects) is impossible in a reasonable system. Inareal system certain techniques arepotential intruder, the encrypted passwords and/or keys, which can be used tosimplifyaccess theprocesssite. Since many current schemes ofgranting and checking authorization(s). One approach, popularized in UNIX systems, is to assignencryption are either subject toeach object three classes of user - the super user, the owner, and the group. Super user,dictionary orroot, is an entity that has accessbrute force attacks, every effort must be made toall portions (and objects) of the computer.secure these machines. 3.1.2.3 Authentication/Proxy Servers (SOCKS, FWTK) Theowneruse ofan object is the "user" who either created the object or was given it by the super user.proxy servers as a security technique will only continue to expand. Agroup isproxy server provides acollectionnumber ofusers that share privileges oversecurity enhancements. It allows sites to concentrate services through acollection of objects. Groups ease authorization management by simplifying the process of changing the authorization of users and by changing the authorityspecific machine for monitoring, hiding ofa group to manage an object. Another approach is to attach tointernal structure, etc. This funnelling effect creates anobjectattractive target for alist which explicitly contains the identitypotential intruder. 3.1.2.4 Email Electronic mail systems have traditionally been one source for intruder break-ins. Most email servers typically accept input from any source. Most email systems consist of two parts, a receiving/sending agent and a processing agent. Since email is Site Security Handbook Working Group [Page 8] Internet Draft Site Security Handbook November 1995 delivered to allpermittedusers(or groups). Thisand isan Access Control List. The advantageusually private, the processing agent typically requires system privileges to deliver the mail. Most email implementations perform both portions ofthese are that they are easily maintained (one central list per object). <NOTE: What other methods (short phrasesthe service, which means the receiving agent has a direct link to system privileges. There areall that's needed, I probably know what they refer to) should be mentioned... > <NOTE: Shouldseveral packages available which allow a separation of theprocesstwo pieces. 3.1.2.5 World Wide Web (WWW) The WWW is growing in popularity exponentially because ofdeciding what authorizations areits ease of use and the powerful abilities tobe granted be described here (orconcentrate information services. Most WWW servers take some directions and actions from the persons accessing their services. The most common example isthat in policies)? Should mechanisms be described here? > 3.3 Access 3.4 Modems 3.4.1 Modem lines musttaking a request from a remote user and passing the provided information to a program running on the server to process the request. Those programs can bemanaged. Althoughsubverted easily if theyprovide convenient accessare not written with security in mind. 3.1.2.6 FTP FTP allows users to receive and send electronic files in asite for its users, they can also provide an effective detour around the site's firewalls. For this reason it is essentialpoint tomaintain proper control of modems. Don'tpoint manner. Improperly configured ftp servers can allowusersintruders toinstallcopy or replace files at will from throughout amodem line without proper authorization. This includes temporary installations, e.g. plugging a modem into a facsimilesystem. Access to encrypted passwords, proprietary data, ortelephone line overnight. Maintainthe introduction of trojan horses are just aregisterfew of the possibilities. 3.1.3 Deny all/ Allow allyour modem lines. Conduct regularThere are two diametrically oppossed underlying philosophies which can be adopted in defining a security plan. Both alternatives are legitimate models to adopt, depending on the sitechecksand its needs forunauthorized modems; keep your register upsecurity. The first option is todate. 3.4.2 Dial-in users must be authenticated. A usernameturn off all services andpassword check should be completed before a user can access anythingthen selectively enable services onyour network. Normal password security considerations (sucha case by case basis, be it at the machine or network level, aschoosing passwords which don't appear in dictionaries and changing them from time to time)they areparticularly important. Remember that telephone lines canneeded. This model, which will here after betapped, and that it is quite easy to intercept messagesreferred tocellular 'phones. Modern high-speed modems use more sophisticated modulation techniques -, which makes them somewhatas the "deny all" model, is generally moredifficult to monitor - but itsecure. More work isprudent to assume that hackers know howrequired toeavesdrop on your lines. For this reason you should use one-shot passwords (e.g. skey) or hardware authentication devices (e.g. SecureID) if this is at all possible. It is helpfulsuccessfully implement a "deny all" configuration and usually a better understanding of services; which may require several pieces operating together tohavefunction correctly. Only allowing known services allows asingle dial-in point, e.g.better analysis of asingle large modem pool, so that all users are authenticated inparticular service/protocol, and thesame way. Users will occasionally mis-type a password. Setdesign of ashort delay - say two seconds - aftersecurity mechanism suited to thefirst and second failed logins, and force a disconnect aftersecurity level of thethird. Thissite. The other model, which willslow down automated password attacks. Don't tell the user whether the username, the password or both were incorrect. 3.4.3 All logins (successful and unsuccessful) shouldhere after belogged. Don't keep correct passwords inreferred to as thelog, but consider keeping incorrect passwords"allow all" model, is much easier toaid in detecting password attacks. However, bearimplement, but is inmind that most incorrect passwords are correct passwords with one character mistyped, and may suggestgeneral less secure than thereal password. If you can't keep this information secure, don't log it"deny all" model. Simply turn on all services, usually the default atall. If Calling Line Identification is available, take advantage of it by recordingthecalling number for each login attempt. Be sensitivehost level, and allow all protocols to travel across network boundaries, usually theprivacy issues raised by Calling Lne Identification. Alsodefault at the router level. As security holes become apparent, they are patched at either the host or network level. Each of these models can beaware that Calling Line Identification is notapplied to different portions of the site, depending on functionality requirements, administrative Site Security Handbook Working Group [Page 9] Internet Draft Site Security Handbook November 1995 control, site policy etc. For example, the policy may betrusted;to use thedata for informational purposes only, not"allow all" model when setting up workstations forauthentication. <NOTE: it was suggested that we add " Caller ID data can be read from a compatible modem viageneral use, but adopt aserial line" > 3.4.5 Minimize the amount of"deny all" model when setting up informationgiven in your opening banner. In particular, don't announce the type of host hardware or operating system - this encourages specialist hackers. Display a short banner, but don't offerservers, like an'inviting' name (e.g. University of XYZ, Student Records System). Instead, give your site name, a short warning that all sessions are monitored, and a username/password prompt. Get your site's lawyers to check your banner to make sure it states your legal position correctly. For high-security applications, consider using a 'blind' password, i.e. give no response toemail hub. Likewise, anincoming call until"allow all" policy may be adopted for traffic between LAN's internal to theuser has typed in (without any echoing) a password. This effectively simulatessite, but adead modem. 3.4.6 Call-back Capability Some dial-in servers offer call-back facilities, i.e."deny all" policy can be adopted between theuser dials insite andis authenticated, thenthesystem disconnectsInternet. Be careful when mixing philosophies as in thecallexamples above. Many sites adopt the M & M theory of a hard "crunchy" shell andcalls back onaspecified number. You will probably havesoft "squishy" middle. They are willing to pay thechargescost of security forsuch calls.their external traffic and require strong security measures, but are unwilling or unable to provide similar protections internally. Thisfeature should be used with caution; itworks fine as long as the outer defenses are never breached and the internal users caneasilybebypassed. As a minimum, make sure thattrusted. Once thereturn callouter shell (firewall) isnever made frombreached, subverting thesame modem as the incoming one. Overall, although call-back can improve modem security, you should not dependinternal network is trivial. 3.1.4 identify real needs for services There is a large variety of services which may be provided, both internally and onit alone. 3.4.7 Dial-out authentication Dial-out users should also be authenticated, particularly since your site will havethe Internet at large. Managing security is in many ways managing access topay their telephone charges. Never allow dial-out from an unauthenticated dial-in call,services internal to the site andconsider whether you will allow it from an authenticated one. The goal here ismanaging how internal users access information at remote sites. Services tend toprevent callers using your modem poolrush like waves over the Internet. Over the years many sites have established anonymous ftp servers, gopher servers, wais servers, www servers, etc. aspart ofthey became popular but not particularly needed at all sites. Evaluate all new services that are established with achain of logins. Thisskeptical attitude to determine if they are actually needed or just the current fad sweeping the Internet. Bear in mind that security complexity can grow exponentially with the number of services provided. Filtering routers need to behardmodified todetect, particularly if a hacker sets up a path through several hostssupport the new protocols. Some protocols are inherently difficult to filter safely (ex. rpc and udp services), thus providing more openings to the internal network. Services provided onyour site. As a minimum, don't allowthe samemodems and phone lines to be used for both dial-in and dial-out. Thismachine canbe implemented easily if you run separate dial-in and dial-out modem pools. 3.4.8 Make your modem programming as 'bullet-proof' as possible. Be sure modems can't be reprogrammed while they'reinteract inservice. As a minimum, make sure that three plus signs won't put your dial-in modems into command mode! Program your modemscatastrophic ways. (ex. allowing anonymous ftp on the same machine as the www server may allow an intruder toresetplace a file in the anonymous ftp area and cause the http server toyour standard configuration atexecute it.) 3.2 Network and Service Configuration 3.2.1 Protecting thestart of each new call. Failing this, make them reset at the end of each call. This precaution willInfrastructure Many network administrators go to great lengths to protectyou against accidental reprogramming of your modems. Check that your modems terminate calls cleanly. When a user logs out from an access server, verify thattheserver hangs uphosts on their networks. Few administrators make any effort to protect the'phone line properly. Itnetworks themselves. There isequally important that the server forces logouts from whatever sessions were active ifsome rationale to this. For example, it is far easier to protect a host than a network. Also, intruders are likely to be after data on theuser hangs up unexpectedly. 3.5 Cryptography 3.6 Auditing This section covershosts; damaging theprocedures for collecting data generated bynetworkactivity that may be useful in analyzingwould not serve their purposes. That said, there are still reasons to protect thesecurity of anetworks. For example, an intruder might divert networkand/or usefultraffic through an outside host inrespondingorder toaexamine the data (i.e., to search for passwords). Also, infrastructure includes more than the networks and the routers which interconnect them. Infrastructure Site Security Handbook Working Group [Page 10] Internet Draft Site Security Handbook November 1995 also includes network management (e.g., SNMP), services (e.g., DNS, NFS, NTP, WWW), and securityincident. This section(i.e., user authentication and access restrictions). The infrastructure alsocovers the handling, preservation,needs protection against human error. When an administrator misconfigures a host, that host may offer degraded service. This only affects users who require that host, andutilization ofunless that host is a primary server, thedata. (Thisnumber of affected users will therefore bereworked as I developlimited. However, if a router is misconfigured, all users who require theremainder.) 3.6.1 What to collect Audit data should include any attempt to achievenetwork will be affected. Obviously, this is adifferent security levelfar larger number of users than those depending on anyperson, process, or other entity inone host. 3.2.2 Protecting thenetwork.Network There are several problems to which networks are vulnerable. Themost obvious example in this areaclassic is alog"denial ofattempts to loginservice" attack. In this case, the network is brought to ahost computer. Audit data should also include data pertaining to any "public" or anonymous access and retrieval of data, at least to the granularity ofstate in which it can no longer carry legitimate users' data. There are two common ways this can be done: by attacking the"remote" host. And onrouters andon... <NOTE: I'd appreciate suggested logs items (and what level of detail is intended). > 3.6.2 Collection Process The collection process should be enactedby flooding thehost or resource being accessed. Dependingnetwork with extraneous traffic. An attack on theimportance of the data and the needrouter is designed tohavecause itlocal in instances in which services are being denied, data could be kept localtothe resource until neededstop forwarding packets, orbe transmittedtostorage after each event. Reporting data canforward them improperly. The former case may bedone by writing to a file, writingdue to aline printer, writing overmisconfiguration, the injection of anetwork,spurious routing update, orwriting overanon-network port, such as"flood attack" (i.e., the router is bombarded with unroutable packets, causing its performance to degrade). A flood attack on aconsole port. Each of these has importance. File system loggingnetwork is similar to a flood attack on a router, except that theleast resource intensiveflood packets are usually broadcast. An ideal flood attack would be the injection ofall four candidates (foragiven audit log). It is also the least reliable. If a resource has been compromised, the file system issingle packet which exploits some known flaw in thefirstnetwork nodes and causes them togo. Ifretransmit thenetwork in frontpacket, or generate error packets, each ofthe resource has become unusable, the data is inaccessible, unless a direct console port is available. Line printer loggingwhich isuseful in system where permanentpicked up andimmediate logs are required.repeated by another host. Areal time system iswell chosen attack packet can even generate anexample of this, where the exact pointexponential explosion ofa failuretransmissions. Another classic problem is "spoofing." In this case, spurious routing updates are sent to one or more routers causing them to misroute packets. This differs from a denial of service attackmust be recorded. A laser printer, or other device which buffers data betweenonly in theauditing system and storage device may suffer from lost data if buffers containpurpose behind theneeded data at a critical instant. Reporting overspurious route. In denial of service, the object is to make the router unusable; a state which will be quickly detected by networkprovides for allowingusers. In spoofing, the spurious route will cause packets to be routed to aremotehostto storefrom which an intruder may monitor the data ina more permanent and possibly more reliable manner.the packets. These packets are then be re-routed to their correct destinations. However,this consumes bandwidth (attheminimum), exposesintruder may or may not have altered theaudit data in an easy package to a interloper, and could be lost during network denial. Reporting over a console port ensurescontents of thedeliverypackets. The solution to most of these problems is to protect thedata followsrouting update packets sent by thehardware design. The limitationsrouting protocols in use (e.g., RIP-2, OSPF). There arethat the console port requires physical securitythree levels of protection: clear-text password, cryptographic checksum, andusing console ports on machines more than a short distance away (e.g., across a campus) may require a phone line in additionencryption. Passwords offer only minimal protection against intruders who do not have direct access to thenetwork connection. Inphysical networks. Passwords also offer someinstances, thisprotection against misconfigured routers (i.e, routers which, out of the box, attempt to route packets). The advantage of passwords isone more resourcethatmay be constrained. 3.6.3 Collection Load Collecting running data may result inthey have aquick accumulation of bytes. Storage of this must be consideredvery low overhead, inadvance. There are a few ways to limitboth bandwidth and CPU consumption. Checksums protect against therequired storage space. Data may be compressed using oneinjection ofmany methods. Another approach isspurious packets, even if the intruder has direct access toonly archive summaries of activity (possibly losing some detail intheprocess). Data may be archived for justphysical network. Combined with afixed period of time, thenSite Security Handbook Working Group [Page 11] Internet Draft Site Security Handbook November 1995 sequence number, or other unique identifier, a checksum can also protect again "replay" attacks, wherein an old (but valid at the time) routing update isit permanently removed.retransmitted by either an intruder or a misbehaving router. Theissue of archivingmost securitydata differs from archiving network management and application data. Network management data (statistics) can be reducedis provided byalteringcomplete encryption of sequenced, or uniquely identified, routing updates. This prevents an intruder from determining thereporting period. Security data does not have that option (fortopology of themost part). Security data also does not havenetwork. The disadvantage to encryption is thepermanence of application data. 3.6.4 Handlingoverhead involved in processing theData/Preservation Security data should be protected as least as much as any other data is protected because from it, much can be inferred. Security data may give away enough secrets to allow a "masquerader"updates. RIP-2 (RFC 1723) and OSPF (RFC 1583) both support clear-text passwords in their base design specifications. In addition, there are extensions toimpersonate an authorized administrator. Data may also become keyeach base protocol tothe investigation, apprehension,support MD5 encryption. Unfortunately, there is no adequate protection against a flooding attack, orprosecutiona misbehaving host or router which is flooding the network. Fortunately, this type of attack is obvious when it occurs and can be terminated relatively simply. 3.2.3 Protecting theperpetratorServices There are many types ofan incident. Becauseservices; each has its own security requirements. These requirements will vary based on the intended use ofthis,thedata needs to be protected and clearly documented.service. Forthis reason itexample, a service which should only be usable within a site (e.g., NFS) requires little protection. That is, protecting the server from external access isadvisablesufficient toseekprotect theadvice of local legal council or law enforcement when deciding howservice. However, a WWW server, which provides a home page intended for viewing by users anywhere on the Internet, requires built-in protection. That is, the service/protocol/server must provide whatever securitydata ismay be required to prevent unauthorized access and modification of the Web database. Internal services (i.e., services meant to betreated. This should happen before an incident occurs. Ifused only by users within adata handling plan is not cleared priorsite) and external services (i.e., services deliberately available toan incident, this does not mean that it is useless. It means two things. You may not have recourseusers outside a site) will, in general, have protection requirements which differ as previously described. It is therefore wise to isolate theaftermathinternal services to one set ofan event. You may also me liable for penalties resulting from your treatment of the data too. 3.6.5 Audit Data Precautions In certain instances, audit data may contain personal information. Searching through the data, even for just a routine check of the network's security could present an invasion of privacy or makeserver machines and theauditing entity privyexternal services toinformation itanother set of server machines. That is, internal and external servers should not beallowedco-located. In fact, many sites go so far as tohave. Note that this is not automatically true - not all data is "sensitive" and "sensitive" differs by locale. Another danger presented by auditing data is that it may reveal a patternhave one set ofincidents. If an organization knows aboutsubnets (or even different networks) which are accessible from theincidents but permits them to continueoutside andthis results in damage toanotherorganization ("downstream"), legal action could result. An organizationset which mayalsobeliable for not (making a "best effort" to) analyze this data for incidents. 3.7 Backups 4. ARCHITECTURE 4.1 Objectives 4.2 Service Configurations 4.3 Network Configurations Topology Infrastructure elements (DNS, mail hub, information servers, etc.) Network management 4.4 Firewalls 5. INCIDENT HANDLING This section ofaccessed only within thedocument will supply guidance to be applied before, during and after a computer security incidentsite. Of course, there isin progress on a machine, network, site, or multi-site environment. The operative philosophy in the event ofusually abreach of computer security is to react accordingfirewall which connects these partitions. Great care must be taken to ensure that such aplan. This is true whether the breachfirewall isthe resultoperating properly. One form ofanexternalintruder attack, unintentional damage, a student testingservice deserves somenew program to exploit a software vulnerability,special consideration, and that is anonymous, ora disgruntled employee. Each of the possible types of events described above shouldguest, access. This may beconsidered for an adequate contingency plan. Without a proactive approacheither anonymous FTP or guest (unauthenticated) login. It is extremely important toprotect the assets in case of an incident the handling process can not be as efficient as with well prepared procedures, methodsensure that anonymous FTP servers andpolicies in place. Traditional computer security, while quite important in the overall site security plan, usually falls heavily on protecting systemsguest login userids are carefully isolated fromattack,any hosts andperhaps monitoringfile systems from which outside users should be kept. Another area todetect attacks. Littlewhich special attentionis usuallymust be paid concerns anonymous, writable access. A site may be legally responsible forhow to actually handletheattack when it occurs. The resultcontent of publicly available information, so careful monitoring of the information deposited by anonymous users isthat when an attackadvised. Site Security Handbook Working Group [Page 12] Internet Draft Site Security Handbook November 1995 3.2.4 Protecting the Protection It is amazing how often a site will overlook the most obvious weakness inprogress, many decisions are made in hasteits security by leaving the security server itself open to attack. Based on considerations previously discussed, it should be clear that: the security server should not be accessible from off-site; should offer minimum access, except for the authentication function, to users on-site; andcanshould not bedamagingco-located with any other servers. Further, all access totracking downthesource of the incident, collecting evidencenode, including access to the service itself, should beusedlogged to provide a "paper trail" inprosecution efforts, preparing fortherecoveryevent of a security breach. 3.3 Firewalls 4. SECURITY PROCEDURES 4.1 Authentication For many years, thesystem, and protectingprescribed method for authenticating users has been through thevaluable data contained onuse of standard, reusable passwords. Originally, these passwords were used by users at terminals to authenticate themselves to a central computer. At thesystem. Onetime, there were no networks (internally or externally) and hence the risk of disclosure of themost important but often overlooked benefit for efficient incident handling is an economic one. Having both technicalcleartext password was minimal. Today, systems are connected together through local networks, andmanagerial personnel respond to an incident requires considerable resources, resources which could be utilized more profitably if an incident did not require their services. If these personnelthose local networks aretrainedfurther connected together, and tohandle an incident efficiently, less ofthe Internet. Users are logging in from all over the globe, and theirtime is required to deal with that incident. Duereusable passwords are often transmitted across those same networks in cleartext, ripe for anyone inbetween to capture. And indeed, theworldwide network mostCERT Coordination Center and other response teams are seeing a tremendous number oftheincidents involving packet sniffers which arenot restricted to one site only. Operating systems vulnerabilities apply (in some cases) to several millionscapturing the cleartext passwords. To address this threat, we are including sections on better technologies like one-time passwords, and Kerberos. With the advent ofsystemsnewer technologies like one-time passwords (e.g., S/Key), PGP, andmany vulnerabilitiestoken-based authentication devices, people are using password- like strings as secret tokens and pins. We are including a discussion on these since they areexploited withinthenetwork itself. Thereforefoundation upon which stronger authentication techniques are based. If these secret tokens and pins are not properly selected and protected, the authentication will be easily subverted. 4.1.1 One-Time passwords As mentioned above, given today's networked environments, it isvital for all sitesrecommended thatall involved parties are informed as soon as possible. Another benefit is related to public relations. Newssites concerned aboutcomputerthe security and integrity of their systems and networks consider moving away from standard, reusable passwords. There have been many incidentstends to be damaging to an organization's stature among current or potential clients. Efficient incident handling minimizesinvolving Trojan network programs (e.g., telnet and rlogin) and network packet sniffing programs. These programs capture cleartext hostname, account name, password triplets. Intruders can use thepotentialcaptured information fornegative exposure. A final benefit of efficient incident handling is relatedsubsequent access tolegal issues. Itthose hosts and accounts. This is possiblethat in the near future organizations may be suedbecauseone of their nodes was used to launch a network attack. In a similar vein, people who develop patches or workarounds may be sued if the patches or workarounds are ineffective, resulting in damage to systems, or if1) thepatches or workarounds themselves damage systems. Knowing about operating system vulnerabilitiespassword is used over andpatterns of attacksover (hence the term "reusable"), andthen taking appropriate measures is critical to circumventing possible legal problems. This chapter is arranged such2) the password passes across the network in Site Security Handbook Working Group [Page 13] Internet Draft Site Security Handbook November 1995 clear text. Several authentication techniques have been developed that address this problem. Among these techniques are challenge-response technologies that provide passwords that are only used once (commonly called one-time passwords). This document provides a list ofrelevant topics may be generated from the Table of Contents tosources for products that provide this capability. The decision to use astarting point for creating a policy for handling ongoing incidents. The main points to be included in a policy for handling incidents are: o Preparing and planning (what are the goals and objectives in handling an incident). o Notification (who should be contacted in case of an incident). o Evaluation (how seriousproduct is theincident). o Handling (what should be done when an incident occurs). This especially includes: - Notification (whoresponsibility of each organization, and each organization shouldbe notifiedperform its own evaluation and selection. 4.1.2 Kerberos <insert info aboutthe incident). - Containment (how can the damage be limited). - Eradication (eliminate the reasons for the incident). - Recovery (how to reestablish servicethis technology here> 4.1.3 Choosing andsystems). - Follow Up (what actions shouldProtecting Secret Tokens and Pins <to betaken afterfilled in> 4.1.4 Password Assurance While theincident). - Legal/Investigative implications (what areneed to eliminate thelegal and prosecutorial implicationsuse ofthe incident). - Documentation Logs (what records shouldstandard, reusable passwords cannot bekept from before, during, and afteroverstated, it is recognized that some organizations may have to transition to theincident). o Aftermath (overall implicationsuse ofpast incidents). o Responsibilities (for planningbetter technology. Given that situation, we have included the following advice to help with the selection andhandling an incident). Eachmaintenance of traditional passwords. But remember, none of thesepoints is important in an overall plan for handling incidents.measures provides protection against disclosure due to sniffer programs. (1) Theremainderimportance ofthis chapter will detail the issues involved in eachrobust passwords - In many (if not most) cases of system penetration, therelevant topics, and provide some guidance asintruder needs towhat should be included in a site policy for handling incidents. Guidelines for End User involvement in dealing with compromised accounts and vulnerabilities are covered in the corresponding "End User Security Handbook" [RFC xxx]. Especially interesting for Site Administrators which act as Site Security Contact in assisting other users and administrators in dealing with incidents aregain access to an account on the"Guidelines and Recommendations for Incident Processing" [RFC xxx]. 5.1 Preparingsystem, andPlanning for Incident Handling Part of handling an incidentone way that goal isbeing prepared to respond beforetypically accomplished is through guessing theincident occurs.password of a legitimate user. Thisincludes establishingis often accomplished by running an automated password cracking program, which utilizes asuitable level of protections as explained invery large dictionary, against thechapters before. Notsystem's password file. The onlyare incidents avoided throughway to guard against passwords being disclosed in thisprotection but if the incident becomes severe, the damage which can occurmanner islimited. Protection includes preparing incident handling guidelines as part of a contingency plan for your organization or site. Having written plans eliminates much ofthrough theambiguitycareful selection of passwords whichoccurs during an incident,cannot be easily guessed (i.e., combinations of numbers, letters, andwill lead to a more appropriatepunctuation characters). (2) Changing default passwords - Many operating systems andthorough set of responses. As explained forapplication programs are installed with default accounts and passwords. These must be changed immediately to something that cannot be guessed or cracked. (3) Restricting access to the password file - In particular, a sitespecific contingency plan in section xxx it is vital importantwants totestprotect theproposed plan before an incident actually occurs through 'dry runs'. Onceencrypted password portion of the file so that would-be intruders don't have them available for cracking. One effective technique is to use shadow passwords where the password field of the standard file contains asite has recovered from and incident, site policydummy or false password. The file containing the legitimate passwords are protected elsewhere on the system. (4) Password aging - When andprocedures should be reviewed to encompass changeshow toprevent similar incidents. If an incidentexpire passwords isbased on poor policy, and unlessstill a subject of controversy among thepolicy is changed, then onesecurity community. It isdoomed to repeat the past. Even withoutgenerally accepted that Site Security Handbook Working Group [Page 14] Internet Draft Site Security Handbook November 1995 a password should not be maintained once anincident,account is no longer in use, but itwould be prudent to review policies and procedures on a regular basis. Reviews are imperative due to today's changing computing environments. To improve this processis hotly debated whether aproblem reporting procedureuser should beimplementedforced todescribe,change a good password that's indetail, the incident and the solutionsactive use. The arguments for changing passwords relate to theincident. Each incident should be reviewed byprevention of thesite security subgroup to allow understandingcontinued use of penetrated accounts. However, theincident with possible suggestionsopposition claims that frequent password changes lead tothe site policy and procedures. Learningusers writing down their passwords in visible areas (such as pasting them torespond efficientlya terminal), or toan incident is important for numerous reasons: o protect the assets whichusers selecting very simple passwords that are easy toprotect by normal security in case of a worst event o protect your resources which couldguess. It should also beutilized more profitably if an incident did not require their services o take carestated that(government) regulations are complied with o preventan intruder will probably useof your systems against other systems (which could incur legal liability) o minimize the potential for negative exposure Asa captured or guessed password sooner rather than later, in which case password aging provides little if anyset of pre-planned procedures, attention must be placed on a set of goals for handling an incident. These goals will be prioritized differently depending on the site. The set of goals will be closely relatedprotection. While there is no definitive answer to this dilemma, a password policy should directly address thegoals for security in general. Therefore, the sameissue and provide guidelinesas in section xxx for security in general might be applied here. A specific set of objectives can be identifiedfordealing with incidents: o Figure out how it happened. o Find outhowto avoid further exploitations. o Avoid escalation and further incidents. o Recover from the incident. o Find out who did it. Due to the nature ofoften a user should change theincident there mightpassword. It is recommended that passwords be changed whenever root is penetrated, there is aconflict between analyzingcritical change in personnel (especially if it is theoriginal source of a problem instead of restoring systems and services.system administrator!), or when an account has been compromised. Inthis case overall goals (like assuringparticular, if theintegrity of (life) critical systems) might be the reason for not analyzing an incident. Of course thisroot password isan important management decision, butcompromised, allinvolved parties must be aware that without a analysispasswords on thesame incident may happen again. It is important to prioritize actions tosystem should betaken duringchanged. In addition, anincident wellannual change inadvance of the time an incident occurs. Sometimes an incident may be so complex that ittheir password isimpossible to do everything at onceusually not difficult for most users, and you should consider requiring it. 4.2 Authorization Authorization refers torespondthe process of granting privileges toit; priorities are essential. Although priorities will varyprocesses and ultimately users. This differs frominstitutionauthentication in that authentication is what occurs toinstitution, the following suggested priorities may serve asidentify astarting point for defining an organization's response: o Priority one -- protect human lifeuser. Once identified (reliably), the privileges, rights, property, andpeople's safety; human life always has precedence over all other considerations. o Priority two -- protect classified and/or sensitive data. Prevent exploitation of classified and/or sensitive systems, networks or sites. Inform effected classified and/or sensitive systems, networks or sites about already occurred penetrations. (Be awarepermissible actions ofregulations by your site orthe user are determined bygovernment) o Priority three -- protect other data, including proprietary, scientific, managerialauthorization. Should "objects" andother data, because loss"entities" be defined here?> Explicity listing the authorized activities ofdataeach user (and user process) with respect to all resources (objects) iscostlyimpossible interms of resources. Prevent exploitations of other systems, networks or sites and inform already effected systems, networks or sites about successful penetrations. o Priority four -- prevent damage to systems (e.g., loss or alteration ofa reasonable system. In a real systemfiles, damage to disk drives, etc.); damagecertain techniques are used tosystems can result in costly down time and recovery. o Priority five -- minimize disruptionsimplify the process ofcomputing resources; it is bettergranting and checking authorization(s). One approach, popularized inmany cases to shut a system down or disconnect from a network thanUNIX systems, is torisk damageassign todataeach object three classes of user - the super user, the owner, and the group. Super user, orsystems. An important implication for defining prioritiesroot, is an entity thatonce human life and national security considerations have been addressed, it is generally more important to save data than system software and hardware. Although it is undesirablehas access tohave any damage or loss during an incident, systems can be replaced;all portions (and objects) of theloss or compromisecomputer. The owner ofdata (especially classified data), however, is usually notanacceptable outcome under any circumstances. Another important concernobject is theeffect on other, beyond the systems and networks where"user" who either created theincident occurs. Additionally to government regulationsobject or was given it by the super user. A group isalways important to inform effected parties at soon as possible. Due toa collection of users that share privileges over a collection of objects. Groups ease authorization management by simplifying thefactprocess oflegal implications this topic should be addressed inchanging theplans to avoid further delaysauthorization of users anduncertainty for the administrators. Any plan for responding to security incidents should be guidedbylocal policies and regulations. Government and private sites that deal with classified material have specific rules that they must follow. The policies your site makes chooses how it reacts to incidents will shape your response. For example, it may make little sensechanging the authority of a group tocreate mechanismsmanage an object. Another approach is tomonitor and trace intruders if your site does not planattach totake action againstan object a list which explicitly contains theintruders if theyidentity of all permitted users (or groups). This is an Access Control List. The advantage of these arecaught. Other organizations may have policiesthataffect your plans. Telephone companies often release information about telephone traces onlythey are easily maintained (one central list per object). <NOTE: What other methods (short phrases are all that's needed, I probably know what they refer to) should be mentioned... > Site Security Handbook Working Group [Page 15] Internet Draft Site Security Handbook November 1995 <NOTE: Should the process of deciding what authorizations are tolaw enforcement agencies. You may also note that if any legal actionbe granted be described here (or isplanned, there are specific guidelinesthat in policies)? Should mechanisms be described here? > 4.3 Access 4.4 Modems 4.4.1 Modem lines must befollowedmanaged Although they provide convenient access tomake sure that any information collecteda site for its users, they canbe used as evidence. 5.2 Notification and Point of Contacts Italso provide an effective detour around the site's firewalls. For this reason it is essential to maintain proper control of modems. Don't allow users to install a modem line without proper authorization. This includes temporary installations, e.g. plugging a modem into a facsimile or telephone line overnight. Maintain a register of all your modem lines. Conduct regular site checks for unauthorized modems; keep your register up to date. 4.4.2 Dial-in users must be authenticated A username and password check should be completed before a user can access anything on your network. Normal password security considerations (such as choosing passwords which don't appear in dictionaries and changing them from time to time) are particularly important. Remember that telephone lines can be tapped, and that it is quite easy to intercept messages to cellular 'phones. Modern high-speed modems use more sophisticated modulation techniques -, which makes them somewhat more difficult to monitor - but it is prudent to assume that hackers know how to eavesdrop on your lines. For this reason you should use one-shot passwords (e.g. skey) or hardware authentication devices (e.g. SecureID) if this is at all possible. It is helpful to have a single dial-in point, e.g. a single large modem pool, so that all users are authenticated in the same way. Users will occasionally mis-type a password. Set a short delay - say two seconds - after the first and second failed logins, and force a disconnect after the third. This will slow down automated password attacks. Don't tell the user whether the username, the password or both were incorrect. 4.4.3 All logins (successful and unsuccessful) should be logged Don't keep correct passwords in the log, but consider keeping incorrect passwords to aid in detecting password attacks. However, bear in mind that most incorrect passwords are correct passwords with one character mistyped, and may suggest the real password. If you can't keep this information secure, don't log it at all. Site Security Handbook Working Group [Page 16] Internet Draft Site Security Handbook November 1995 If Calling Line Identification is available, take advantage of it by recording the calling number for each login attempt. Be sensitive to the privacy issues raised by Calling Lne Identification. Also be aware that Calling Line Identification is not to be trusted; use the data for informational purposes only, not for authentication. <NOTE: it was suggested that we add " Caller ID data can be read from a compatible modem via a serial line" > 4.4.5 Minimize the amount of information given in your opening banner In particular, don't announce the type of host hardware or operating system - this encourages specialist hackers. Display a short banner, but don't offer an 'inviting' name (e.g. University of XYZ, Student Records System). Instead, give your site name, a short warning that all sessions are monitored, and a username/password prompt. Get your site's lawyers to check your banner to make sure it states your legal position correctly. For high-security applications, consider using a 'blind' password, i.e. give no response to an incoming call until the user has typed in (without any echoing) a password. This effectively simulates a dead modem. 4.4.6 Call-back Capability Some dial-in servers offer call-back facilities, i.e. the user dials in and is authenticated, then the system disconnects the call and calls back on a specified number. You will probably have to pay the charges for such calls. This feature should be used with caution; it can easily be bypassed. As a minimum, make sure that the return call is never made from the same modem as the incoming one. Overall, although call-back can improve modem security, you should not depend on it alone. 4.4.7 Dial-out authentication Dial-out users should also be authenticated, particularly since your site will have to pay their telephone charges. Never allow dial-out from an unauthenticated dial-in call, and consider whether you will allow it from an authenticated one. The goal here is to prevent callers using your modem pool as part of a chain of logins. This can be hard to detect, particularly if a hacker sets up a path through several hosts on your site. As a minimum, don't allow the same modems and phone lines to be used for both dial-in and dial-out. This can be implemented easily if you run separate dial-in and dial-out modem pools. 4.4.8 Make your modem programming as 'bullet-proof' as possible Be sure modems can't be reprogrammed while they're in service. As a Site Security Handbook Working Group [Page 17] Internet Draft Site Security Handbook November 1995 minimum, make sure that three plus signs won't put your dial-in modems into command mode! Program your modems to reset to your standard configuration at the start of each new call. Failing this, make them reset at the end of each call. This precaution will protect you against accidental reprogramming of your modems. Check that your modems terminate calls cleanly. When a user logs out from an access server, verify that the server hangs up the 'phone line properly. It is equally important that the server forces logouts from whatever sessions were active if the user hangs up unexpectedly. 4.5 Cryptography 4.6 Auditing This section covers the procedures for collecting data generated by network activity that may be useful in analyzing the security of a network and/or useful in responding to a security incident. This section also covers the handling, preservation, and utilization of the data. (This will be reworked as I develop the remainder.) 4.6.1 What to collect Audit data should include any attempt to achieve a different security level of any person, process, or other entity in the network. The most obvious example in this area is a log of attempts to login to a host computer. Audit data should also include data pertaining to any "public" or anonymous access and retrieval of data, at least to the granularity of the "remote" host. And on and on... <NOTE: I'd appreciate suggested logs items (and what level of detail is intended). > 4.6.2 Collection Process The collection process should be enacted by the host or resource being accessed. Depending on the importance of the data and the need to have it local in instances in which services are being denied, data could be kept local to the resource until needed or be transmitted to storage after each event. Reporting data can be done by writing to a file, writing to a line printer, writing over a network, or writing over a non-network port, such as a console port. Each of these has importance. Site Security Handbook Working Group [Page 18] Internet Draft Site Security Handbook November 1995 File system logging is the least resource intensive of all four candidates (for a given audit log). It is also the least reliable. If a resource has been compromised, the file system is the first to go. If the network in front of the resource has become unusable, the data is inaccessible, unless a direct console port is available. Line printer logging is useful in system where permanent and immediate logs are required. A real time system is an example of this, where the exact point of a failure or attack must be recorded. A laser printer, or other device which buffers data between the auditing system and storage device may suffer from lost data if buffers contain the needed data at a critical instant. Reporting over the network provides for allowing a remote host to store data in a more permanent and possibly more reliable manner. However, this consumes bandwidth (at the minimum), exposes the audit data in an easy package to a interloper, and could be lost during network denial. Reporting over a console port ensures the delivery of the data follows the hardware design. The limitations are that the console port requires physical security and using console ports on machines more than a short distance away (e.g., across a campus) may require a phone line in addition to the network connection. In some instances, this is one more resource that may be constrained. 4.6.3 Collection Load Collecting running data may result in a quick accumulation of bytes. Storage of this must be considered in advance. There are a few ways to limit the required storage space. Data may be compressed using one of many methods. Another approach is to only archive summaries of activity (possibly losing some detail in the process). Data may be archived for just a fixed period of time, then is it permanently removed. The issue of archiving security data differs from archiving network management and application data. Network management data (statistics) can be reduced by altering the reporting period. Security data does not have that option (for the most part). Security data also does not have the permanence of application data. 4.6.4 Handling the Data/Preservation Security data should be protected as least as much as any other data is protected because from it, much can be inferred. Security data may give away enough secrets to allow a "masquerader" to impersonate an authorized administrator. Data may also become key to the investigation, apprehension, or prosecution of the perpetrator of an incident. Because of this, the data needs to be protected and clearly documented. For this reason it is advisable to seek the advice of local legal council or law enforcement when deciding how security data is to be treated. This Site Security Handbook Working Group [Page 19] Internet Draft Site Security Handbook November 1995 should happen before an incident occurs. If a data handling plan is not cleared prior to an incident, this does not mean that it is useless. It means two things. You may not have recourse in the aftermath of an event. You may also me liable for penalties resulting from your treatment of the data too. 4.6.5 Audit Data Precautions In certain instances, audit data may contain personal information. Searching through the data, even for just a routine check of the network's security could present an invasion of privacy or make the auditing entity privy to information it should not be allowed to have. Note that this is not automatically true - not all data is "sensitive" and "sensitive" differs by locale. Another danger presented by auditing data is that it may reveal a pattern of incidents. If an organization knows about the incidents but permits them to continue and this results in damage to another organization ("downstream"), legal action could result. An organization may also be liable for not (making a "best effort" to) analyze this data for incidents. 4.7 Backups 5. SECURITY INCIDENT HANDLING This section of the document will supply guidance to be applied before, during, and after a computer security incident is in progress on a machine, network, site, or multi-site environment. The operative philosophy in the event of a breach of computer security is to react according to a plan. This is true whether the breach is the result of an external intruder attack, unintentional damage, a student testing some new program to exploit a software vulnerability, or a disgruntled employee. Each of the possible types of events described above should be addressed by an adequate contingency plans. Without a proactive approach to protect the assets in case of an incident the handling process can not be as efficient as with well prepared procedures, methods and policies in place. Traditional computer security, while quite important in the overall site security plan, usually falls heavily on protecting systems from attack, and perhaps monitoring systems to detect attacks. Little attention is usually paid for how to actually handle the attack when it occurs. The result is that when an attack is in progress, many decisions are made in haste and can be damaging to tracking down the source of the incident, collecting evidence to be used in prosecution efforts, preparing for the recovery of the system, and protecting the valuable data contained on the system. One of the most important but often overlooked benefit for efficient incident handling is an economic one. Having both technical and managerial personnel respond to an incident requires considerable resources, resources which could be utilized more profitably if an Site Security Handbook Working Group [Page 20] Internet Draft Site Security Handbook November 1995 incident did not require their services. If these personnel are trained to handle an incident efficiently, less of their time is required to deal with that incident. Due to the worldwide network most of the incidents are not restricted to one site only. Operating systems vulnerabilities apply (in some cases) to several millions of systems and many vulnerabilities are exploited within the network itself. Therefore it is vital for all sites that all involved parties are informed as soon as possible. Another benefit is related to public relations. News about computer security incidents tends to be damaging to an organization's stature among current or potential clients. Efficient incident handling minimizes the potential for negative exposure. A final benefit of efficient incident handling is related to legal issues. It is possible that in the near future organizations may be sued because one of their nodes was used to launch a network attack. In a similar vein, people who develop patches or workarounds may be sued if the patches or workarounds are ineffective, resulting in damage to systems, or if the patches or workarounds themselves damage systems. Knowing about operating system vulnerabilities and patterns of attacks and then taking appropriate measures is critical to circumventing possible legal problems. This chapter is arranged such that a list of relevant topics may be drawn from the following outline, to provide a starting point for creating a policy for handling ongoing incidents. The main points to be included in a policy for handling incidents are: (1) Preparing and planning (what are the goals and objectives in handling an incident). (2) Notification (who should be contacted in case of an incident). (3) Evaluation (how serious is the incident). (4) Handling (what should be done when an incident occurs). This especially includes: - Notification (who should be notified about the incident). - Containment (how can the damage be limited). - Eradication (eliminate the reasons for the incident). - Recovery (how to reestablish service and systems). - Follow Up (what actions should be taken after the incident). - Legal/Investigative implications (what are the legal and prosecutorial implications of the incident). - Documentation Logs (what records should be kept from before, during, and after the incident). (5) Aftermath (overall implications of past incidents). (6) Responsibilities (for planning and handling an incident). Each of these points is important in an overall plan for handling incidents. The remainder of this chapter will detail the issues involved in each of the relevant topics, and provide some guidance as to what should be included in a site policy for handling incidents. Guidelines for End User involvement in dealing with compromised Site Security Handbook Working Group [Page 21] Internet Draft Site Security Handbook November 1995 accounts and vulnerabilities are covered in the corresponding "End User Security Handbook" [RFC xxx]. Especially interesting for Site Administrators which act as Site Security Contact in assisting other users and administrators in dealing with incidents are the "Guidelines and Recommendations for Incident Processing" [RFC xxx]. 5.1 Preparing and Planning for Incident Handling Part of handling an incident is being prepared to respond before the incident occurs. This includes establishing a suitable level of protections as explained in the chapters before. Not only are incidents avoided through this protection but if the incident becomes severe, the damage which can occur is limited. Protection includes preparing incident handling guidelines as part of a contingency plan for your organization or site. Having written plans eliminates much of the ambiguity which occurs during an incident, and will lead to a more appropriate and thorough set of responses. As explained for the site specific contingency plan in section xxx it is vitally important to test the proposed plan before an incident actually occurs through 'dry runs'. A team might even consider hiring a tiger team to act in parallel with the dry run. Once a site has recovered from an incident, site policy and procedures should be reviewed to encompass changes to prevent similar incidents. If an incident is based on poor policy, and unless the policy is changed, then one is doomed to repeat the past. Even without an incident, it would be prudent to review policies and procedures on a regular basis. Reviews are imperative due to today's changing computing environments. To improve this process a problem reporting procedure should be implemented to describe, in detail, the incident and the solutions to the incident. Each incident should be reviewed by the site security subgroup to allow understanding of the incident with possible suggestions to the site policy and procedures. Learning to respond efficiently to an incident is important for numerous reasons: (1) protect the assets which are to protect by normal security in case of a worst event (2) protect your resources which could be utilized more profitably if an incident did not require their services (3) take care that (government) regulations are complied with (4) prevent use of your systems against other systems (which could incur legal liability) (5) minimize the potential for negative exposure As in any set of pre-planned procedures, attention must be placed on a set of goals for handling an incident. These goals will be prioritized differently depending on the site. The set of goals will be closely related to the goals for security in general. Therefore, the same guidelines as in section xxx for security in general might be applied here. A specific set of objectives can be identified for dealing with incidents: Site Security Handbook Working Group [Page 22] Internet Draft Site Security Handbook November 1995 (1) Figure out how it happened. (2) Find out how to avoid further exploitations. (3) Avoid escalation and further incidents. (4) Recover from the incident. (5) Find out who did it. Due to the nature of the incident there might be a conflict between analyzing the original source of a problem instead of restoring systems and services. In this case overall goals (like assuring the integrity of (life) critical systems) might be the reason for not analyzing an incident. Of course this is an important management decision, but all involved parties must be aware that without a analysis the same incident may happen again. It is important to prioritize actions to be taken during an incident well in advance of the time an incident occurs. Sometimes an incident may be so complex that it is impossible to do everything at once to respond to it; priorities are essential. Although priorities will vary from institution to institution, the following suggested priorities may serve as a starting point for defining an organization's response: (1) Priority one -- protect human life and people's safety; human life always has precedence over all other considerations. (2) Priority two -- protect classified and/or sensitive data. Prevent exploitation of classified and/or sensitive systems, networks or sites. Inform effected classified and/or sensitive systems, networks or sites about already occurred penetrations. (Be aware of regulations by your site or by government) (3) Priority three -- protect other data, including proprietary, scientific, managerial and other data, because loss of data is costly in terms of resources. Prevent exploitations of other systems, networks or sites and inform already effected systems, networks or sites about successful penetrations. (4) Priority four -- prevent damage to systems (e.g., loss or alteration of system files, damage to disk drives, etc.); damage to systems can result in costly down time and recovery. (5) Priority five -- minimize disruption of computing resources; it is better in many cases to shut a system down or disconnect from a network than to risk damage to data or systems. An important implication for defining priorities is that once human life and national security considerations have been addressed, it is generally more important to save data than system software and hardware. Although it is undesirable to have any damage or loss Site Security Handbook Working Group [Page 23] Internet Draft Site Security Handbook November 1995 during an incident, systems can be replaced; the loss or compromise of data (especially classified data), however, is usually not an acceptable outcome under any circumstances. Another important concern is the effect on others, beyond the systems and networks where the incident occurs. Within the limits imposed by government regulations it is always important to inform effected parties at soon as possible. Due to the legal implications of this topic, it should be included in the planned procedures to avoid further delays and uncertainty for the administrators. Any plan for responding to security incidents should be guided by local policies and regulations. Government and private sites that deal with classified material have specific rules that they must follow. The policies chosen by your site on how it reacts to incidents will shape your response. For example, it may make little sense to create mechanisms to monitor and trace intruders if your site does not plan to take action against the intruders if they are caught. Other organizations may have policies that affect your plans. Telephone companies often release information about telephone traces only to law enforcement agencies. You may also note that if any legal action is planned, there are specific guidelines that must be followed to make sure that any information collected can be used as evidence. 5.2 Notification and Point of Contacts It is important to establish contacts with various personnel before a real incident occurs. These contacts are either local,related toother system administrators elsewhere on thenetworkinternet or are investigative agencies. Working with these contacts appropriately will help to make your incident handling process more efficient.o PointCommunication may need to be established with various "Points ofContact (POC) people (Technical, Administrative, Response Teams, Investigative, Legal, Vendors,Contact." These may be technical or administrative in nature, may include legal or investigative agencies, as well as Serviceproviders),Providers andwhich POCs are visiblevendors. It is important towhom. o Wider community (users). o Other sites that mightdecide how much information will beaffected. o Theshared, especially with the wider community of users at a site, with the public(press). These(the press) and with other sites. Settling these issues are especially important for the local person responsible for handling the incident, since that is the person responsible for the actual notification of others. A list of contacts in each of these categories is an important time saver for this person during an incident. It can be quite difficult to find an appropriate person during an incident when many urgent events are ongoing. Including relevant telephone numbers (also electronic mail addresses and fax numbers) in the site securitypolicy is strongly recommended.policy is strongly recommended. It is especially important to know how to contact individuals who will be directly involved in handling a security incident. Site Security Handbook Working Group [Page 24] Internet Draft Site Security Handbook November 1995 5.2.1 Local Managers and Personnel When an incident is under way, a major issue is deciding who is in charge of coordinating the activity of the multitude of players. A major mistake that can be made is to have a number of "points of contact" (POC) that are not pulling their efforts together. This will only add to the confusion of the event, and will probably lead toadditional confusion andwasted or ineffective effort. The single point of contact may or may not be the person "in charge" of the incident. There are two distinct rolls to fill when deciding who shall be the point of contact and the person in charge of the incident. The person in charge will make decisions as to the interpretation of policy applied to the event. The responsibility for the handling of the event falls onto this person. In contrast, the point of contact must coordinate the effort of all the parties involved with handling the event. The point of contact must be a person with the technical expertise to successfully coordinate the effort of the system managers and users involved in monitoring and reacting to the attack. Often the management structure of a site is such that the administrator of a set of resources is not a technically competent person with regard to handling the details of the operations of the computers, but is ultimately responsible for the use of these resources. Another important function of the POC is to maintain contact with law enforcement and other external agencies to assure thatmulti-agencymulti- agency involvement occurs. (In the U.S. FBI, CIA, DoD, U.S. Army, or others might be concerned.) Finally, if legal action in the form of prosecution is involved, the POC may beableasked to speak for the site in court. The alternative is to have multiple witnesses that will be hard to coordinate in a legal sense, and will weaken any case against the attackers. A single POC may also be the single person in charge ofevidence collected,collecting evidence, which will keep the number of people accounting for evidence to a minimum. As a rule of thumb, the more people that touch a potential piece of evidence, the greater the possibility that it will be inadmissible in court.It is very important to prepare a method of notification, so that you will know who to call and how to contact them. For example, every memberOne of theDepartmentmost critical tasks for the POC is the coordination ofEnergy's CIAC Team carries a card with every other team member's work and home phone numbers, as well as pager numbers. <no text yet> +all relevant processes. As responsibilitiesaremight be distributed over the whole site,this has strong implicationswhich in fact can consist of multiple independent departments or groups, a well coordination effort is crucial forcoordinatingoverall success. The situation get even worse if multiple sites are involved. In many cases, no single POC in one nvolved site can coordinate the handling of an entire incident. The appropriate incident response teams are more suitable, if multiple sites are involved. The incident handling process+ in multi-site incidents this even gets worser + the POSshould provide some escalation mechanisms. The POC might change; the impact of the incident force the management to take the lead instead of giving the technical Site Security Handbook Working Group [Page 25] Internet Draft Site Security Handbook November 1995 administrator the responsibility. Other reasons for changing the POC are the emergence of conflicts of interest, changing priorities or responsibilities. Regardless of why the POC is changed, all involved parties must bechanged duringinformed. Arrangements should be made to allow the new POC to contact the old one, to ensure anincident becauseadequate briefing ofconflicting priorities/responsibilitiesbackground information. 5.2.2 Law Enforcement and Investigative Agencies In the event of an incident it is important to establish contact with investigative agencies such as the FBI and Secret Service or their equivalent in your country, as soon aspossible, for several reasons.possible. Local law enforcement and local security offices or campus policeorganizationsdepartments should also be informed when appropriate. A primary reason is that once a major attack is in progress, there is little time to call various personnel in these agencies to determine exactly who the correct point of contact is. Another reason is that it is important to cooperate with these agencies in a manner that will foster a good working relationship, and that will be in accordance with the working procedures of these agencies. Knowing the working procedures in advance and the expectations of your point of contact is a big step in this direction. For example, it is important to gather evidence that will be admissible in a court of law. If you don't know in advance how to gather admissible evidence, your efforts to collect evidence during an incident are likely to be of no value to the investigative agency with which you deal. A final reason for establishing contacts as soon as possible is that it is impossible to know the particular agency that will assume jurisdiction in any given incident. Making contacts and finding the proper channels early will make responding to an incident go considerably more smoothly. If your organization or site has a legal counsel, you need to notify this office soon after you learn that an incident is in progress. At a minimum, your legal counsel needs to be involved to protect the legal and financial interests of your site or organization. There are many legal and practical issues, a few of which are: (1) Whether your site or organization is willing to risk negative publicity or exposure to cooperate with legal prosecution efforts. (2) Downstream liability--if you leave a compromised system as is so it can be monitored and another computer is damaged because the attack originated from your system, your site or organization may be liable for damages incurred. (3) Distribution of information--if your site or organization distributes information about an attack in which another site or organization may be involved or the vulnerability in a product that may affect ability to market that product, your site or organization may again be liable for any damages (including damage of reputation). (4) Liabilities due to monitoring--your site or organization Site Security Handbook Working Group [Page 26] Internet Draft Site Security Handbook November 1995 may be sued if users at your site or elsewhere discover that your site is monitoring account activity without informing users. Unfortunately, there are no clear precedents yet on the liabilities or responsibilities of organizations involved in a security incident or who might be involved in supporting an investigative effort. Investigators will often encourage organizations to help trace and monitor intruders -- indeed, most investigators cannot pursue computer intrusions without extensive support from the organizations involved. However, investigators cannot provide protection from liability claims, and these kinds of efforts may drag out for months and may take lots of effort. On the other side, an organization's legal council may advise extreme caution and suggest that tracing activities be halted and an intruder shut out of the system. This in itself may not provide protection from liability, and may prevent investigators from identifying anyone. The balance between supporting investigative activity and limiting liability is tricky; you'll need to consider the advice of your council and the damage the intruder is causing (if any) in making your decision about what to do during any particular incident. Your legal counsel should also be involved in any decision to contact investigative agencies when an incident occurs at your site. The decision to coordinate efforts with investigative agencies is most properly that of your site or organization. Involving your legal counsel will also foster the multi-level coordination between your site and the particular investigative agency involved which in turn results in an efficient division of labor. Another result is that you are likely to obtain guidance that will help you avoid future legal mistakes. Finally, your legal counsel should evaluate your site's written procedures for responding to incidents. It is essential to obtain a "clean bill of health" from a legal perspective before you actually carry out these procedures. One of the most important considerations in dealing with investigative agencies is verifying that the person who calls asking for information is a legitimate representative from the agency in question. Unfortunately, many well intentioned people have unknowingly leaked sensitive information about incidents, allowed unauthorized people into their systems, etc., because a caller has masqueraded as a representative of a government agency (e. g. the FBI or Secret Service in the US). A similar consideration is using a secure means of communication. Because many network attackers can easily reroute electronic mail, avoid using electronic mail to communicate with other agencies (as well as others dealing with the incident at hand). Non-secured phone lines (e. g., the phones normally used in the business world) are also frequent targets for tapping by network intruders, so be careful! Site Security Handbook Working Group [Page 27] Internet Draft Site Security Handbook November 1995 There is no established set of rules for responding to an incident when the local Government becomes involved. Normally, except by court order, no agency can force you to monitor, to disconnect from the network, to avoid telephone contact with the suspected attackers,etc..etc. As discussed before, you should consult the matter with your legal counsel, especially before taking an action that your organization has never taken. The particular agency involved may ask you to leave an attacked machine on and to monitor activity on this machine, for example. Your complying with this request will ensure continued cooperation of theagency--usuallyagency. This is usually the best route towards finding the source of the network attacks and, ultimately, terminating these attacks. Additionally, you may needsomeinformation or a favor from the agency involved in the incident. You are likely to get what you need only if you have been cooperative.Of particular importanceIt isavoidingparticularly important to avoid unnecessary or unauthorized disclosure of information about the incident, including any information furnished by the agency involved. The trust between your site and the agency hinges upon your ability to avoid compromising the case the agency will build; keeping "tight lipped" is imperative. Sometimes your needs and the needs of an investigative agency will differ. Your site may want to get back to normal business by closing an attack route, but the investigative agency may want you to keep this route open. Similarly, your site may want to close a compromised system down to avoid the possibility of negative publicity, but again the investigative agency may want you to continue monitoring. When there is such a conflict, there may be a complex set of tradeoffs (e.g., interests of your site's management, amount of resources you can devote to the problem, jurisdictional boundaries, etc.). An important guiding principle is related to what might be called "Internet citizenship" [22, IAB89, 23 (xxx old references)] and its responsibilities. Your site can shut a system down, and this will relieve you of the stress, resource demands, and danger of negative exposure. The attacker, however, is likely to simply move on to another system, temporarily leaving others blind to the attacker's intention and actions until another path of attack can be detected. Providing that there is no damage to your systems and others, the most responsible course of action is to cooperate with the participating agency by leaving your compromised system on. This will allow monitoring (and, ultimately, the possibility of terminating the source of the threat to systems just like yours). On the other hand, if there is damage to computers illegally accessed through your system, the choice ismorecomplicated: shutting down the intruder may prevent further damage to systems, but might make it impossible to track down the intruder. If there has been damage, the decision about whether it is important to leave systems up to catch the intruder should involve all the organizations effected. Further complicating the issue of network responsibility is the consideration that if you do not cooperate withthe agency involved,an agency, you will be less likely to receive help from that agency in the future. 5.2.3 Computer Security Incident Handling Teams There now exists a number of Computer Security Incident Handling Site Security Handbook Working Group [Page 28] Internet Draft Site Security Handbook November 1995 teams (CSIH teams) such as the CERT Coordination Center and the CIAC or other teams around the globe. Teams exist for many major government agencies and large corporations. If such a team is available, notifying it should be of primary importance during the early stages of an incident. These teams are responsible for coordinating computer security incidents over a range of sites and larger entities. Even if the incident is believed to be containedtowithin a single site, it is possible that the information available through a response team could help in closing out the incident. If it is determined that the breach occurred due to a flaw in the systems' hardware or software, the vendor (or supplier) and a Computer Security Incident Handling team should be notified as soon as possible. This is especially important due to the fact that many other systems are vulnerable, too. In setting up a site policy for incident handling, it may be desirable to create a subgroup, much like those teams that already exist, that will be responsible for handling computer security incidents for the site (or organization). If such a team is created, it is essential that communication lines be opened between this team and other teams. Once an incident is under way, it is difficult to open atrusted dialogue between othertrusted dialogue between other teams if none has existed before. (See [RFC xxx] for more information about the considerations for creating your own incident handling team.) 5.2.4 Effected and involved Sites If an incident has an impact on other sites, it is good practice to inform them. It may be obvious from the beginning that the incident is not limited to the local site, or it may emerge only after further analysis. Each site might choose to contact other sites directly or they can pass the information to an appropriate incident response team, to which the involved site belongs. As it is often very difficult to find the responsible POC at remote sites, the involvement of an incident response team will facilitate contact by making use of already established channels. The legal and liability issues arising from a security incident may differ from site to site. It is important to define a policy for the sharing and logging of information about other sites before an incident occurs. This policy should be crafted in consultation with legal counsel. Information about specific people is especially sensitive, and is subject to privacy laws. To avoid problems in this area, irrelevent information should be deleted and a statement of how to handle the remaining information should be included. A clear statement of how this information is to be used is essential. (No one who informs a site of a security incident would like to read in the newspaper that they also had a problem.) Incident response teamsif none has existed before. (See [RFC xxx] for moreare valuable in this respect. As they pass informationaboutto responsible POCs, they are Site Security Handbook Working Group [Page 29] Internet Draft Site Security Handbook November 1995 able to protect theconsiderations for creating your own incident handling team.) 5.2.4 Effectedanonymity of the original source. But be aware that in many cases the analysis of logs andinvolved Sites <no text yet> + special careinformation at other sites will reveal addresses of your site. All the problems discussed should betakennot seen as reasons not toinforminvolve othereffected sites + directly: but who is responsible for the contact, who is responsible for an incident atsites. In fact, theotherexperiences of existing teams reveal that most sites informed about security problems are not even aware that their site+ indirectly: utilizing a incident response team + responsibility/liability + privacy reasonshad been compromised. Without timely information other sites are often unable toprotect specific datatake measures against intruders. 5.2.5 Public Relations - Press Releases One of the most important issues to consider is when, who, and how much to release to the general public through the press. There are many issues to consider when deciding this particular issue. First and foremost, if a public relations office exists for the site, it is important to use this office as liaison to the press. The public relations office is trained in the type and wording of information released, and will help to assure that the image of the site is protected during and after the incident (if possible). A public relations office has the advantage that you can communicate candidly with them, and provide a buffer between the constant press attention and the need of the POC to maintain control over the incident. If a public relations office is not available, the information released to the press must be carefully considered. If the information is sensitive, it may be advantageous to provide only minimal or overview information to the press. It is quite possible that any information provided to the press will be quickly reviewed by the perpetrator of the incident. As a contrast to this consideration, it was discussed above that misleading the press can often backfire and cause more damage than releasing sensitive information. While it is difficult to determine in advance what level of detail to provide to the press, some guidelines to keep in mind are:o(1) Keep the technical level of detail low. Detailed information about the incident may provide enough information for copy-cat events or even damage the site's ability to prosecute once the event is over.o(2) Keep the speculation out of press statements. Speculation of who is causing the incident or the motives are very likely to be in error and may cause an inflamed view of the incident.o(3) Work with law enforcement professionals to assure that evidence is protected. If prosecution is involved, assure that the evidence collected is not divulged to the press.o(4) Try not to be forced into a press interview before you are prepared. The popular press is famous for the "2am" interview, where the hope is to catch the interviewee off Site Security Handbook Working Group [Page 30] Internet Draft Site Security Handbook November 1995 guard and obtain information otherwise not available.o(5) Do not allow the press attention to detract from the handling of the event. Always remember that the successful closure of an incident is of primary importance. 5.3 Identifying an Incident 5.3.1It isIs it real? This stage involves determining, if a problem really exist. Of course many, if not most, signs often associated with virus infections, system intrusions, malicious users, etc., are simply anomalies such as hardware failures or suspicious system/user behavior. To assist in identifying whether there really is an incident, it is usually helpful to obtain and use any detection software which may be available. For example, widely available software packages can greatly assist someone who thinks there may be a virus in a personal computer. Audit information is also extremely useful, especially in determining whether there is a network attack. It is extremely important to obtain a system snapshot as soon as one suspects that something is wrong. Many incidents cause a dynamic chain of events to occur, and an initial system snapshot may do more good in identifying the problem and any source of attack than most other actions which can be taken at this stage. Finally, it is important to start a log book. Recording system events, telephone conversations, time stamps, etc., can lead to a more rapid and systematic identification of the problem, and is the basis for subsequent stages of incident handling. There are certain indications or "symptoms" of an incident which deserve special attention:o(1) System crashes.o(2) New user accounts (e.g., the account RUMPLESTILTSKIN has unexplainedly been created), or high activity on an account that has had virtually no activity for months.o(3) New files (usually with novel or strange file names, such as data.xx or k).o(4) Accounting discrepancies (e.g., in a UNIX system you might notice that the accounting file called /usr/admin/lastlog has shrunk, something that should make you very suspicious that there may be an intruder).o(5) Changes in file lengths or dates (e.g., a user should be suspicious if he/she observes that the .EXE files in an MS DOS computer have unexplainedly grown by over 1800 bytes).o(6) Attempts to write to system (e.g., a system manager notices that a privileged user in a VMS system is attempting to alter RIGHTSLIST.DAT).o(7) Data modification or deletion (e.g., files start to disappear).o(8) Denial of service (e.g., a system manager and all Site Security Handbook Working Group [Page 31] Internet Draft Site Security Handbook November 1995 other users become locked out of a UNIX system, which has been changed to single user mode).o(9) Unexplained, poor system performance (e.g., system response time becomes unusually slow).o(10) Anomalies (e.g., "GOTCHA" is displayed on a display terminal or there are frequent unexplained "beeps").o(11) Suspicious probes (e.g., there are numerous unsuccessful login attempts from another node).o(12) Suspicious browsing (e.g., someone becomes a root user on a UNIX system and accesses file after file in one user's account, then another's). By no meansdoesis this listimplies, that all possible signs are covered here. Nonecomprehensive. We have just listed a number of common indicators. Furthermore, none of these indications is absolute "proof" that an incident is occurring, nor are all of these indications normally observed when an incident occurs. If you observe any of these indications, however, it is important to suspect that an incident might be occurring, and act accordingly. There is no formula for determining with 100 percent accuracy that an incident is occurring. It is best at this point to collaborate with other technical and computer security personnel to make a decision as a group about whether an incident is occurring. 5.3.2 Types and Scope of Incidents Along with the identification of the incident is the evaluation of the scope and impact of the problem. It is important to correctly identify the boundaries of the incident in order to effectively deal with it. In addition, the impact of an incident will determine its priority in allocating resources to deal with the event. Without an indication of the scope and impact of the event, it is difficult to determine a correct response. In order to identify the scope and impact, a set of criteria should be defined which is appropriate to the site and to the type of connections available. Some of the issues are:o(1) Is this a multi-site incident?o(2) Are many computers at your site effected by this incident?o(3) Is sensitive information involved?o(4) What is the entry point of the incident (network, phone line, local terminal, etc.)?o(5) Is the press involved?o(6) What is the potential damage of the incident?o(7) What is the estimated time to close out the incident?o(8) What resources could be required to handle the incident? 5.3.3 Assessing the Damage and Extent The analysis of the damage and extent of the incident can be quite time consuming, but should lead into some of the insight as to the nature of the incident, and aid investigation and prosecution. Site Security Handbook Working Group [Page 32] Internet Draft Site Security Handbook November 1995 As soon as the breach has occurred, the entire system and all its components should be considered suspect. System software is the most probable target. Preparation is key to be able to detect all changes for a possibly tainted system. This includes checksumming all tapes from the vendor using a checksum algorithm which (hopefully) is resistant to tampering [10]. (See sections xxx.) Assuming original vendor distribution tapes are available, an analysis of all system files should commence, and any irregularities should be noted and referred to all parties involved in handling the incident. It can be very difficult, in some cases, to decide which backup tapes are showing a correct system status; consider that the incident may have continued for months or years before discovery, and that the suspect may be an employee of the site, or otherwise have intimate knowledge or access to the systems. In all cases, the pre-incident preparation will determine what recovery is possible. If the system supports centralized logging (most do), go back over the logs and look for abnormalities. If process accounting and connect time accounting is enabled, look for patterns of system usage. To a lesser extent, disk usage may shed light on the incident. Accounting can provide much helpful information in an analysis of an incident and subsequent prosecution. If you can address all aspects of a specific incident strongly depends on the success of this analysis. This also effects theefficienceefficiency of the incident handling process. Review the lessons learned from the analysis and always update the policy and procedures to reflect changes necessitatedby the incident. 5.4 Handling an Incident A major topic still untouched here is how to actually respond to an event. The response to an event will fall into the general categories: o Containment. o Eradication. o Recovery. o Follow-up. Two other topicsby the incident. 5.4 Handling an Incident Certain steps arevital fornecessary to take during theincidenthandlingand relevant forof an incident. In all security related activities, the most important point to be made is: One should have policies in place. Without defined goals, activities undertaken will remain without focus. One of thecategories mentioned above. They are therefore addressed before these categories: o Typesmost fundamental objectives is to restore control ofnotificationthe effected systems and to limit theexchange of information o Protection of evidenceimpact andactivity logs <no text yet> + what aredamage. In thegoals for this process. + what isworst case scenario, shutting down thebest way to handle an incident. + important: do notsystem or disconnecting the system from the network may the only practical solution. As the activities involved are complex, try tohandle the incident alone +get as much help asnecessary <no text yet> + part of original RFC1244 which isnecessary. While trying tolesssolve the problem alone, real damage might occur due toservedelays or missing information. Most system administrators take the discovery of an intruder asa guideline but neverthelesspersonal challenge. By proceeding this way, other objectives as outlined in thetopics shouldlocal policies may not always considered. Trying to catch intruders may becovered here, too <NOTE: thisa very low priority, compared to system integrity, for example. One other pitfall isfrom rfc1244; don't know if it willthe premise that by monitoring the activities of a hacker, other sites can beincluded 5.4.1 What Will You Do? o Restore control. o Relationwarned about problems they have been exposed to. By allowing the intruder topolicy. o Which level of service is needed? o Monitor activity. o Constrain(ab)use the local system, a site may be incurring legal liability orshut down system. END OF rfc1244 section>indirect responsibility for the damage caused to other sites. For all the reasons outlined above, it is necessary to establish procedures for the incident handling to allow the technical Site Security Handbook Working Group [Page 33] Internet Draft Site Security Handbook November 1995 administrator to do the "right" thing, as identified by management and legal counsel. 5.4.1 Types of notification, Exchange of information When you have confirmed that an incident is occurring, the appropriate personnel must be notified. How this notification is achieved is very important in keeping the event under control both from a technical and emotional standpoint.To aid prompt acknowledgment and understanding of the problem, theThe circumstances should be described in as much detail aspossible.possible, in order to aid prompt acknowledgment and understanding of the problem. Great care should be taken to which groups detailed technical information is given during the notification. For example it is helpful to pass this kind of information to an incident handling team. They can assist you by providing helpful hints for eradicating the vulnerabilities involved in an incident. On the other hand putting the critical knowledge into the public domain (e. g. netnews, mailing lists) may potentially put a great number of systems at risk of intrusion. It is a wrong assumption, that all administrators are reading a particular news group, have access to operating system source code or can even understand the techniques well enough to take adequate steps. First of all, any notification to either local or off-site personnel must be explicit. This requires that any statement (be it an electronic mail message, phone call, or fax) provides information about the incident that is clear, concise, and fully qualified. When you are notifying others that will help you to handle an event, a "smoke screen" will only divide the effort and create confusion. If a division of labor is suggested, it is helpful to provide information to each section about what is being accomplished in other efforts. This will not only reduce duplication of effort, but allow people working on parts of the problem to know where to obtain other information that would help them resolve a part of the incident. Another important consideration when communicating about the incident is to be factual. Attempting to hide aspects of the incident by providing false or incomplete information may not only prevent a successful resolution to the incident, but may even worsen the situation. This is especially true when the press is involved. When an incident severe enough to gain press attention is ongoing, it is likely that any false information you provide will not be substantiated by other sources. This will reflect badly on the site and may create enough ill-will between the site and the press to damage the site's public relations. The choice of language used when notifying people about the incident can have a profound effect on the way that information is received. When you use emotional or inflammatory terms, you raise the expectations of damage and negative outcomes of the incident. It is important to remain calm both in written and spoken notifications. Another aspect of the choice of language used is that not all people speak the same language. Due to this fact misunderstandings and delay may arise, especially if it is a multi-national incident.<no text yet> + more aboutSite Security Handbook Working Group [Page 34] Internet Draft Site Security Handbook November 1995 Other internationalaspectsconcerns include differing legal implications of a security incident and cultural differences. Cultural differences do not only exist between countries like Germany and Australia. They even exist within countries between different social groups or user groups. An administrator of a university system might be very relaxed about attempts to connect to the system via telnet, but the administrator of a military system will follow his procedures and consider the same action as a possible attack. Another issue associated with the choice of language is the notificationtoof non-technical or off-site personnel. It is important to accurately describe the incident without undue alarm or confusing messages. While it is more difficult to describe the incident to a non-technical audience, it is often more important. A non-technical description may be required for upper-level management, the press, or law enforcement liaisons. The importance of these notifications cannot be underestimated and may make the difference between handling the incident properly and escalating to some higher level of damage.<no text yet> + TemplateIf a IRT becomes involved it might be necessary to fill out a template for the information exchange. Although this may seem to be an additional burden and adds a certain delay, it helps the team to act on this minimum set of information. The response team may be able to respond to aspects of the incident which the local administrator is unaware of. If information is given out to someone else, the following minimum informationexchange+ + Advice for Reporting, e.g. Timezonesshould be provided: (1) timezone of logs, ... inGMT,... + give wholeGMT or local time (2) information(e.g.about the remote system (including host names, ip addresses and user ids) (3) alltcpwrapperlog entriesbelonging torelevant for theincident and not only talking about: "foundremote site If local information (ie. local user ids) is included in thetcp wrapper logs, I assume ..." + international aspectslog entries, it might be necessary to sanitize the entries beforehand to avoid privacy issues. In general, all information which might assist a remote site in resolving an incident should be given out, unless local policies prohibit this. 5.4.2 Protection of evidence and activity logs When you respond to an incident, document all details related to the incident. This will provide valuable information to yourself and others as you try to unravel the course of events. Documenting all details will ultimately save you time. If you don't document every relevant phone call, for example, you are likely to forget a good portion of information you obtain, requiring you to contact the source of information once again. This wastes yours and others' time, something you can ill afford. At the same time, recording details will provide evidence for prosecution efforts, providing the case moves in this direction. Documenting an incident also will help you perform a final assessment of damage (something your management as well as law enforcement officers will want to know), and will Site Security Handbook Working Group [Page 35] Internet Draft Site Security Handbook November 1995 provide the basis for a follow-up analysis in which youcan engagecan engage in a valuable "lessons learned" exercise. Additionally it will help during later phases of the handling process, especially during the eradiction and recovery. During the initial stages of an incident, it is often infeasible to determine whether prosecution is viable, so you should document as if you are gathering evidence for a court case. At a minimum, you should record: (1) All system events (audit records). (2) All actions you take (time tagged). (3) All phone conversations (including the person with whom you talked, the date and time, and the content of the conversation). The most straightforward way to maintain documentation is keeping a log book. This allows you to go to a centralized, chronological source of information when you need it, instead of requiring you to page through individual sheets of paper. Much of this information is potential evidence in a court of law. Thus, when you initially suspect that an incident will result in prosecution or when an investigative agency becomes involved, you need to regularly (e.g., every day) turn in photocopied, signed copies of your logbook (as well as media you use to record system events) to a document custodian who can store these copied pages in a secure place (e.g., a safe). When you submit information for storage, you should in return receive a signed, dated receipt from the document custodian. Failure to observe these procedures can result in invalidation of any evidence you obtain in avaluable "lessons learned" exercise. Additionally it will help during later phasescourt of law. 5.4.3 Containment The purpose of containment is to limit thehandling process, especially during the eradiction and recovery. During the initial stagesextent of anincident,attack. For example, it isoften infeasibleimportant todetermine whether prosecution is viable, so you should documentlimit the spread of a worm attack on a network asif you are gathering evidence forquickly as possible. An essential part of containment is decision making (i.e., determining whether to shut acourt case. Atsystem down, to disconnect from aminimum, you should record: o Allnetwork, to monitor systemevents (audit records). o All actions you take (time tagged). o All phone conversations (includingor network activity, to set traps, to disable functions such as remote file transfer on a UNIX system, etc.). Sometimes this decision is trivial; shut theperson with whom you talked,system down if thedate and time, andsystem is life-critical, classified or sensitive, or if proprietary information is at risk! In other cases, it is worthwhile to risk having some damage to thecontent ofsystem if keeping theconversation). The most straightforward waysystem up might enable you tomaintain documentationidentify an intruder. This stage should involve carrying out predetermined procedures. Your organization or site should, for example, define acceptable risks in dealing with an incident, and should prescribe specific actions and strategies accordingly. This iskeepingespecially important when alog book. This allows youquick decision is necessary without the possibility togocontact all involved parties and discuss the decision. In most of the cases the person in charge will have not the power to make acentralized, chronological source of information when you need it, instead of requiring youdifficult management decision (like topage through individual sheetsloss the results ofpaper. Mucha costly Site Security Handbook Working Group [Page 36] Internet Draft Site Security Handbook November 1995 experiment). Finally, notification of cognizant authorities should occur during thisinformationstage. In some cases, it ispotential evidenceprudent to remove all access or functionality as soon as possible, and then restore normal operation ina court of law. Thus, when you initially suspectlimited stages. Bear in mind that removing all access while an incidentwill resultis inprosecution or whenprogress will obviously notify all users, including the alleged problem users, that the administrators are aware of a problem; this may have a deleterious effect on aninvestigative agency becomes involved, you needinvestigation. 5.4.4 Eradication Once an incident has been detected, it is important toregularly (e.g., every day) turn in photocopied, signed copies of your logbook (as well as media you usefirst think about containing the incident. Once the incident has been contained, it is now time torecord system events)eradicate the cause. But before eradicate the cause great care should be taken toa document custodian who can store these copied pages in a secure place (e.g., a safe). When you submitcollect all necessary informationfor storage, you should in return receive a signed, dated receipt fromabout thedocument custodian. Failurecompromised system and the cause of the incident due later on they will disappear during the eradication. Software may be available toobserve these procedures can result in invalidation of any evidencehelp youobtainina court of law. 5.4.3 Containment The purpose of containment is to limittheextent of an attack.eradiction process. For example,iteradication software isimportantavailable tolimit the spread of a worm attack on a network as quickly as possible. An essential part of containmenteliminate most viruses which infect small systems. If any bogus files have been created, it isdecision making (i.e., determining whethertime toshutarchive them for later use in case of asystem down, to disconnectcourt case. Thereafter delete them froma network, to monitor system or network activity, to set traps, to disable functions such as remote file transfer on a UNIX system, etc.). Sometimes this decision is trivial; shut the system down ifthe systemis life-critical, classified or sensitive, or if proprietary information isatrisk!this point. Inother cases,the case of virus infections, it isworthwhile to risk having some damageimportant to clean and reformat any disks containing infected files. Finally, ensure that all backups are clean. Many systems infected with viruses become periodically reinfected simply because people do not systematically eradicate thesystem if keeping the system up might enable you to identify an intruder. This stagevirus from backups. After eradiction a new backup shouldinvolve carrying out predetermined procedures. Your organization or site should, for example, define acceptable risks in dealing withbe taken, too. Removing all vulnerabilities once anincident, and should prescribe specific actions and strategies accordingly. This is especially important when a quick decisionincident has occurred isnecessary without the possibilitydifficult. The key tocontact all involved partiesremoving vulnerabilities is knowledge anddiscuss the decision. In mostunderstanding of thecases the person in charge will have not the powerbreach. It may be necessary tomake a difficult management decision (likego back tolosstheresults oforiginal distributed tapes and recustomize the system. To facilitate this worst case scenario, acostly experiment). Finally, notificationrecord ofcognizant authoritiesthe original systems setup and each customization change shouldoccur during this stage. <no text yet> + since I decidedbe kept current with each change tomake this an own section it should contain more more text?the system. Insome cases,the case of a network-based attack, it isprudentimportant toremove all access or functionality as soon as possible, and then restore normal operation in limited stages. Bearinstall patches for any operating system vulnerability which was exploited. As discussed inmind thatsection $$.4.2, a security log can be most valuable during this phase of removingall access while an incidentvulnerabilities. There are two considerations here; the first isin progress will obviously notify all users, includingto keep logs of thealleged problem users,procedures thatthe administrators are aware of a problem; this mayhavea deleterious effectbeen used to make the system secure again. This should include command procedures (e.g., shell scripts) that can be run onan investigation. However, allowing an incidenta periodic basis tocontinue may also openrecheck thelikelihoodsecurity. Second, keep logs ofgreater damage, loss, aggravation, or liability (civil or criminal). That's another reason why the relevant decisions belongimportant system events. These can be referenced when trying to determine thesite policyextent of the damage of a given incident. If a particular vulnerability is isolated as having been exploited, the next step is to find a mechanism to protect your system. The Site Security Handbook Working Group [Page 37] Internet Draft Site Security Handbook November 1995 security mailing lists andshouldbulletins would bedetermined before ana good place to search for this information and you can get advice from incidentoccurs. 5.4.4 Eradictionresponse teams. 5.4.5 Recovery Once the cause of an incident has beendetected, iteradicated, the recovery phase defines the next stage of action. The goal of recovery isimportanttofirst think about containingreturn theincident. Oncesystem to normal. In general, bringing up services in theincident has been contained, it is now timeorder of demand toeradicateallow a minimum of user inconvenience is thecause. But before eradicatebest practice. Understand that thecause great careproper recovery procedures for the system are extremely important and should betakenspecific tocollect all necessary information aboutthecompromisedsite. 5.4.6 Follow-Up Once you believe that a system has been restored to a "safe" state, it is still possible that holes and even traps could be lurking in thecausesystem. One of theincident due later on they will disappear during the eradication. Software may be availablemost important stages of responding tohelp you in the eradiction process. For example, eradication softwareincidents isavailable to eliminatealso the mostviruses which infect small systems. If any bogus filesoften omitted---the follow-up stage. In the follow-up stage, the system should be monitored for items that may have beencreated, it is timemissed during the cleanup stage. It would be prudent toarchive them for later use in caseutilize some ofa court case. Thereafter delete them fromthe tools mentioned in section xxx (e.g., xxx) as a start. Remember, these tools don't replace continual systemat this point. In the case of virus infections, it is important to cleanmonitoring andreformat any disks containing infected files. Finally, ensure that all backups are clean. Manygood systemsinfected with viruses become periodically reinfected simplyadministration procedures. The follow-up stage is important for another reason, too, becausepeople do not systematically eradicateit helps those involved in handling thevirus from backups. After eradictionincident develop anew backup should be taken, too. Removing all vulnerabilities onceset of "lessons learned" (see section $$.5) to improve future performance in such situations. This stage also provides information which justifies anincident has occurred is difficult. The keyorganization's computer security effort toremoving vulnerabilities is knowledgemanagement, andunderstanding of the breach. Ityields information which may benecessary to go back toessential in legal proceedings. The most important element of theoriginal distributed tapesfollow-up stage is performing a postmortem analysis. Exactly what happened, andrecustomizeat what times? How well did thesystem. To facilitate this worst case scenario, a recordstaff involved with the incident perform? What kind of information did theoriginal systems setupstaff need quickly, andeach customization change should be kept current with each change to the system. Inhow could they have gotten that information as soon as possible? What would the staff do differently next time? A follow-up report is valuable because it provides a reference to be used in case of other similar incidents. Creating anetwork-based attack,formal chronology of events (including time stamps) is also important for legal reasons. Similarly, it is also important toinstall patches foras quickly obtain a monetary estimate of the amount of damage the incident caused in terms of anyoperating system vulnerability which was exploited. <no text yet> + patchloss of software and files, hardware damage, and manpower costs to restore altered files, reconfigure affected systems, and so forth. This estimate may become the basis forvulnerabilities not available + uncertaintysubsequent prosecution activity. 5.5 Aftermath ofcause As discussed in section $$.4.2, a security logan Incident In the wake of an incident, several actions should take place. These actions can bemost valuable during this phasesummarized as follows: (1) An inventory should be taken ofremoving vulnerabilities. There are two considerations here;thefirst is to keep logssystems' assets, Site Security Handbook Working Group [Page 38] Internet Draft Site Security Handbook November 1995 i. e., a careful examination should determine how the system was affected by the incident, (2) The lessons learned as a result of theprocedures that have been usedincident should be included in revised security plan tomakeprevent thesystem secure again. Thisincident from re-occurring, (3) A new risk analysis shouldinclude command procedures (e.g., shell scripts) that canberun on a periodic basis to recheckdeveloped in light of thesecurity. Second, keep logsincident, (4) An investigation and prosecution ofimportant system events. These can be referenced when tryingthe individuals who caused the incident should commence, if it is deemed desirable. All four steps should provide feedback todetermine the extent ofthedamagesite security policy committee, leading to prompt re-evaluation and amendment ofa given incident. 5.4.5 Recovery Oncethecause ofcurrent policy. If an incidenthas been eradicated, the recovery phase defines the next stage of action. The goal of recoveryisto returnbased on poor policy, and unless thesystempolicy is changed, then one is doomed tonormal. In general, bringing up services inrepeat theorder of demand to allowpast. Once aminimum of user inconvenience is the best practice. Understand that the proper recovery procedures for the system are extremely importantsite has recovered from and incident, site policy and procedures should bespecificreviewed tothe site. <no text yet> + more text needed? 5.4.6 Follow-Up Once you believe that a system has been restoredencompass changes toa "safe" state,prevent similar incidents. Even without an incident, itis still possible that holes and even traps couldwould belurking in the system. One of the most important stages of respondingprudent toincidentsreview policies and procedures on a regular basis. Reviews are imperative due to today's changing computing environments. After an incident, it isalso the most often omitted---the follow-up stage. In the follow-up stage, the system should be monitored for items that may have been missed during the cleanup stage. It would beprudent toutilize some of the tools mentioned in section xxx (e.g., xxx) aswrite astart. Remember, these tools don't replace continual systemreport describing the incident, method of discovery, correction procedure, monitoring procedure, andgood systems administration procedures. The follow-up stage is important for another reason, too, because it helps those involveda summary of lesson learned. This will aid inhandlingthe clear understanding of the problem. Remember, it is difficult to learn from an incidentdevelop a setif you don't understand the source. The whole purpose of"lessons learned" (see section $$.5)this "post mortem" process is to improve all security measures to protect the site against futureperformance in such situations. This stage also provides information which justifiesattacks. In the light of anorganization's computer security effortincident one should gather practical knowledge from the experience. This should improve one's ability tomanagement, and yields information which may be essentialdetect the occurance of similar problems inlegal proceedings. The mostthe future. A concrete goal is developing proactive methods, for example: early warning by probes. Another importantelementfacet of thefollow-up stageaftermath isperforming a postmortem analysis. Exactly what happened,more related to end user andat what times? How well did the staff involved withadministrator awareness and eduction. By reviewing the actual incidentperform? Whathandling effort the process can be improved and extended to reflect new lessons learned. All this will help the site in the handling of future incidents even if a completely different kind ofinformation didattack occurs. 5.6 Responsibilities It is one thing to protect one's own network, but quite another to assume that one should protect other networks. During thestaff need quickly,handling of an incident, certain system vulnerabilities of one's own systems andhow could they have gotten that information as soon as possible? What wouldthestaff do differently next time? A follow-up reportsystems of others become apparent. It isvaluable because it provides a reference toquite easy and may even beusedtempting to pursue the intruders in order to track them. Keep incase of other similar incidents. Creatingmind that at aformal chronology of events (including time stamps)certain point it isalso important for legal reasons. Similarly,possible to 'cross the line,' and with the best intentions, become no better than the Site Security Handbook Working Group [Page 39] Internet Draft Site Security Handbook November 1995 intruder. The best rule when it comes to propriety isalso importantto not use any facility of remote sites which is not public. This clearly excludes any entry onto a system (such asquickly obtainamonetary estimateremote shell or login session) which is not expressly permitted. This may be very tempting; after a breach of security is detected, a system administrator may have theamount ofmeans to 'follow it up,' to ascertain what damage is being done to theincident caused in terms of any loss of software and files, hardware damage, and manpower costsremote site. Don't do it. Instead attempt torestore altered files, reconfigure affected systems, and so forth. This estimate may becomereach thebasis for subsequent prosecution activity. 5.5 AftermathPOC ofan Incident Inthewakeeffected site. 6. MAINTENANCE and EVALUATION 6.1 Risk assessments 6.2 Notification ofan incident, several actions should take place. These actionsproblems/events APPENDICES A1 Tools and Locations This section provides a brief overview of publicly available security technology which can besummarized as follows: (1) An inventory should be takendownloaded from the Internet. Many of thesystems' assets, i. e.,items described below will undoubtedly be surpassed or made obsolete before this document is published. This section is divided into two major subsections, applications and tools. The applications heading will include all end user programs (clients) and their supporting system infrastructure (servers). The tools heading will deal with the tools that acareful examination should determine how the system was affectedgeneral user will never see or need to use, but which may be part of or used bythe incident, (2)applications, used to troubleshoot security problems or guard against intruders by system and network administrators. Thelessons learned as a resultemphasis will be on unix applications and tools, but other platforms, particularly PC's and Macintoshes, will be mentioned where information is available. Most of theincident shouldtools and applications described below can beincludedfound inrevised security plan to preventone of theincident from re-occurring,following two archive sites: (1) CERT Coordination Center ftp://info.cert.org:/pub/tools (2) DFN-CERT ftp://ftp.cert.dfn.de/pub/tools/ (3)A new risk analysis shouldComputer Operations, Audit, and Security Tools (COAST) coast.cs.purdue.edu:/pub/tools Any references to CERT or COAST will refer to these two locations. These two sites act as repositories for most tools, exceptions will bedevelopednoted inlight of the incident, (4) An investigation and prosecution of the individuals who causedtheincident should commence, if ittext. *** It isdeemed desirable. All four steps should provide feedbackimportant to note that many sites, including CERT and COAST are mirrored throughout the Internet. Be careful to use a "well known" mirror sitesecurity policy committee, leadingtoprompt re-evaluation and amendment of the current policy. If an incident is based on poor policy,retrieve software andunless the policyto use whatever verification tools possible, checksums, md5 checksums, etc... to validate that software. A clever cracker might advertise Site Security Handbook Working Group [Page 40] Internet Draft Site Security Handbook November 1995 security software with designed flaws in order to gain access to data or machines. *** Applications The sad truth ischanged, then onethat there are very few security conscious applications currently available. The real reason isdoomed to repeatthepast. Onceneed for asite has recovered from and incident, site policy and procedures shouldsecurity infrastructure which must bereviewedfirst put into place for most applications toencompass changesoperate securely. There is considerable effort currently taking place toprevent similar incidents. Even without an incident, itplace this infrastructure so that applications can take advantage of secure communications. Unix based applications PGP MD5 S/KEY TROJAN.PL PEM KERBEROS Drawbridge Tripwire logdaemon TCP-Wrapper rpcbind/portmapper replacement cops tiger ISS SATAN smrsh swatch identd (not really a security tool) DES (non-US versions) lsof sfingerd passwd-replacements (npasswd / ANLpasswd / passwd+ / ...) A2 Mailing lists and other resources It would beprudent to review policies and procedures on a regular basis. Reviews are imperative due to today's changing computing environments. After an incident, it is prudentimpossible towrite a report describing the incident, methodlist all ofdiscovery, correction procedure, monitoring procedure,the mail-lists anda summary of lesson learned. This will aid inother resources dealing with site security. However, these are some "jump- points" from which theclear understandingreader can begin. All of these references are for theproblem. Remember, it is difficult to learn from an incident if you don't understand the source. <no text yet> + improving proactive methods + educate users, administrators"INTERNET" constituency. More specific (vendor andmanagers 5.6 Responsibilities <no text yet> This is a total new section but it address some aspects because we sometimes experiencegeographical) resouces can be found through these references. Mailing Lists (1) CERT Advisory Send mail to: cert-advisory-request@cert.org Message Body: subscribe cert FIRSTNAME LASTNAME A CERT advisory provides information on how to obtain asomewhat strange interpretationpatch or details ofresponsibility. One example: to protectanetwork isworkaround for agood thing, but to protect someone other's network is difficult, if you don't have the authority to do so. If you startknown computer security problem. The CERT Coordination Center works with vendors totry this new hacker tool onproduce anetwork, it is fair to contact some responsible person before. In the other case we experience (falseSite Security Handbook Working Group [Page 41] Internet Draft Site Security Handbook November 1995 workaround orunnecessary) alerts every nowa patch for a problem, andthen. If you think aboutdoes not publish vulnerability information until asuccessful breakin asworkaround or agood education lesson, you canpatch is available. A CERT advisory may also besued for this like an ordinary cracker. + testing of known vulnerabilities + disclosure of information + testing of local sites + tiger teams (for remote sites) + announcing site security contact information + check advice before acting on behalf (hacker claimsa warning tobe ROOT) + protect the communication + legal procedures was section 5.5.2 inour constituency about ongoing attacks (e.g., "CA-91:18.Active.Internet.tftp.Attacks"). CERT advisories are also published on theold RFC (most of this section is already included in 5.2.2) 6. MAINTENANCE and EVALUATION 6.1 Risk assessments 6.2 Notification of problems/events Appendices A1 Tools and Locations This section provides a brief overview of publicallyUSENET newsgroup: comp.security.announce CERT advisory archives are availablesecurity technology which can be downloadedvia anonymous FTP from info.cert.org in theInternet. Many of the items described below will undoubtedly be surpassed or made obsolete before this document is published. This section is divided into two major subsections, applications and tools. The applications heading will include all end user programs (clients) and their supporting system infrastructure (servers). The tools heading will deal with the tools that a general user will never see or need to use, but which may be part/pub/cert_advisories directory. (2) CERT Tools Mailing List Send mail to: cert-tools-request@cert.sei.cmu.edu Message Body: subscribe cert-tools FIRSTNAME LASTNAME The purpose ofor used by applications, usedthis moderated mailing list is totroubleshootencourage the exchange of information on securityproblems or guard against intruders by systemtools andnetwork administrators.techniques. Theemphasis willlist should not be used for security problem reports. (3) VIRUS-L List Send mail to: listserv%lehiibm1.bitnet@mitvma.mit.edu Message Body: subscribe virus-L FIRSTNAME LASTNAME VIRUS-L is a moderated mailing list with a focus onunix applicationscomputer virus issues. For more information, including a copy of the posting guidelines, see the file "virus-l.README", available by anonymous FTP from cs.ucr.edu. (4) Academic Firewalls Send mail to: majordomo@greatcircle.com Message Body: subscribe firewalls user@host The Firewalls mailing list is a discussion forum for firewall administrators andtools, but other platforms, particularly PC'simplementors. USENET newsgroups (1) comp.security.announce The comp.security.announce newsgroup is moderated andMacintoshes, will be mentioned where informationisavailable. Most ofused solely for thetools and applications described below can be found in onedistribution ofthe following two archive sites: 1. ftp://info.cert.org:/pub/tools CERT Coordination Center 2. coast.cs.purdue.edu:/pub/tools Computer Operations, Audit, and Security Tools (COAST) 3. ftp://ftp.cert.dfn.de/pub/tools/ DFN-CERT Any references toCERTor COAST will refer to these two locations. These two sites act as repositoriesadvisories. (2) comp.security.misc The comp.security.misc is a forum formost tools, exceptions will be noted inthetext. *** It is importantdiscussion of computer security, especially as it relates tonote that many sites, including CERT and COAST are mirrored throughouttheInternet. Be careful to useUNIX(r) Operating System. (3) alt.security The alt.security newsgroup is also a"well known" mirror site to retrieve software and to use whatever verification tools possible, checksums, md5 checksums, etc... to validate that software. A clever cracker might advertise security softwareforum for the Site Security Handbook Working Group [Page 42] Internet Draft Site Security Handbook November 1995 discussion of computer security, as well as other issues such as car locks and alarm systems. (4) comp.virus The comp.virus newsgroup is a moderated newsgroup withdesigned flawsa focus on computer virus issues. For more information, including a copy of the posting guidelines, see the file "virus-l.README", available via anonymous FTP on info.cert.org inorder to gain access to data or machines. *** Applicationsthe /pub/virus-l directory. (5) comp.risks Thesad truthcomp.risks newsgroup isthat there are very few security conscious applications currently available.a moderated forum on the risks to the public in computers and related systems. World-Wide Web Pages (1) http://www.first.org/ Computer Security Resource Clearinghouse. Thereal reasonmain focus is on crisis response information; information on computer security-related threats, vulnerabilities, and solutions. At theneed forsame time, the Clearinghouse strives to be a general index to computer securityinfrastructure which must be first put into place for most applicationsinformation on a broad variety of subjects, including general risks, privacy, legal issues, viruses, assurance, policy, and training. (2) http://www.telstra.com.au/info/security.html This Reference Index contains a list of links tooperate securely.information sources on Network and Computer Security. There isconsiderable effort currently taking placeno implied fitness toplacethe Tools, Techniques and Documents contained within thisinfrastructure soarchive. Many if not all of these items work well, but we do not guarantee thatapplications can take advantagethis will be so. This information is for the education and legitimate use ofsecure communications. Unix based applications PGP MD5 S/KEY TROJAN.PL PEM KERBEROS Drawbridge Tripwire logdaemon TCP-Wrapper rpcbind/portmapper replacement cops tiger ISS SATAN smrsh swatch identd (not really acomputer securitytool) DES (non-US versions) lsof sfingerd passwd-replacements (npasswd / ANLpasswd / passwd+ / ...) A2 Mailing liststechniques only. (3) http://www.alw.nih.gov/Security/security.html This page features general information about computer security. Information is organized by source andother resources <To be completed>each section is organized by topic. Recent modifications are noted in What's New page. Editor Information Barbara Y. Fraser Software Engineering Institute Carnegie Mellon University 5000 Forbes Avenue Pittsburgh, PA 15213 Phone: (412) 268-5010 Fax: (412) 268-6989 email: byf@cert.org Site Security Handbook Working Group [Page 43] ----