view Side-By-Side changes
Network Working Group L. Nerenberg Internet Draft: IMAP4 Binary Content ExtensionACI/MessagingDirectACI Worldwide Document:draft-nerenberg-imap-binary-02.txt Februarydraft-nerenberg-imap-binary-03.txt May 2001 IMAP4 Binary Content Extension Status of this memo This document is an Internet Draft and is in full conformance with all provisions of Section 10 of RFC 2026. Internet Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet Drafts. Internet Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet Drafts as reference material or to cite them other than as "work inprogress.rqprogress." The list of current Internet Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt The list of Internet Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html. A revised version of this draft document will be submitted to the RFC editor as a Proposed Standard for the Internet Community. Discussion and suggestions for improvement are requested. Distribution of this draft is unlimited.0. Administrivia Changes from version -01 Defined the use of <literal8> syntax to APPEND binary data. Added BINARYSTRUCTURE. Removed the restriction for terminal body sections only. Added missing ']' in section 4.2 definitions of BINARY and BINARY.PEEK. Clarified when to return [UNKNOWN-TRANSFER-ENCODING] reponse. Nerenberg draft-nerenberg-imap-binary-02.txt [Page 1] Internet Draft IMAP4 Binary Content Extension February 2001 Changes from version -00 Renamed CHARALL to BINCHAR. Changed syntax to "FETCH x BINARY ..." Defined FETCH BINARY response. Modified <literal8> syntax to distinguish it from <literal>. Added examples section.1. Abstract This memo defines the BINARY extension to the Internet Message Access Protocol [IMAP4rev1]. It provides a mechanism for IMAP4 clients and servers to exchange dataencoded with the MIME BINARY content-transfer-encoding.without using a content-trans- fer-encoding. 2. Conventions Used in this Document The key words "MUST," "MUST NOT," "SHOULD," "SHOULD NOT," and "MAY" in this document are to be interpreted as described in [KEYWORD]. The abbreviation CTE means content-transfer-encoding. In examples, "C:" and "S:" preface lines sent by the client and the server respectively. Nerenberg draft-nerenberg-imap-binary-03.txt [Page 1] Internet Draft IMAP4 Binary Content Extension May 2001 3. Overview The MIME extensions to Internet messaging allow for the transmis- sion of non-textual (binary) message content [MIME-IMB]. Since the traditional transports for messaging are not always capable of passing binary data transparently, MIME provides encoding schemes that allow binary content to be transmitted over transports that are not otherwise able to do so. The overhead of MIME encoding this content can be considerable in some contexts (e.g.clients connected overslow radiolinks).links, streaming multimedia). Reduc- ing the overhead associated with CTE schemes such as base64 can give a noticeable reduction in resource consumption. The BINARY extensionextendslets theIMAP4 protocolserver perform CTE decoding prior toallow clientstransmit- ting message data to the client. 4. Content-Transfer-Encoding Considerations Every MIME message has a content-transfer-encoding. (Those without an explicit Content-Transfer-Encoding header are implicitly labeled as "7bit".) In the terminology of [MIME-IMB], the CTE specifies both a decoding algorithm andserversthe domain of the decoded data. In the context of this memo, "decoding" refers toexchangethe CTE decoding step described in [MIME-IMB]. Certain CTEs use an identity encoding transformation. For these CTEs there is no decoding required, however the domain of the underlying data may not be expressable in the IMAP4 protocol (e.g. "8bit" with NUL octets). To accommodate these cases the BINARY extension introduces into the IMAP4 protocol abinary (unencoded) format. 4.new type of literal element that is fully 8bit transparent. Thus, server processing of the FETCH BINARY command involves two logical steps: 1) perform any CTE-related decoding 2) determine the domain of the decoded data Step 2 is necessary to determine which protocol element should be used to transmit the decoded data. (See FETCH Response Extensions for further details.) 5. Framework for the IMAP4 Binary Extension This memo defines the following extensions to [IMAP4rev1].4.1.5.1. CAPABILITY Identification IMAP4 servers that support this extension MUST include "BINARY" in the response list to the CAPABILITY command. Nerenbergdraft-nerenberg-imap-binary-02.txtdraft-nerenberg-imap-binary-03.txt [Page 2] Internet Draft IMAP4 Binary Content ExtensionFebruaryMay 20014.2.5.2. FETCH Command Extensions This extension defines three new FETCH command data items. BINARY[<section_binary>]<<partial>>The binary content of a particular body section. This data item acts as the [IMAP4rev1] BODY data item does, but with the following modifications: The server converts the MIME content-transfer-encoding ofRequests that the specified body sectionto BINARY before transmitting it to the client. This content conversion MUST NOT cause a loss of information. If the server cannot convert the content-transfer-encoding to BINARY it MUST reject the FETCH command with a NO response that includes the "UNKNOWN-TRANSFER-ENCODING" extended response code.be transmitted after performing CTE-related decoding. When performing a partial FETCH, the offset argument refers to the offset into theCONVERTEDENCODED body section. BINARY.PEEK[<section_binary>]<<partial>> An alternate form of BINARY[<section>] that does not implic- itly set the \Seen flag.BINARYSTRUCTURE The MIME body structureBINARY.SIZE[<section_binary>]<<partial>> Requests the decoded size of themessage after converting allbodysectionssection (i.e. the size to expect in response to the corresponding FETCH BINARYcontent-transfer-encoding. Therequest). When performing a partial FETCH, the offset argument refers to the offset into the ENCODED body section. Client authors are cautioned that this may be an expensive operation for some servercomputesimplementations. Needlessly issuing thisby parsingrequest could result in degraded performance due to servers having to calculate theMIME structure ofvalue every time themes- sage after performing any required content-transfer-encoding modifications. 4.3.request is issued. Requests that specify an offset will almost always require the server to calculate the result dynamically. 5.3. FETCH Response Extensions This extension defines two new FETCH response data items.BINARY[<section>]<<origin_octet>>BINARY[<section>]<<partial>> A <string> or <literal8> expressing the body content of the specified body section afterconverting the content-transfer-encoding to BINARY.removing any CTE-related encod- ing. If theorigin octet is specified, this string is a substringdomain of theentire body contents, starting at that origin octet. This means that BODY[]<0> MAY be truncated, but BODY[]decoded data isNEVER truncated. The offset refers to the body contents after conversion to the BINARY content-transfer-encoding. BINARYSTRUCTURE A parenthesized list describing the MIME body structure of the message after converting all the sections to the BINARY con- tent-transfer-encoding. The contents"8bit" andformat of the list are identical to those of the [IMAP4rev1] BODYSTRUCTURE response, but describe the converted body. Calculating the sizes of the converted content may impose Nerenberg draft-nerenberg-imap-binary-02.txt [Page 3] Internet Draft IMAP4 Binary Content Extension February 2001 considerable overhead on some implementations. A server MAY choose to defer this calculation until the client fetches the item. In this case, all size-related fields wherethesize ofdata does not contain theconverted content is unknown MUST be set to NIL. IfNUL character, the serverreports a non-NIL value for the message size it MUST match the size ofSHOULD return the<literal8>data in acorresponding FETCH BINARY response. For a complex message the server might defer the size calculations for only a subset<literal> instead ofthe sections. Ina <literal8>; thiscaseallows theserver SHOULD reportclient to determine if thesizes for those sec- tions where they can be easily determined. For example, given a message with a 7BIT-encoded section and a BASE64-encoded section,"8bit" data contains theserver should returnNUL char- acter without having to explicitly scan thesizedata stream forthe 7BIT- encoded section, since the 7BIT- and BINARY-encoded sizes will be the same.NULs. If the serveris unabledoes not know how toconvertdecode thecontent-transfer-encod- ing of a section to BINARY,body section's CTE, it MUSTreportfail thecorresponding content encoding and message size fields as NIL, and SHOULD report any other size-related fields as NIL. A section report- ing NIL for content encodingrequest andmessage size cannot be retrieved using FETCH BINARY, and servers MUST reject such requests withissue aNO"NO" responsethat includeswith the"UNKNOWN-TRANS- FER-ENCODING""UNKNOWN-TRANSFER-ENCODING" extended response code.Reporting a message encodingIf the range ofNIL with a non-NIL message size isaprotocol error; servers MUST NOT return this combi- nation of values. 4.4. APPEND Command Extensions The APPEND command is extended to allow the clientpartial FETCH does not refer toappend binary data by specifying the octet count using <literal8> syntax. The server SHOULD NOT perform any content-transfer-encoding conversiona valid sequence of octets for thedata. If the specified mailbox does not supportthestorage of binary contentbody section's CTE, the server MUSTrejectfail theAPPEND command with a NO response that includes the "UNKNOWN-TRANSFER-ENCODING" extended response code. 5. Examples The examples in this section are illustrative only; they DO NOT form part of this specification. Most of these examples uses a message containing two sections: a text/plainrequest andan application/octet-stream. MIME Content MIME Encoded Unencoded Part# Type Encoding Size Size ---------------------------------------------------------------- 1 text/plain 7bit 105 105 2 application/octet-stream base64 3324344 2429327issue a BAD response. Nerenbergdraft-nerenberg-imap-binary-02.txtdraft-nerenberg-imap-binary-03.txt [Page4]3] Internet Draft IMAP4 Binary Content ExtensionFebruaryMay 2001First,BINARY.SIZE[<section>]<<origin_octet>> The size of theclient requestssection after removing any CTE-related encod- ing. The value returned MUST match theBODYSTRUCTURE: C: 42 fetch 6 bodystructure S: * 6 FETCH (BODYSTRUCTURE (("TEXT" "PLAIN" ("CHARSET" "us-ascii") "<1070.981005923.1@localhost>" NIL "7BIT" 105 2 NIL NIL NIL) ("APPLICATION" "OCTET-STREAM" NIL "<1070.981005923.2@localhost>" "The latest BSD kernel" "BASE64" 3324344 NIL NIL NIL) "MIXED" ("BOUNDARY" "----- =_aaaaaaaaaa0") NIL NIL)) S: 42 OK<literal> or <lit- eral8> size that will be returned by a corresponding FETCHcompleted NextBINARY request. If theclient asks forserver does not know how to decode theBINARYSTRUCTURE: C: 43 fetch 6 binarystructure S: * 6 FETCH (BINARYSTRUCTURE (("TEXT" "PLAIN" ("CHARSET" "us-ascii") "<1070.981005923.1@localhost>" NIL "BINARY" 105 2 NIL NIL NIL) ("APPLICATION" "OCTET-STREAM" NIL "<1070.981005923.2@localhost>" "The latest BSD kernel" "BINARY" 2429327 NIL NIL NIL) "MIXED" ("BOUNDARY" "----- =_aaaaaaaaaa0") NIL NIL)) S: 43 OK FETCH completed Notice thatbody section's CTE, it MUST fail thecontent transfer encoding for both sections has changed to BINARY. The size forrequest and issue a "NO" response with the "UNKNOWN-TRANSFER-ENCODING" extended response code. If thesecond section has changedrange of a partial FETCH does not refer toreflect the conversiona valid sequence ofthe BASE64 encodingoctets forthat section;thesize ofthefirst section (both message size and line count) has not changed, sincebody section's CTE, thetransformation from 7BIT to BINARY did not result in any change toserver MUST fail thedata. For sections with transparent content transfer encodings (7BIT, 8BIT, BINARY), FETCH BODYrequest andFETCH BINARY return identical con- tent: C: 3 fetch 1 body[1] S: * 1 FETCH (BODY[1] {80} S: This isissue atest messageBAD response. 5.4. APPEND Command Extensions The APPEND command is extended touse inallow theexamples sectionclient to append data containing NULs by using the <literal8> syntax. The server MAY mod- ify the CTE of theS: IMAP BINARY RFC. S: ) S: 3 OK Completed C: 4 fetch 1 binary[1] S: * 1 FETCH (BINARY[1] ~{80} S: This is a test message to useappended data, however any such transformation MUST NOT result in a loss of data. If theexamples sectionspecified mailbox does not support the storage of binary content theS: IMAP BINARY RFC. S: ) S: 4 OK Completed Nerenberg draft-nerenberg-imap-binary-02.txt [Page 5] Internet Draft IMAP4 Binary Content Extension February 2001 Here is a misbehaving client trying to retrieveserver MUST reject the APPEND command with abodyNO response that includes theserver cannot decode: C: 1 fetch 13 bodystructure S: * 13 FETCH (BODYSTRUCTURE ("TEXT" "PLAIN" ("CHARSET" "iso-8859-1") NIL NIL "FROBOZZ" 1716 49 NIL NIL NIL)) S: 1 OK Completed C: 2 fetch 13 binarystructure S: * 13 FETCH (BINARYSTRUCTURE ("TEXT" "PLAIN" ("CHARSET" "iso-8859-1") NIL NIL NIL NIL NIL NIL NIL NIL)) S: 2 OK Completed C: 3 fetch 13 binary[1] S: 2 BAD [UNKNOWN-TRANSFER-ENCODING] Unknown content encoding (FROBOZZ) for message 13 section 1"UNKNOWN-TRANSFER-ENCODING" extended response code. 6. Interoperability Considerations Messaging clients and servers have been notoriously lax in theiradheranceadherence to the Internet CRLF convention for terminating lines of textual data in Internet protocols. When sending data using the BINARY extension, servers MUST ensure that textual line-oriented body sections are always transmitted using the IMAP4 CRLF line ter- mination syntax, regardless of the underlying storage representa- tion of the data on the server. While the server is allowed to modify the CTE of APPENDed <lit- eral8> data, this should only be done where absolutely necessary. Gratuitous encoding changes will render useless most cryptographic operations that may have been performed on the message. This extension provides an optimization that is useful in certain specific situations. It does not absolve clients from providing basic functionality (content transfer decoding) that should be available in all messaging clients. Clients supporting this exten- sion SHOULD be prepared to provide their own content transfer decoding of data. Nerenberg draft-nerenberg-imap-binary-03.txt [Page 4] Internet Draft IMAP4 Binary Content Extension May 2001 7. Formal Protocol Syntax The following syntax specification uses the augmented Backus-Naur Form (BNF) notation as used in [IMAP4rev1]. Except as noted otherwise, all alphabetic characters are case- insensitive. The use of upper or lower case characters to define token strings is for editorial clarity only. Implementations MUST accept these strings in a case-insensitive fashion. This syntax extends the grammar specified in [IMAP4rev1]. append =/ "APPEND" SPACE mailbox [SPACE flag_list] [SPACE date_time] SPACE literal8 BINCHAR ::= <0x00 - 0xff> fetch_att =/ "BINARY"[".PEEK"][ (".PEEK" / "SIZE") ] section_binary ["<" number "." nz_number ">"] /"BINARYSTRUCTURE" Nerenberg draft-nerenberg-imap-binary-02.txt [Page 6] Internet Draft IMAP4 Binary Content Extension February 2001literal8 ::= "~{" number "}" CRLF *BINCHAR ;; <number> represents the number of BINCHAR octets ;; in the response string. resp_code_text =/ "UNKNOWN-TRANSFER-ENCODING" section_binary ::= "[" [ (nz_number *["." nz_number] ] "]" 8. References [IMAP4rev1] Crispin, M., "Internet Message Access Protocol Version 4rev1," RFC2060, University of Washington, December 1996 [KEYWORD] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels," BCP 9, RFC2119, March 1997 [MIME-IMB] Freed, N., N. Borenstein, "Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies," RFC2045, November 1996. 9. Security ConsiderationsSending binary data to servers and clients thatThere areexpecting well- formed non-binary input is a method commonly used to attempt to bypass security. The IMAP4 BINARY extension is only initiated at a co-operating client's request, therefore the transmission of binary content as defined in this memo should not affect legacy clients that may be unable to properly copeno known security issues withsuch binary content. A new protocol syntax has been introduced to further distinguish between <literal> and <literal8> data.this extension. Nerenberg draft-nerenberg-imap-binary-03.txt [Page 5] Internet Draft IMAP4 Binary Content Extension May 2001 10. Authors' Address Lyndon NerenbergACI/MessagingDirectACI Worldwide Suite 900 10117 - Jasper Avenue Edmonton, Alberta Canada T5J 1W8 Email: lyndon@messagingdirect.com Nerenbergdraft-nerenberg-imap-binary-02.txtdraft-nerenberg-imap-binary-03.txt [Page7]6] ----