RFC 6464 - A Real-time Transport Protocol (RTP) Header Extension for Client-to-Mixer Audio Level Indication 日本語訳

URL : https://tools.ietf.org/html/rfc6464
タイトル : RFC 6464 - リアルタイムトランスポートプロトコルクライアントとミキサーのオーディオレベル表示について（RTP）ヘッダ拡張
翻訳編集 : 自動生成

Internet Engineering Task Force (IETF)                    J. Lennox, Ed.
Request for Comments: 6464                                         Vidyo
Category: Standards Track                                        E. Ivov
ISSN: 2070-1721                                                    Jitsi
                                                              E. Marocco
                                                          Telecom Italia
                                                           December 2011

       A Real-time Transport Protocol (RTP) Header Extension for
                 Client-to-Mixer Audio Level Indication

Abstract

抽象

This document defines a mechanism by which packets of Real-time Transport Protocol (RTP) audio streams can indicate, in an RTP header extension, the audio level of the audio sample carried in the RTP packet. In large conferences, this can reduce the load on an audio mixer or other middlebox that wants to forward only a few of the loudest audio streams, without requiring it to decode and measure every stream that is received.

この文書では、RTPヘッダ拡張内のリアルタイム転送プロトコル（RTP）音声ストリームのパケットを示すことができるメカニズム、RTPパケットで運ばれた音声サンプルの音声レベルを定義します。大きな会議では、これが受信されるすべてのストリームをデコードし、測定することを必要とせずに、最大音量のオーディオストリームのほんの数を転送したいオーディオミキサーまたは他のミドルボックスの負荷を軽減することができます。

Status of This Memo

このメモのステータス

This is an Internet Standards Track document.

これは、インターネット標準化過程文書です。

This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Further information on Internet Standards is available in Section 2 of RFC 5741.

このドキュメントはインターネットエンジニアリングタスクフォース（IETF）の製品です。これは、IETFコミュニティの総意を表しています。これは、公開レビューを受けており、インターネットエンジニアリング運営グループ（IESG）によって公表のために承認されています。インターネット標準の詳細については、RFC 5741のセクション2で利用可能です。

Information about the current status of this document, any errata, and how to provide feedback on it may be obtained at http://www.rfc-editor.org/info/rfc6464.

このドキュメントの現在の状態、任意の正誤表、そしてどのようにフィードバックを提供するための情報がhttp://www.rfc-editor.org/info/rfc6464で取得することができます。

著作権表示

著作権（C）2011 IETF信託とドキュメントの作成者として特定の人物。全著作権所有。

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.

この文書では、BCP 78と、この文書の発行日に有効なIETFドキュメント（http://trustee.ietf.org/license-info）に関連IETFトラストの法律の規定に従うものとします。彼らは、この文書に関してあなたの権利と制限を説明するように、慎重にこれらの文書を確認してください。コードコンポーネントは、トラスト法規定のセクションで説明4.eおよび簡体BSDライセンスで説明したように、保証なしで提供されているよう簡体BSDライセンスのテキストを含める必要があり、この文書から抽出されました。

Table of Contents

   1. Introduction ....................................................2
   2. Terminology .....................................................3
   3. Audio Levels ....................................................3
   4. Signaling (Setup) Information ...................................5
   5. Considerations on Use ...........................................6
   6. Security Considerations .........................................6
   7. IANA Considerations .............................................7
   8. References ......................................................7
      8.1. Normative References .......................................7
      8.2. Informative References .....................................8

1. Introduction

1. はじめに

In a centralized Real-time Transport Protocol (RTP) [RFC3550] audio conference, an audio mixer or forwarder receives audio streams from many or all of the conference participants. It then selectively forwards some of them to other participants in the conference. In large conferences, it is possible that such a server might be receiving a large number of streams, of which only a few are intended to be forwarded to the other conference participants.

集中リアルタイムトランスポートプロトコル（RTP）[RFC3550]音声会議において、オーディオミキサーまたはフォワーダは、会議参加者の多くまたは全てからオーディオストリームを受信します。その後、選択的に会議の他の参加者にそれらのいくつかを転送します。大規模な会議では、そのようなサーバは、わずか数は、他の会議参加者に転送されることを意図しているのストリームを多数、受信されるかもしれないということも可能です。

In such a scenario, in order to pick the audio streams to forward, a centralized server needs to decode, measure audio levels, and possibly perform voice activity detection on audio data from a large number of streams. The need for such processing limits the size or number of conferences such a server can support.

そのようなシナリオでは、転送するオーディオストリームを選択するために、中央サーバは、復号オーディオレベルを測定し、そしておそらくストリームの多数からの音声データに音声アクティビティ検出を実行する必要があります。このような処理の必要性は、サーバーがサポートできる会議の大きさや数を制限します。

As an alternative, this document defines an RTP header extension [RFC5285] through which senders of audio packets can indicate the audio level of the packets' payload, reducing the processing load for a server.

代替として、この文書は、オーディオパケットの送信者は、サーバの処理負荷を軽減、パケットのペイロードの音声レベルを示すことができ、それを通してRTPヘッダ拡張[RFC5285]を定義します。

The header extension in this document is different than, but complementary with, the one defined in [RFC6465], which defines a mechanism by which audio mixers can indicate to clients the levels of the contributing sources that made up the mixed audio.

この文書のヘッダ拡張は、オーディオミキサーは、クライアントにミックスされたオーディオを構成した寄与ソースのレベルを示すことができるメカニズムを定義[RFC6465]で定義され、異なる、しかしと相補的です。

2. Terminology

2.用語

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 [RFC2119] and indicate requirement levels for compliant implementations.

この文書のキーワード "MUST"、 "MUST NOT"、 "REQUIRED"、、、、 "べきではない" "べきである" "ないもの" "ものとし"、 "推奨"、 "MAY"、および "OPTIONAL" はありますRFC 2119 [RFC2119]に記載され、対応する実装の要求レベルを示すものと解釈されます。

3. Audio Levels

3.オーディオレベル

The audio level header extension carries the level of the audio in the RTP [RFC3550] payload of the packet with which it is associated. This information is carried in an RTP header extension element as defined by "A General Mechanism for RTP Header Extensions" [RFC5285].

音声レベルのヘッダの拡張は、それが関連付けられているパケットのRTP [RFC3550]ペイロードにオーディオのレベルを運びます。「RTPヘッダ拡張のための一般的なメカニズム」[RFC5285]で定義されるように、この情報は、RTPヘッダ拡張要素で運ばれます。

The payload of the audio level header extension element can be encoded using either the one-byte or two-byte header defined in [RFC5285]. Figures 1 and 2 show sample audio level encodings with each of these header formats.

音声レベルのヘッダ拡張要素のペイロードは1バイトまたは[RFC5285]で定義された2バイトのヘッダのいずれかを使用して符号化することができます。これらのヘッダ・フォーマットのそれぞれに1と2に示すサンプルのオーディオレベルの符号化を図。

                    0                   1
                    0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5
                   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
                   |  ID   | len=0 |V| level       |
                   +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

Figure 1: Sample Audio Level Encoding Using the One-Byte Header Format

図1：半角ヘッダー形式を使用したサンプルオーディオレベルのエンコード

      0                   1                   2                   3
      0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     |      ID       |     len=1     |V|    level    |    0 (pad)    |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

              Figure 2: Sample Audio Level Encoding Using the
                          Two-Byte Header Format

Note that, as indicated in [RFC5285], the length field in the one-byte header format takes the value 0 to indicate that 1 byte follows. In the two-byte header format, on the other hand, the length field takes the value of 1.

[RFC5285]に示されるように、1バイトのヘッダフォーマットの長さフィールドは1つのバイトが続くことを示すために、値0をとることに留意されたいです。 2バイトのヘッダフォーマットで、一方、長さフィールドは1の値をとります。

The magnitude of the audio level itself is packed into the seven least significant bits of the single byte of the header extension, shown in Figures 1 and 2. The least significant bit of the audio level magnitude is packed into the least significant bit of the byte. The most significant bit of the byte is used as a separate flag bit "V", defined below.

オーディオレベル自体の大きさは、音声レベルの大きさの最下位ビットはバイトの最下位ビットにパックされたヘッダの図1に示されている拡張、及び2の単一バイトの7つの最下位ビットにパックされています。バイトの最上位ビットは、以下に定義される別個のフラグビット「V」として使用されます。

The audio level is expressed in -dBov, with values from 0 to 127 representing 0 to -127 dBov. dBov is the level, in decibels, relative to the overload point of the system, i.e., the highest-intensity signal encodable by the payload format. (Note: Representation relative to the overload point of a system is particularly useful for digital implementations, since one does not need to know the relative calibration of the analog circuitry.) For example, in the case of u-law (audio/pcmu) audio [ITU.G711], the 0 dBov reference would be a square wave with values +/- 8031. (This translates to 6.18 dBm0, relative to u-law's dBm0 definition in Table 6 of [ITU.G711].)

オーディオレベルは-127 dBovに0を表す0から127までの値で、-dBovで発現されます。 dBovは、デシベル単位で、ペイロード形式により、すなわち、最高強度信号の符号化可能システムの過負荷点に対するレベルです。（注：一方はアナログ回路の相対的なキャリブレーションを知る必要がないので、システムの過負荷ポイントに表現に対しては、デジタル実装するために特に有用である。）、例えば、U字法の場合（オーディオ/ PCMU）にオーディオ[ITU.G711]、0 dBov基準は、値+/- 8031の方形波であろう（これは[ITU.G711]の表6のU-法則のdBm0で定義に対して6.18 dBm0で、に変換されます。）

The audio level for digital silence -- for a muted audio source, for example -- MUST be represented as 127 (-127 dBov), regardless of the dynamic range of the encoded audio format.

デジタル無音のオーディオレベル - ミュートオーディオソースの、例えば - かかわらず、エンコードされたオーディオフォーマットのダイナミックレンジの、127（-127 dBov）として表現されなければなりません。

The audio level header extension only carries the level of the audio in the RTP payload of the packet with which it is associated, with no long-term averaging or smoothing applied. For payload formats that contain extra error-correction bits or loss-concealment information, the level corresponds only to the data that would result from the payload's normal decoding process, not what it would produce under error or packet loss concealment. The level is measured as a root mean square of all the samples in the audio encoded by the packet.

音声レベルのヘッダの拡張のみ印加されていない長期平均化または平滑化と、それが関連付けられているパケットのRTPペイロード内のオーディオのレベルを、運びます。エキストラエラー訂正ビット又は損失隠蔽情報を含むペイロード形式のために、レベルは、それがエラー又はパケット損失隠蔽の下で生じるであろうもの、ペイロードだけの通常の復号化処理から生じるデータに対応しません。レベルパケットによって符号化されたオーディオ内の全てのサンプルの二乗平均平方根として測定されます。

To simplify implementation of the encoding procedures described here, Appendix A of [RFC6465] provides a sample Java implementation of an audio level calculator that helps obtain such values from raw linear Pulse Code Modulation (PCM) audio samples.

ここに記載の符号化手順の実装を簡素化するために、[RFC6465]の付録Aは、生線形パルス符号変調（PCM）オーディオサンプルからそのような値を得るのに役立つ音声レベル算出のサンプルのJava実装を提供します。

In addition, a flag bit (labeled "V") optionally indicates whether the encoder believes the audio packet contains voice activity. If the V bit is in use, the value 1 indicates that the encoder believes the audio packet contains voice activity, and the value 0 indicates that the encoder believes it does not. (The voice activity detection algorithm is unspecified and left implementation-specific.) If the V bit is not in use, its value is unspecified and MUST be ignored by receivers. The use of the V bit is signaled using the extension attribute "vad", discussed in Section 4.

また、フラグビット（標識された「V」）は、必要に応じて、エンコーダは、オーディオパケットが音声アクティビティが含まれていると考えているかどうかを示します。 Vビットが使用されている場合は、値1は、エンコーダは、音声パケットは、音声アクティビティが含まれていると考えていることを示し、値0は、エンコーダが、それはないと考えていることを示しています。（音声アクティビティ検出アルゴリズムが不特定と左実装固有である。）Vビットが使用されていない場合、その値は不定であり、受信機によって無視されなければなりません。 Vビットの使用は、セクション4で説明した拡張属性「VAD」を使用してシグナリングされます。

When this header extension is used with RTP data sent using the RTP Payload for Redundant Audio Data [RFC2198], the header's data describes the contents of the primary encoding.

このヘッダー拡張冗長音声データのRTPペイロード[RFC2198]を使用して送信されたRTPデータと共に使用される場合、ヘッダーのデータは、一次符号化の内容を記述する。

Note: This audio level is defined in the same manner as is audio noise level in the RTP Payload Comfort Noise specification [RFC3389]. In [RFC3389], the overall magnitude of the noise level in comfort noise is encoded into the first byte of the payload, with spectral information about the noise in subsequent bytes. This specification's audio level parameter is defined so as to be identical to the comfort noise payload's noise-level byte.

注：RTPペイロードコンフォートノイズ仕様[RFC3389]にオーディオノイズレベルがあるように、この音声レベルが同様に定義されます。 [RFC3389]に、快適ノイズのノイズレベルの全体的な大きさは、後続のバイトのノイズに関するスペクトル情報と、ペイロードの最初のバイトに符号化されます。コンフォートノイズのペイロードのノイズレベルのバイトに一致するように、この仕様のオーディオレベルのパラメータが定義されます。

4. Signaling (Setup) Information

4.シグナリング（セットアップ）情報

The URI for declaring this header extension in an extmap attribute is "urn:ietf:params:rtp-hdrext:ssrc-audio-level".

extmap属性にこのヘッダー拡張を宣言するためのURIは、 "IETF：paramsは：RTP-hdrext：SSRC音声レベルURN" です。

It has a single extension attribute, named "vad". It takes the form "vad=on" or "vad=off". If the header extension element is signaled with "vad=on", the V bit described in Section 3 is in use, and MUST be set by senders. If the header extension element is signaled with "vad=off", the V bit is not in use, and its value MUST be ignored by receivers. If the vad extension attribute is not specified, the default is "vad=on".

それは「VAD」という名前の単一の拡張属性を持っています。これは、または「VAD =オフ」「に= VAD」の形をとります。ヘッダ拡張要素が「オンVAD =」と通知された場合は、第3節に記載のVビットが使用され、そして送信者によって設定されなければなりません。ヘッダ拡張要素は、「VAD = OFF」と通知された場合、Vビットが使用されていない、その値は、受信機によって無視されなければなりません。 VAD拡張属性が指定されていない場合、デフォルトは「VAD =オン」です。

An example attribute line in the Session Description Protocol (SDP) for a conference might hence be:

会議のセッション記述プロトコル（SDP）の例属性ラインは、したがって次のようになります。

a=extmap:6 urn:ietf:params:rtp-hdrext:ssrc-audio-level vad=on

= extmap：6 URN：IETF：paramsは：RTP-hdrext：=でSSRC音声レベルVAD

The vad extension attribute only controls the semantics of this header extension attribute, and does not make any statement about whether the sender is using any other voice activity detection features, such as discontinuous transmission, comfort noise, or silence suppression.

VAD拡張属性は、このヘッダ拡張属性のセマンティクスを制御し、送信者が、このような不連続送信、快適雑音、または無音抑止などの他の音声アクティビティ検出機能を、使用しているかどうかについての声明を作成しません。

Using the mechanisms of [RFC5285], an endpoint MAY signal multiple instances of the header extension element, with different values of the vad attribute, so long as these instances use different values for the extension identifier. However, again following the rules of [RFC5285], the semantics chosen for a header extension element (including its vad setting) for a particular extension identifier value MUST NOT be changed within an RTP session.

[RFC5285]のメカニズムを使用して、エンドポイントがあれば、これらのインスタンスは拡張識別子に対して異なる値を使用するように、VADの属性の異なる値で、ヘッダ拡張要素の複数のインスタンスをシグナリングすることができます。しかし、もう一度[RFC5285]のルールに従って、特定の拡張識別子の値のために（そのVAD設定を含む）ヘッダ拡張要素のために選択されたセマンティクスはRTPセッション内で変更してはなりません。

5. Considerations on Use

使用上の注意事項5.

Mixers and forwarders generally ought not base audio forwarding decisions directly on packet-by-packet audio level information, but rather ought to apply some analysis of the audio levels and trends. This general rule applies whether audio levels are provided by endpoints (as defined in this document), or are calculated at a server, as would be done in the absence of this information. This section discusses several issues that mixers and forwarders may wish to take into account. (Note that this section provides design guidance only, and is not normative.)

ミキサーおよびフォワーダは、一般に、パケット・バイ・パケットの音声レベル情報に直接オーディオ転送決定の基礎ではなく、オーディオレベル及び傾向のいくつかの分析を適用するべきではないはずです。この情報の非存在下で行われるであろうように、この一般的なルールは、（本書で定義されるように）オーディオレベルをエンドポイントによって提供されているかどうかを適用する、またはサーバで計算されます。このセクションでは、ミキサーやフォワーダーに考慮することを望むかもしれないいくつかの問題について説明します。（この項は、設計ガイダンスを提供し、規範的ではないことに注意してください。）

First of all, audio levels generally ought to be measured over longer intervals than that of a single audio packet. In order to avoid false-positives for short bursts of sound (such as a cough or a dropped microphone), it is often useful to require that a participant's audio level be maintained for some period of time before considering it to be "real"; i.e., some type of low-pass filter ought to be applied to the audio levels. Note, though, that such filtering must be balanced with the need to avoid clipping of the beginning of a speaker's speech.

まず、オーディオレベルは、一般的に、単一の音声パケットよりも長い間隔で測定されるべきです。（そのような咳や落下マイクなど）の音の短いバーストのための偽陽性を避けるために、「本物」であることを考える前に、参加者の音声レベルがある程度の期間にわたって維持することを要求することが有用であることが多いです。すなわち、ローパスフィルタのいくつかのタイプは、オーディオレベルに適用されるべきです。このようなフィルタリングは、話者の音声の始まりのクリッピングを回避する必要性とバランスをとらなければならないこと、しかし、注意してください。

Additionally, different participants may have their audio input set differently. It may be useful to apply some sort of automatic gain control to the audio levels. There are a number of possible approaches to achieving this, e.g., by measuring peak audio levels, by average audio levels during speech, or by measuring background audio levels (average audio levels during non-speech).

また、別の参加者は、音声入力が異なって設定した可能性があります。オーディオレベルに自動利得制御のいくつかの並べ替えを適用することが有用であり得ます。発話中の平均音声レベルによるピークオーディオレベルを測定することによって、例えば、これを達成するための可能なアプローチの数、または（非発話中の平均音声レベル）バックグラウンドオーディオレベルを測定することです。

6. Security Considerations

6.セキュリティの考慮事項

A malicious endpoint could choose to set the values in this header extension falsely, so as to falsely claim that audio or voice is or is not present. It is not clear what could be gained by falsely claiming that audio is not present, but an endpoint falsely claiming that audio is present, or falsely exaggerating its reported levels, could perform a denial-of-service attack on an audio conference, so as to send silence to suppress other conference members' audio, or could dominate a conference by seizing its speaker-selection algorithm. Thus, if a device relies on audio level data from untrusted endpoints, it SHOULD periodically audit the level information transmitted, taking appropriate corrective action against endpoints that appear to be sending incorrect data. (However, as it is valid for an endpoint to choose to measure audio levels prior to encoding, some degree of discrepancy could be present. This would not indicate that an endpoint is malicious.)

誤ってオーディオ又は音声であるか、または存在しないことを主張するように悪意のあるエンドポイントは、誤ってこのヘッダー拡張の値を設定することを選択する可能性があります。ように本ではなく、誤ってその音声を主張するエンドポイントが存在している、または虚偽の報告されたレベルを誇張、音声会議にDoS攻撃を行う可能性があり、誤ってその音声を主張することによって得られることができるものは明らかではありません他の会議メンバーのオーディオを抑制するために沈黙を送信するために、またはその話者選択アルゴリズムを押収して会議を支配できました。デバイスは、信頼されていないエンドポイントからの音声レベルのデータに依存している場合したがって、それは定期的に誤ったデータを送信するように見えるエンドポイントに対して適切な是正措置を取って、送信レベル情報を監査すべきです。（ただし、それが有効であるとして、エンドポイントは、不一致のある程度は存在し得る符号化の前に、オーディオレベルを測定するために選択するために。これは、エンドポイントが悪意のあるであることを示しません。）

In the Secure Real-time Transport Protocol (SRTP) [RFC3711], RTP header extensions are authenticated but not encrypted. When this header extension is used, audio levels are therefore visible on a packet-by-packet basis to an attacker passively observing the audio stream. As discussed in [SRTP-VBR-AUDIO], such an attacker might be able to infer information about the conversation, possibly with phoneme-level resolution. In scenarios where this is a concern, additional mechanisms MUST be used to protect the confidentiality of the header extension. This mechanism could be header extension encryption [SRTP-ENCR-HDR], or a lower-level security and authentication mechanism such as IPsec [RFC4301].

セキュアリアルタイム転送プロトコル（SRTP）[RFC3711]では、RTPヘッダの拡張機能は、認証されますが、暗号化されていません。このヘッダー拡張を使用する場合、オーディオレベルは、したがって、攻撃者が受動的にオーディオストリームを観察するパケットごとに表示されています。 [SRTP-VBR-AUDIO]で議論するように、そのような攻撃者は、おそらく、音素レベルの分解能で、会話についての情報を推測することができるかもしれません。これが懸念されるシナリオでは、追加の機構は、ヘッダ拡張の機密性を保護するために使用されなければなりません。この機構は、ヘッダ拡張暗号化[SRTP-ENCR-HDR]、またはIPsecなどの低レベルのセキュリティと認証メカニズム[RFC4301]とすることができます。

7. IANA Considerations

7. IANAの考慮事項

This document defines a new extension URI in the RTP Compact Header Extensions subregistry of the Real-Time Transport Protocol (RTP) Parameters registry, according to the following data:

このドキュメントでは、次のデータによると、リアルタイムトランスポートプロトコル（RTP）パラメータレジストリのRTPコンパクトヘッダー拡張副登録の新しい拡張URIを定義します。

Extension URI: urn:ietf:params:rtp-hdrext:ssrc-audio-level Description: Audio Level Contact: jonathan@vidyo.com Reference: RFC 6464

拡張URI：URN：IETF：のparams：RTP-hdrext：SSRC-オーディオレベルの説明：オーディオレベルお問い合わせ：jonathan@vidyo.com参考：RFC 6464

8. References

8.参照文献

8.1. Normative References

8.1. 引用規格

[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997.

[RFC2119]ブラドナーの、S.、 "要件レベルを示すためにRFCsにおける使用のためのキーワード"、BCP 14、RFC 2119、1997年3月。

[RFC2198] Perkins, C., Kouvelas, I., Hodson, O., Hardman, V., Handley, M., Bolot, J., Vega-Garcia, A., and S. Fosse-Parisis, "RTP Payload for Redundant Audio Data", RFC 2198, September 1997.

[RFC2198]パーキンス、C.、Kouvelas、I.、ホドソン、O.、ハードマン、V.、ハンドレー、M.、Bolot、J.、ベガ・ガルシア、A.、およびS.フォッシー-Parisis、「RTPペイロード冗長オーディオ・データ」、RFC 2198、1997年9月のため。

[RFC3550] Schulzrinne, H., Casner, S., Frederick, R., and V. Jacobson, "RTP: A Transport Protocol for Real-Time Applications", STD 64, RFC 3550, July 2003.

[RFC3550] Schulzrinneと、H.、Casner、S.、フレデリック、R.、およびV.ヤコブソン、 "RTP：リアルタイムアプリケーションのためのトランスポートプロトコル"、STD 64、RFC 3550、2003年7月。

[RFC5285] Singer, D. and H. Desineni, "A General Mechanism for RTP Header Extensions", RFC 5285, July 2008.

[RFC5285]歌手、D.およびH. Desineni、 "RTPヘッダー拡張のための一般的なメカニズム"、RFC 5285、2008年7月。

8.2. Informative References

8.2. 参考文献

[ITU.G711] International Telecommunication Union, "Pulse Code Modulation (PCM) of Voice Frequencies", ITU-T Recommendation G.711, November 1988.

[ITU.G711]国際電気通信連合、ITU-T勧告G.711、1988年11月 "音声周波数の符号変調（PCM）をパルス"。

[RFC3389] Zopf, R., "Real-time Transport Protocol (RTP) Payload for Comfort Noise (CN)", RFC 3389, September 2002.

[RFC3389] Zopf、R.、RFC 3389、2002年9月 "コンフォートノイズ（CN）のためのリアルタイムトランスポートプロトコル（RTP）ペイロード"。

[RFC3711] Baugher, M., McGrew, D., Naslund, M., Carrara, E., and K. Norrman, "The Secure Real-time Transport Protocol (SRTP)", RFC 3711, March 2004.

[RFC3711] Baugher、M.、マグリュー、D.、Naslund、M.、カララ、E.、およびK. Norrman、 "セキュアリアルタイム転送プロトコル（SRTP）"、RFC 3711、2004年3月。

[RFC4301] Kent, S. and K. Seo, "Security Architecture for the Internet Protocol", RFC 4301, December 2005.

[RFC4301]ケント、S.とK. Seo、 "インターネットプロトコルのためのセキュリティアーキテクチャ"、RFC 4301、2005年12月。

[RFC6465] Ivov, E., Ed., Marocco, E., Ed., and J. Lennox, "A Real-time Transport Protocol (RTP) Header Extension for Mixer-to-Client Audio Level Indication", RFC 6465, December 2011.

[RFC6465] Ivov、E.、エド。、Marocco、E.、エド。、およびJ.レノックス、 "リアルタイム転送プロトコルミキサーからクライアントへのオーディオレベル表示について（RTP）ヘッダ拡張"、RFC 6465、 2011年12月。

[SRTP-ENCR-HDR] Lennox, J., "Encryption of Header Extensions in the Secure Real-Time Transport Protocol (SRTP)", Work in Progress, October 2011.

[SRTP-ENCR-HDR]レノックス、J.、進歩、2011年10月で、作業 "セキュアリアルタイムトランスポートプロトコル（SRTP）のヘッダの拡張の暗号化"。

[SRTP-VBR-AUDIO] Perkins, C. and JM. Valin, "Guidelines for the use of Variable Bit Rate Audio with Secure RTP", Work in Progress, July 2011.

[SRTP-VBR-AUDIO]パーキンス、C.およびJM。 Valinは、「セキュアRTPと可変ビットレート・オーディオの使用のためのガイドライン」、進歩、2011年7月での作業。

Authors' Addresses

著者のアドレス

Jonathan Lennox (editor) Vidyo, Inc. 433 Hackensack Avenue Seventh Floor Hackensack, NJ 07601 US

ジョナサン・レノックス（編集者）にVidyo社433ハッケンサックアベニュー7階ハッケンサック、NJ 07601米国

EMail: jonathan@vidyo.com

メールアドレス：jonathan@vidyo.com

Emil Ivov Jitsi Strasbourg 67000 France

エミールIvov Jitsiストラスブールフランス

EMail: emcho@jitsi.org

メールアドレス：emcho@jitsi.org

Enrico Marocco Telecom Italia Via G. Reiss Romoli, 274 Turin 10148 Italy

エンリコ・モロッコテレコムイタリアVia G.ライスRomoli、274 10148トリノイタリア

EMail: enrico.marocco@telecomitalia.it

メールアドレス：enrico.marocco@telecomitalia.it