Decoder.Convert 方法
定義
重要
部分資訊涉及發行前產品,在發行之前可能會有大幅修改。 Microsoft 對此處提供的資訊,不做任何明確或隱含的瑕疵擔保。
將編碼的位元組序列轉換為字串或字元陣列。
多載
| 名稱 | Description |
|---|---|
| Convert(ReadOnlySpan<Byte>, Span<Char>, Boolean, Int32, Int32, Boolean) |
將一段編碼的位元組轉換成 UTF-16 編碼字元,並將結果儲存在另一個區間緩衝區。 |
| Convert(Byte*, Int32, Char*, Int32, Boolean, Int32, Int32, Boolean) |
將編碼位元組的緩衝區轉換為 UTF-16 編碼字元,並將結果儲存在另一個緩衝區中。 |
| Convert(Byte[], Int32, Int32, Char[], Int32, Int32, Boolean, Int32, Int32, Boolean) |
將編碼的位元組陣列轉換為 UTF-16 編碼字元,並將結果儲存在字元陣列中。 |
備註
請記住, Decoder 物件會在呼叫 之間 Convert儲存狀態。
當應用程式處理一串資料時,應設定 flush 參數為 以確保 true 狀態資訊被清除。 使用此設定,解碼器會忽略資料區塊末端的無效位元組,並清除內部緩衝區。
任何剩餘的處理資料,屬於邏輯單元,例如代理對的高階代理,都會依照當前的備援設定轉換。
此 Convert 方法設計用於迴圈中解碼任意數量的輸入,例如從檔案或串流讀取的資料。
它將解碼操作的輸出儲存在固定大小的緩衝區中。
GetChars 如果輸出緩衝區不夠大,會拋出例外,但 Convert 會盡可能填滿空間並回傳讀取和寫入的位元組,前提是輸出陣列至少允許兩個字元。
還有更多留言可以參考 Encoding.GetChars 。
Convert(ReadOnlySpan<Byte>, Span<Char>, Boolean, Int32, Int32, Boolean)
將一段編碼的位元組轉換成 UTF-16 編碼字元,並將結果儲存在另一個區間緩衝區。
public:
virtual void Convert(ReadOnlySpan<System::Byte> bytes, Span<char> chars, bool flush, [Runtime::InteropServices::Out] int % bytesUsed, [Runtime::InteropServices::Out] int % charsUsed, [Runtime::InteropServices::Out] bool % completed);
public virtual void Convert(ReadOnlySpan<byte> bytes, Span<char> chars, bool flush, out int bytesUsed, out int charsUsed, out bool completed);
abstract member Convert : ReadOnlySpan<byte> * Span<char> * bool * int * int * bool -> unit
override this.Convert : ReadOnlySpan<byte> * Span<char> * bool * int * int * bool -> unit
Public Overridable Sub Convert (bytes As ReadOnlySpan(Of Byte), chars As Span(Of Char), flush As Boolean, ByRef bytesUsed As Integer, ByRef charsUsed As Integer, ByRef completed As Boolean)
參數
- bytes
- ReadOnlySpan<Byte>
一個只讀位元組,包含要轉換的序列。
- flush
- Boolean
true表示不再轉換資料;否則,。 false
- bytesUsed
- Int32
當此方法回傳時,包含由轉換產生的位元組數。 這個參數會未初始化傳遞。
- charsUsed
- Int32
當此方法回傳時,包含用於轉換的字元 chars 數。 這個參數會未初始化傳遞。
- completed
- Boolean
當此方法回傳時,若所有指定字元皆已轉換,則包含true;否則,。 false 這個參數會未初始化傳遞。
備註
completed輸出參數表示輸入位元組區間中的所有資料是否都已轉換並儲存在字元區間。 此參數設 false 為:當輸入位元組區間內的位元組數無法在不超過字元數的情況下轉換。 在這種情況下,應用程式應使用輸出緩衝區的內容或提供新的輸出緩衝區,將參數依照參數指定的bytesUsed位元組數遞增bytes,然後再次呼叫該Convert方法處理剩餘的輸入。
completed即使參數與位元組的長度相等,參數也可以設定為 false。bytesUsed 當物件中 Decoder 仍有未儲存在區 bytes 間的資料時,就會發生這種情況。
適用於
Convert(Byte*, Int32, Char*, Int32, Boolean, Int32, Int32, Boolean)
重要
此 API 不符合 CLS 規範。
將編碼位元組的緩衝區轉換為 UTF-16 編碼字元,並將結果儲存在另一個緩衝區中。
public:
virtual void Convert(System::Byte* bytes, int byteCount, char* chars, int charCount, bool flush, [Runtime::InteropServices::Out] int % bytesUsed, [Runtime::InteropServices::Out] int % charsUsed, [Runtime::InteropServices::Out] bool % completed);
[System.CLSCompliant(false)]
[System.Runtime.InteropServices.ComVisible(false)]
public virtual void Convert(byte* bytes, int byteCount, char* chars, int charCount, bool flush, out int bytesUsed, out int charsUsed, out bool completed);
[System.CLSCompliant(false)]
[System.Runtime.InteropServices.ComVisible(false)]
[System.Security.SecurityCritical]
public virtual void Convert(byte* bytes, int byteCount, char* chars, int charCount, bool flush, out int bytesUsed, out int charsUsed, out bool completed);
[System.CLSCompliant(false)]
public virtual void Convert(byte* bytes, int byteCount, char* chars, int charCount, bool flush, out int bytesUsed, out int charsUsed, out bool completed);
[<System.CLSCompliant(false)>]
[<System.Runtime.InteropServices.ComVisible(false)>]
abstract member Convert : nativeptr<byte> * int * nativeptr<char> * int * bool * int * int * bool -> unit
override this.Convert : nativeptr<byte> * int * nativeptr<char> * int * bool * int * int * bool -> unit
[<System.CLSCompliant(false)>]
[<System.Runtime.InteropServices.ComVisible(false)>]
[<System.Security.SecurityCritical>]
abstract member Convert : nativeptr<byte> * int * nativeptr<char> * int * bool * int * int * bool -> unit
override this.Convert : nativeptr<byte> * int * nativeptr<char> * int * bool * int * int * bool -> unit
[<System.CLSCompliant(false)>]
abstract member Convert : nativeptr<byte> * int * nativeptr<char> * int * bool * int * int * bool -> unit
override this.Convert : nativeptr<byte> * int * nativeptr<char> * int * bool * int * int * bool -> unit
參數
- bytes
- Byte*
包含要轉換的位元組序列的緩衝區位址。
- byteCount
- Int32
要轉換的位元組 bytes 數。
- chars
- Char*
緩衝區的位址,用來儲存轉換後的字元。
- charCount
- Int32
轉換時可使用的最大字 chars 元數。
- flush
- Boolean
true表示不再轉換資料;否則,。 false
- bytesUsed
- Int32
當此方法回傳時,包含由轉換產生的位元組數。 這個參數會未初始化傳遞。
- charsUsed
- Int32
當此方法回傳時,包含用於轉換的字元 chars 數。 這個參數會未初始化傳遞。
- completed
- Boolean
當此方法回傳時,若所有指定byteCount字元皆已轉換,則包含true;否則,。 false 這個參數會未初始化傳遞。
- 屬性
例外狀況
chars 或 bytes 是 null (Nothing)。
charCount 或 byteCount 小於零。
輸出緩衝區太小,無法容納任何轉換後的輸入。 輸出緩衝區應至少有 2 字元大小,以容納至少一對替代字元。
備註
completed輸出參數表示輸入緩衝區中的所有資料是否已轉換並儲存在輸出緩衝區。 若參數指定的位元組byteCount數無法在不超過該參數指定charCount字元數的情況下轉換,則設定為 。false 在這種情況下,應用程式應使用輸出緩衝區的內容或提供新的輸出緩衝區,將參數依照參數指定的bytesUsed位元組數遞增bytes,然後再次呼叫該Convert方法處理剩餘的輸入。
即使參數byteCount相等,參數也可以設為 false。bytesUsedcompleted 這種情況發生在物件中 Decoder 仍有未儲存在 bytes 緩衝區的資料時。
適用於
Convert(Byte[], Int32, Int32, Char[], Int32, Int32, Boolean, Int32, Int32, Boolean)
將編碼的位元組陣列轉換為 UTF-16 編碼字元,並將結果儲存在字元陣列中。
public:
virtual void Convert(cli::array <System::Byte> ^ bytes, int byteIndex, int byteCount, cli::array <char> ^ chars, int charIndex, int charCount, bool flush, [Runtime::InteropServices::Out] int % bytesUsed, [Runtime::InteropServices::Out] int % charsUsed, [Runtime::InteropServices::Out] bool % completed);
public virtual void Convert(byte[] bytes, int byteIndex, int byteCount, char[] chars, int charIndex, int charCount, bool flush, out int bytesUsed, out int charsUsed, out bool completed);
[System.Runtime.InteropServices.ComVisible(false)]
public virtual void Convert(byte[] bytes, int byteIndex, int byteCount, char[] chars, int charIndex, int charCount, bool flush, out int bytesUsed, out int charsUsed, out bool completed);
abstract member Convert : byte[] * int * int * char[] * int * int * bool * int * int * bool -> unit
override this.Convert : byte[] * int * int * char[] * int * int * bool * int * int * bool -> unit
[<System.Runtime.InteropServices.ComVisible(false)>]
abstract member Convert : byte[] * int * int * char[] * int * int * bool * int * int * bool -> unit
override this.Convert : byte[] * int * int * char[] * int * int * bool * int * int * bool -> unit
Public Overridable Sub Convert (bytes As Byte(), byteIndex As Integer, byteCount As Integer, chars As Char(), charIndex As Integer, charCount As Integer, flush As Boolean, ByRef bytesUsed As Integer, ByRef charsUsed As Integer, ByRef completed As Boolean)
參數
- bytes
- Byte[]
一個位元組陣列來轉換。
- byteIndex
- Int32
轉換的第一個要素 bytes 。
- byteCount
- Int32
需要轉換的 bytes 元素數量。
- chars
- Char[]
一個用來儲存轉換後字元的陣列。
- charIndex
- Int32
資料儲存的第一個元素 chars 。
- charCount
- Int32
轉換時可使用的最大元素 chars 數量。
- flush
- Boolean
true以表示不再需要轉換資料;否則,。 false
- bytesUsed
- Int32
當此方法回傳時,包含了用於轉換的位元組數。 這個參數會未初始化傳遞。
- charsUsed
- Int32
當此方法回傳時,包含了轉換所產生的字元 chars 數。 這個參數會未初始化傳遞。
- completed
- Boolean
當此方法回傳時,若所有指定byteCount字元皆已轉換,則包含true;否則,。 false 這個參數會未初始化傳遞。
- 屬性
例外狀況
chars 或 bytes 是 null (Nothing)。
charIndex, charCount, byteIndex, , byteCount 或 小於零。
-或-
的chars - charIndex長度小於 。charCount
-或-
的bytes - byteIndex長度小於 。byteCount
輸出緩衝區太小,無法容納任何轉換後的輸入。 輸出緩衝區應至少有 2 字元大小,以容納至少一對替代字元。
範例
以下範例使用 Convert 將 UTF-16 字元檔案轉換為 UTF-8 的方法。 接著會用此 Convert 方法將 UTF-8 字元轉換回 UTF-16 字元。
// This code example demonstrates the Encoder.Convert() and Decoder.Convert methods.
// This example uses files for input and output, but any source that can be expressed
// as a stream can be used instead.
using System;
using System.Text;
using System.IO;
public class Sample
{
static void Main(string[] args)
{
// Create a large file of UTF-16 encoded Unicode characters. The file is named Example.txt,
// and is used as input to the Encoder.Convert() method.
CreateTestFile("Example.txt");
// Using an input file of UTF-16 encoded characters named Example.txt, create an output file
// of UTF-8 encoded bytes named UTF8.txt.
EncoderConvert("Example.txt", "UTF8.txt", Encoding.UTF8);
// Using an input file of UTF-8 encoded bytes named UTF8.txt, create an output file
// of UTF-16 encoded characters named UTF16.txt.
DecoderConvert("UTF8.txt", "UTF16.txt", Encoding.UTF8);
}
// --------------------------------------------------------------------------------------------
// Use the Encoder.Convert() method to convert a file of characters to a file of encoded bytes.
// --------------------------------------------------------------------------------------------
static void EncoderConvert(String inputFileName, String outputFileName, Encoding enc)
{
// Convert an input file of characters to an output file of encoded bytes.
// StreamWriter could convert the input file for us, but we'll perform the conversion
// ourselves.
FileStream fs = new FileStream(outputFileName, FileMode.Create);
BinaryWriter outputFile = new BinaryWriter(fs);
// StreamReader will detect Unicode encoding from the Byte Order Mark that heads the input file.
StreamReader inputFile = new StreamReader(inputFileName);
// Get an Encoder.
Encoder encoder = enc.GetEncoder();
// Guarantee the output buffer large enough to convert a few characters.
int UseBufferSize = 64;
if (UseBufferSize < enc.GetMaxByteCount(10))
UseBufferSize = enc.GetMaxByteCount(10);
byte[] bytes = new byte[UseBufferSize];
// Intentionally make the input character buffer larger than the output byte buffer so the
// conversion loop executes more than one cycle.
char[] chars = new char[UseBufferSize * 4];
int charsRead;
do
{
// Read at most the number of characters that will fit in the input buffer. The return
// value is the actual number of characters read, or zero if no characters remain.
charsRead = inputFile.Read(chars, 0, UseBufferSize * 4);
bool completed = false;
int charIndex = 0;
int charsUsed;
int bytesUsed;
while (!completed)
{
// If this is the last input data, flush the encoder's internal buffer and state.
bool flush = (charsRead == 0);
encoder.Convert(chars, charIndex, charsRead - charIndex,
bytes, 0, UseBufferSize, flush,
out charsUsed, out bytesUsed, out completed);
// The conversion produced the number of bytes indicated by bytesUsed. Write that number
// of bytes to the output file.
outputFile.Write(bytes, 0, bytesUsed);
// Increment charIndex to the next block of characters in the input buffer, if any, to convert.
charIndex += charsUsed;
}
}
while(charsRead != 0);
outputFile.Close();
fs.Close();
inputFile.Close();
}
// --------------------------------------------------------------------------------------------
// Use the Decoder.Convert() method to convert a file of encoded bytes to a file of characters.
// --------------------------------------------------------------------------------------------
static void DecoderConvert(String inputFileName, String outputFileName, Encoding enc)
{
// Convert an input file of of encoded bytes to an output file characters.
// StreamWriter could convert the input file for us, but we'll perform the conversion
// ourselves.
StreamWriter outputFile = new StreamWriter(outputFileName, false, Encoding.Unicode);
// Read the input as a binary file so we can detect the Byte Order Mark.
FileStream fs = new FileStream(inputFileName, FileMode.Open);
BinaryReader inputFile = new BinaryReader(fs);
// Get a Decoder.
Decoder decoder = enc.GetDecoder();
// Guarantee the output buffer large enough to convert a few characters.
int UseBufferSize = 64;
if (UseBufferSize < enc.GetMaxCharCount(10))
UseBufferSize = enc.GetMaxCharCount(10);
char[] chars = new char[UseBufferSize];
// Intentionally make the input byte buffer larger than the output character buffer so the
// conversion loop executes more than one cycle.
byte[] bytes = new byte[UseBufferSize * 4];
int bytesRead;
do
{
// Read at most the number of bytes that will fit in the input buffer. The
// return value is the actual number of bytes read, or zero if no bytes remain.
bytesRead = inputFile.Read(bytes, 0, UseBufferSize * 4);
bool completed = false;
int byteIndex = 0;
int bytesUsed;
int charsUsed;
while (!completed)
{
// If this is the last input data, flush the decoder's internal buffer and state.
bool flush = (bytesRead == 0);
decoder.Convert(bytes, byteIndex, bytesRead - byteIndex,
chars, 0, UseBufferSize, flush,
out bytesUsed, out charsUsed, out completed);
// The conversion produced the number of characters indicated by charsUsed. Write that number
// of characters to the output file.
outputFile.Write(chars, 0, charsUsed);
// Increment byteIndex to the next block of bytes in the input buffer, if any, to convert.
byteIndex += bytesUsed;
}
}
while(bytesRead != 0);
outputFile.Close();
fs.Close();
inputFile.Close();
}
// --------------------------------------------------------------------------------------------
// Create a large file of UTF-16 encoded Unicode characters.
// --------------------------------------------------------------------------------------------
static void CreateTestFile(String FileName)
{
// StreamWriter defaults to UTF-8 encoding so explicitly specify Unicode, that is,
// UTF-16, encoding.
StreamWriter file = new StreamWriter(FileName, false, Encoding.Unicode);
// Write a line of text 100 times.
for (int i = 0; i < 100; i++)
{
file.WriteLine("This is an example input file used by the convert example.");
}
// Write Unicode characters from U+0000 to, but not including, the surrogate character range.
for (char c = (char)0; c < (char)0xD800; c++)
{
file.Write(c);
}
file.Close();
}
}
/*
This code example produces the following results:
(Execute the -dir- console window command and examine the files created.)
Example.txt, which contains 122,594 bytes (61,297 UTF-16 encoded characters).
UTF8.txt, which contains 169,712 UTF-8 encoded bytes.
UTF16.txt, which contains 122,594 bytes (61,297 UTF-16 encoded characters).
(Execute the -comp- console window command and compare the two Unicode files.)
>comp example.txt utf16.txt /L
Comparing example.txt and utf16.txt...
Files compare OK
(The two files are equal.)
*/
' This code example demonstrates the Encoder.Convert() and Decoder.Convert methods.
' This example uses files for input and output, but any source that can be expressed
' as a stream can be used instead.
Imports System.Text
Imports System.IO
Public Class Sample
Shared Sub Main(ByVal args() As String)
' Create a large file of UTF-16 encoded Unicode characters. The file is named Example.txt,
' and is used as input to the Encoder.Convert() method.
CreateTestFile("Example.txt")
' Using an input file of UTF-16 encoded characters named Example.txt, create an output file
' of UTF-8 encoded bytes named UTF8.txt.
EncoderConvert("Example.txt", "UTF8.txt", Encoding.UTF8)
' Using an input file of UTF-8 encoded bytes named UTF8.txt, create an output file
' of UTF-16 encoded characters named UTF16.txt.
DecoderConvert("UTF8.txt", "UTF16.txt", Encoding.UTF8)
End Sub
' --------------------------------------------------------------------------------------------
' Use the Encoder.Convert() method to convert a file of characters to a file of encoded bytes.
' --------------------------------------------------------------------------------------------
Shared Sub EncoderConvert(ByVal inputFileName As String, ByVal outputFileName As String, ByVal enc As Encoding)
' Convert an input file of characters to an output file of encoded bytes.
' StreamWriter could convert the input file for us, but we'll perform the conversion
' ourselves.
Dim fs As New FileStream(outputFileName, FileMode.Create)
Dim outputFile As New BinaryWriter(fs)
' StreamReader will detect Unicode encoding from the Byte Order Mark that heads the input file.
Dim inputFile As New StreamReader(inputFileName)
' Get an Encoder.
Dim encoder As Encoder = enc.GetEncoder()
' Guarantee the output buffer large enough to convert a few characters.
Dim UseBufferSize As Integer = 64
If UseBufferSize < enc.GetMaxByteCount(10) Then
UseBufferSize = enc.GetMaxByteCount(10)
End If
Dim bytes(UseBufferSize) As Byte
' Intentionally make the input character buffer larger than the output byte buffer so the
' conversion loop executes more than one cycle.
Dim chars(UseBufferSize * 4) As Char
Dim charsRead As Integer
Do
' Read at most the number of characters that will fit in the input buffer. The return
' value is the actual number of characters read, or zero if no characters remain.
charsRead = inputFile.Read(chars, 0, UseBufferSize * 4)
Dim completed As Boolean = False
Dim charIndex As Integer = 0
Dim charsUsed As Integer
Dim bytesUsed As Integer
While Not completed
' If this is the last input data, flush the encoder's internal buffer and state.
Dim flush As Boolean = charsRead = 0
encoder.Convert(chars, charIndex, charsRead - charIndex, bytes, 0, UseBufferSize, flush, charsUsed, bytesUsed, completed)
' The conversion produced the number of bytes indicated by bytesUsed. Write that number
' of bytes to the output file.
outputFile.Write(bytes, 0, bytesUsed)
' Increment charIndex to the next block of characters in the input buffer, if any, to convert.
charIndex += charsUsed
End While
Loop While charsRead <> 0
outputFile.Close()
fs.Close()
inputFile.Close()
End Sub
' --------------------------------------------------------------------------------------------
' Use the Decoder.Convert() method to convert a file of encoded bytes to a file of characters.
' --------------------------------------------------------------------------------------------
Shared Sub DecoderConvert(ByVal inputFileName As String, ByVal outputFileName As String, ByVal enc As Encoding)
' Convert an input file of of encoded bytes to an output file characters.
' StreamWriter could convert the input file for us, but we'll perform the conversion
' ourselves.
Dim outputFile As New StreamWriter(outputFileName, False, Encoding.Unicode)
' Read the input as a binary file so we can detect the Byte Order Mark.
Dim fs As New FileStream(inputFileName, FileMode.Open)
Dim inputFile As New BinaryReader(fs)
' Get a Decoder.
Dim decoder As Decoder = enc.GetDecoder()
' Guarantee the output buffer large enough to convert a few characters.
Dim UseBufferSize As Integer = 64
If UseBufferSize < enc.GetMaxCharCount(10) Then
UseBufferSize = enc.GetMaxCharCount(10)
End If
Dim chars(UseBufferSize) As Char
' Intentionally make the input byte buffer larger than the output character buffer so the
' conversion loop executes more than one cycle.
Dim bytes(UseBufferSize * 4) As Byte
Dim bytesRead As Integer
Do
' Read at most the number of bytes that will fit in the input buffer. The
' return value is the actual number of bytes read, or zero if no bytes remain.
bytesRead = inputFile.Read(bytes, 0, UseBufferSize * 4)
Dim completed As Boolean = False
Dim byteIndex As Integer = 0
Dim bytesUsed As Integer
Dim charsUsed As Integer
While Not completed
' If this is the last input data, flush the decoder's internal buffer and state.
Dim flush As Boolean = bytesRead = 0
decoder.Convert(bytes, byteIndex, bytesRead - byteIndex, chars, 0, UseBufferSize, flush, bytesUsed, charsUsed, completed)
' The conversion produced the number of characters indicated by charsUsed. Write that number
' of characters to the output file.
outputFile.Write(chars, 0, charsUsed)
' Increment byteIndex to the next block of bytes in the input buffer, if any, to convert.
byteIndex += bytesUsed
End While
Loop While bytesRead <> 0
outputFile.Close()
fs.Close()
inputFile.Close()
End Sub
' --------------------------------------------------------------------------------------------
' Create a large file of UTF-16 encoded Unicode characters.
' --------------------------------------------------------------------------------------------
Shared Sub CreateTestFile(ByVal FileName As String)
' StreamWriter defaults to UTF-8 encoding so explicitly specify Unicode, that is,
' UTF-16, encoding.
Dim file As New StreamWriter(FileName, False, Encoding.Unicode)
' Write a line of text 100 times.
Dim i As Integer
For i = 0 To 99
file.WriteLine("This is an example input file used by the convert example.")
Next i
' Write Unicode characters from U+0000 to, but not including, the surrogate character range.
Dim c As Integer
For c = 0 To &HD800
file.Write(ChrW(c))
Next c
file.Close()
End Sub
End Class
'
'This code example produces the following results:
'
'(Execute the -dir- console window command and examine the files created.)
'
'Example.txt, which contains 122,594 bytes (61,297 UTF-16 encoded characters).
'UTF8.txt, which contains 169,712 UTF-8 encoded bytes.
'UTF16.txt, which contains 122,594 bytes (61,297 UTF-16 encoded characters).
'
'(Execute the -comp- console window command and compare the two Unicode files.)
'
'>comp example.txt utf16.txt /L
'Comparing example.txt and utf16.txt...
'Files compare OK
'
'(The two files are equal.)
'
備註
completed輸出參數表示輸入緩衝區中的所有資料是否已轉換並儲存在輸出緩衝區。 若參數指定的位元組byteCount數無法在不超過該參數指定charCount字元數的情況下轉換,則設定為 。false 在這種情況下,應用程式應使用輸出緩衝區的內容或提供新的輸出緩衝區,將參數依照參數指定的bytesUsed位元組數遞增bytes,然後再次呼叫該Convert方法處理剩餘的輸入。
即使參數byteCount相等,參數也可以設為 false。bytesUsedcompleted 這種情況發生在物件中 Decoder 仍有未儲存在 bytes 緩衝區的資料時。