【问题标题】:C# equivalent of LZMA-JS compressC# 等效于 LZMA-JS 压缩
【发布时间】:2015-04-15 15:14:40
【问题描述】:

我使用了一些客户端 javscript 代码来压缩使用 this javascript library 的一些用户输入。 在后端代码中,我使用了来自this post 的代码示例,使用 C# 解压数据服务器端。

完美的作品。

现在我希望能够像 javascript 一样压缩字符串。当我使用this sample 压缩代码时,我得到一个范围从-128 到128 的有符号整数数组。现在我想使用我的后端代码来做同样的事情。

javascript 中的 LMZA 属性与 C# 代码中的默认属性有点不同,但即使我将它们更改为相同的值,我也会从两个库中得到不同的结果。

起初,C# 代码的输出值是无符号的。 其次,返回的字符数不同。 解码器的属性可能会引入差异,但我不知道如何使两个库对齐。

我的 C# 使用来自 7zip 的 LMZA SDK

我的 C# 代码解压 javascript 压缩数据(逗号分隔的有符号整数数组):

public static void Decompress(Stream inStream, Stream outStream)
{
    byte[] properties = new byte[5];
    inStream.Read(properties, 0, 5);
    SevenZip.Compression.LZMA.Decoder decoder = new SevenZip.Compression.LZMA.Decoder();
    decoder.SetDecoderProperties(properties);
    long outSize = 0;
    for (int i = 0; i < 8; i++)
    {
        int v = inStream.ReadByte();
        outSize |= ((long)(byte)v) << (8 * i);
    }
    long compressedSize = inStream.Length - inStream.Position;
    decoder.Code(inStream, outStream, compressedSize, outSize, null);
}

public static string DecompressLzma(string inputstring)
{
    if (!string.IsNullOrEmpty(inputstring))
    {
        byte[] myInts = Array.ConvertAll(inputstring.Split(','), s => (byte)int.Parse(s));
        var stream = new MemoryStream(myInts);
        var outputStream = new MemoryStream();
        Decompress(stream, outputStream);
        using (var reader = new StreamReader(outputStream))
        {
            outputStream.Position = 0;
            string output = reader.ReadToEnd();
            return output;
        }
    }

    return "";
}

压缩数据的代码是这样的(字节数不同,无符号):

public static string CompressLzma(string inputstring)
{
    if (!string.IsNullOrEmpty(inputstring))
    {               
        var stream = new MemoryStream(Encoding.Unicode.GetBytes(inputstring ?? ""));
        var outputStream = new MemoryStream();
        Compress(stream, outputStream);



        byte[] bytes = outputStream.ToArray();


    }

    return "";
}

public static void Compress(MemoryStream inStream, MemoryStream outStream)
{
    CoderPropID[] propIDs;
    object[] properties;
    PrepareEncoder(out propIDs, out properties);

    SevenZip.Compression.LZMA.Encoder encoder = new SevenZip.Compression.LZMA.Encoder();
    encoder.SetCoderProperties(propIDs, properties);
    encoder.WriteCoderProperties(outStream);
    Int64 fileSize = inStream.Length;
    for (int i = 0; i < 8; i++)
    {
        outStream.WriteByte((Byte)(fileSize >> (8 * i)));
    }
    encoder.Code(inStream, outStream, -1, -1, null);
}

public static void PrepareEncoder(out CoderPropID[] propIDs, out object[] properties)
{
    bool eos = true;
    Int32 dictionary = 1 << 16;
    Int32 posStateBits = 2;
    Int32 litContextBits = 3; // for normal files
    // UInt32 litContextBits = 0; // for 32-bit data
    Int32 litPosBits = 0;
    // UInt32 litPosBits = 2; // for 32-bit data
    Int32 algorithm = 2;
    Int32 numFastBytes = 32;
    string mf = "bt2";

    propIDs = new CoderPropID[]
    {
        CoderPropID.DictionarySize,
        CoderPropID.PosStateBits,
        CoderPropID.LitContextBits,
        CoderPropID.LitPosBits,
        CoderPropID.Algorithm,
        CoderPropID.NumFastBytes,
        CoderPropID.MatchFinder,
        CoderPropID.EndMarker
    };
    properties = new object[]
    {
        dictionary,
        posStateBits,
        litContextBits,
        litPosBits,
        algorithm,
        numFastBytes,
        mf,
        eos
    };
}

【问题讨论】:

  • 尝试使用Encoding.UTF8 insted of Encoding.Unicode
  • 确实是UTF8,我假设javascript编码是UTF16并没有尝试UTF8编码,将所有字节转换为有符号整数后,输出完全一样

标签: javascript c# compression


【解决方案1】:

此代码用于创建与 javascript 代码相同的字符串,包括 LMZA 设置:

public static string CompressLzma(string inputstring)
{
    if (!string.IsNullOrEmpty(inputstring))
    {
        var stream = new MemoryStream(Encoding.UTF8.GetBytes(inputstring ?? ""));
        var outputStream = new MemoryStream();
        Compress(stream, outputStream);


        byte[] bytes = outputStream.ToArray();
        var result = string.Join(",", Array.ConvertAll(bytes, v => signedInt((int)v)));
        return result;
    }

    return "";
}


public static void PrepareEncoder(out CoderPropID[] propIDs, out object[] properties)
{
    bool eos = true;
    Int32 dictionary = 1 << 16;
    Int32 posStateBits = 2;
    Int32 litContextBits = 3; // for normal files
    // UInt32 litContextBits = 0; // for 32-bit data
    Int32 litPosBits = 0;
    // UInt32 litPosBits = 2; // for 32-bit data
    Int32 algorithm = 2;
    Int32 numFastBytes = 64;
    string mf = "bt4";

    propIDs = new CoderPropID[]
    {
       CoderPropID.DictionarySize,
       CoderPropID.PosStateBits,
       CoderPropID.LitContextBits,
       CoderPropID.LitPosBits,
       CoderPropID.Algorithm,
       CoderPropID.NumFastBytes,
       CoderPropID.MatchFinder,
       CoderPropID.EndMarker
    };
    properties = new object[]
    {
       dictionary,
       posStateBits,
       litContextBits,
       litPosBits,
       algorithm,
       numFastBytes,
       mf,
       eos
    };
}

private static int signedInt(int unsignedInt)
{
    return unsignedInt >= 128 ? Math.Abs(128 - unsignedInt) - 128 : unsignedInt;
}


public static void Compress(MemoryStream inStream, MemoryStream outStream)
{
    CoderPropID[] propIDs;
    object[] properties;
    PrepareEncoder(out propIDs, out properties);

    SevenZip.Compression.LZMA.Encoder encoder = new SevenZip.Compression.LZMA.Encoder();
    encoder.SetCoderProperties(propIDs, properties);
    encoder.WriteCoderProperties(outStream);
    Int64 fileSize = inStream.Length;
    for (int i = 0; i < 8; i++)
    {
        outStream.WriteByte((Byte)(fileSize >> (8 * i)));
    }
    encoder.Code(inStream, outStream, -1, -1, null);
}

【讨论】:

    猜你喜欢
    • 1970-01-01
    • 2011-09-23
    • 1970-01-01
    • 1970-01-01
    • 2012-12-30
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多