
decompressor-prototype's People

Contributors

flagxor, karlschimpf


decompressor-prototype's Issues

~std::vector causing runtime crashes?

For reasons I don't understand, I am getting runtime crashes (free of a bad pointer) when the std::vector destructor is called.

It occurs in vectors in the SymbolTable and CounterWriter classes, and it is very consistent.

Before changing the std::vector fields in these classes to (heap-allocated) pointers that are never deleted, the code consistently died on:

build/debug/bin/compress-int -i test/test-sources/toy.wasm -l 4

(any number greater than 3 for -l did this).
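
For illustration, here is a minimal sketch of the workaround described above, using hypothetical class and field names (not the actual SymbolTable/CounterWriter members): the by-value std::vector member becomes a heap-allocated pointer that is intentionally never deleted, so the vector's destructor never runs for it.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the affected classes: the by-value std::vector
// member is replaced by a heap-allocated vector that is intentionally never
// deleted, so ~std::vector never runs for it.
class PatternCounts {
public:
  PatternCounts() : Counts(new std::vector<size_t>()) {}
  // No delete here: leaking the vector sidesteps the "free of a bad
  // pointer" crash observed when the destructor ran.
  ~PatternCounts() {}
  std::vector<size_t> *Counts;
};
```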

Reading Stack Height?

Hello, I'm experimenting with adding metering to wasm. Currently it is done through an AST transform. It is pretty simple: it reads the AST, counts the number of nodes for each branch, and appends a metering statement at each leaf (more here).

So, two questions:

  1. Would it even make sense to inject the metering statements using a filter? Is this too far out of scope?
  2. If so, how would you do it? I think it is currently impossible: you need to count the number of AST nodes, and I think you would have to be able to read the stack height.
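
For concreteness, here is a rough sketch of the AST transform described above, written in C++ with a purely hypothetical Node type and cost model (the actual metering tool is a separate project): each branch's nodes are counted and a metering statement charging that count is appended at each leaf.

```cpp
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical AST node; not the actual wasm or filter AST.
struct Node {
  bool isLeaf() const { return Kids.empty(); }
  std::vector<std::unique_ptr<Node>> Kids;
};

// Placeholder for building a "charge Cost units" metering statement.
std::unique_ptr<Node> makeMeterStatement(size_t Cost) {
  (void)Cost;
  return std::make_unique<Node>();
}

size_t countNodes(const Node *N) {
  size_t Count = 1;
  for (const auto &Kid : N->Kids)
    Count += countNodes(Kid.get());
  return Count;
}

void appendMeterAtLeaves(Node *N, size_t Cost) {
  if (N->isLeaf()) {
    N->Kids.push_back(makeMeterStatement(Cost));
    return;
  }
  for (auto &Kid : N->Kids)
    appendMeterAtLeaves(Kid.get(), Cost);
}

// For each branch: count its nodes, then append a metering statement
// (charging that count) at each leaf of the branch.
void meterBranch(Node *Branch) {
  appendMeterAtLeaves(Branch, countNodes(Branch));
}
```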

Different forms of define arguments.

Currently, all arguments to a define are "pass by expression", matching a structured form of a static macro call. To add more power, we would really like to allow many different forms of parameter passing (done at the Eval node). The proposed new parameter forms are:

(params) - The define gets no arguments.

(params n) - The define gets n values. These values are evaluated before they are passed to the define.

(exprs n) - The define gets n expressions that are only evaluated on demand, within the define.

(exprs.cached n) - The define gets n expressions, but the data for expressions are cached at the point of the call, rather than on demand. When reading, this allows the same data to be read multiple times, effectively allowing the insertion of multiple copies.

(cached) - The data for the macro (and not its arguments) is cached, based on the single argument value passed in.

(args E1 ... EN) - A mixed list of arguments E1 ... EN, each of which can be one of the previous argument specifiers.
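
For illustration only, here is a minimal sketch of how these proposed forms might be modeled when evaluating a call; the names and layout are hypothetical and do not reflect the existing Eval implementation.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical model of the proposed parameter forms; the real Eval node
// and define representation would differ.
enum class ParamForm {
  None,        // (params)         - the define gets no arguments
  Values,      // (params n)       - n values, evaluated at the call site
  Exprs,       // (exprs n)        - n expressions, evaluated on demand
  ExprsCached, // (exprs.cached n) - n expressions, data cached at the call
  Cached,      // (cached)         - the macro's data is cached, keyed by
               //                    the single argument value passed in
  Args         // (args E1 ... EN) - a mixed list of the above specifiers
};

struct ParamSpec {
  ParamForm Form = ParamForm::None;
  size_t Count = 0;             // n, when the form takes a count
  std::vector<ParamSpec> Mixed; // element specifiers, for ParamForm::Args
};
```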

The integer compressor should compress multiple integer sequences first

The current compressor is somewhat complex because it treats singleton integer patterns the same way as multiple integer sequence patterns.

The problem is that singletons yield considerably less savings because they are only replaced by an abbreviation value. Hence, a singleton may "shrink" the width (slightly), but it doesn't remove values from the stream (as multiple integer sequence patterns do).

We should schedule multiple integer sequences first, and then choose which of the remaining singletons should be converted to a pattern. This does two things:

  1. It allows us to still encode single integers using abbreviations (sized based on frequency of use), and
     the assignment of Huffman encoding values can be merged with that of the multiple integer sequences.
  2. It simplifies the selection of multiple integer sequence patterns.
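
As a rough sketch of this two-stage selection (the types and the selection criterion are illustrative, not the current compressor's data structures):

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical candidate pattern: an integer sequence and how often it
// occurs in the stream.
struct Pattern {
  std::vector<uint64_t> Values;
  size_t UseCount;
};

// Stage 1: take every multiple-integer sequence pattern (these actually
// remove values from the stream). Stage 2: of the remaining singletons,
// keep only those used often enough to be worth an abbreviation; their
// abbreviation/Huffman sizes can then be assigned together with the
// sequence patterns.
std::vector<Pattern> selectPatterns(const std::vector<Pattern> &Candidates,
                                    size_t SingletonUseThreshold) {
  std::vector<Pattern> Selected;
  for (const auto &P : Candidates)
    if (P.Values.size() > 1)
      Selected.push_back(P);
  for (const auto &P : Candidates)
    if (P.Values.size() == 1 && P.UseCount >= SingletonUseThreshold)
      Selected.push_back(P);
  return Selected;
}
```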

Need cleaner granularity on Symbol tables and headers

To generalize the decompressor reader, we need multiple copies of default (algorithm) symbol tables. Currently, this isn't possible.

Either we should add a "copy" symbol table method, or modify the "root" install methods to allow us to update the default copy.

Another approach is to realize that, for the decompressor, we only need the header built so that we can choose the algorithm to apply. Once the algorithm is known, we can load it on demand as needed. This solution removes the need for copying the default algorithm, since it will be loaded (on demand) for as many cases as it is needed.
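
A minimal sketch of the second (on-demand) approach, with hypothetical names; it assumes the header yields an algorithm name and says nothing about how the existing install methods would change.

```cpp
#include <map>
#include <memory>
#include <string>

class SymbolTable; // the algorithm symbol table from the existing code

// Hypothetical on-demand cache: read just enough of the header to name the
// algorithm, then build (or reuse) that algorithm's symbol table lazily.
// Because every use gets its table from here, no copy of the default table
// is needed.
class AlgorithmCache {
public:
  std::shared_ptr<SymbolTable> get(const std::string &AlgorithmName) {
    auto &Slot = Cache[AlgorithmName];
    if (!Slot)
      Slot = loadAlgorithm(AlgorithmName); // hypothetical loader
    return Slot;
  }

private:
  // Builds the symbol table for the named algorithm (left undefined here).
  std::shared_ptr<SymbolTable> loadAlgorithm(const std::string &AlgorithmName);
  std::map<std::string, std::shared_ptr<SymbolTable>> Cache;
};
```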

Simplifying Queue and Pipe data structures.

The original intent of the Queue data structure was to define a data structure that allowed simultaneous reading and writing.

The "Queue.FirstPage" field was used to automatically clean up when streaming. However, because there was only one first page for both reading and writing, the Pipe concept could not be implemented.

A better solution is to add a "read" first page and a "write" first page. Each of these (shared) pointers keeps track of the portion of the queue that is being used for reading or writing. Either page pointer may be null if the corresponding concept is not being used.

If there is both a "read" and a "write" first page, the "read" page should always be behind the "write" first page, unless "eof" has been frozen (in which case the remainder of the input can be read).

In addition, advancement of read cursors should be limited to pages that appear before the first write page (this only applies if the first write page is non-null). This guarantees that the input being read is no longer changing.
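
A sketch of the proposed split, with hypothetical field and method names, showing the two first-page pointers and the read-cursor limit described above:

```cpp
#include <cstddef>
#include <memory>

struct Page {
  size_t Index; // position of this page within the queue
  std::shared_ptr<Page> Next;
};

// Hypothetical simplification of Queue: separate "read" and "write" first
// pages, each tracking the portion of the queue its side still needs.
// Either pointer may be null when that side is not in use.
class Queue {
public:
  // A read cursor may only advance onto pages strictly before the first
  // write page (when one exists), so readers never see data that is still
  // being written. Once eof is frozen, the remainder may be read.
  bool canReadPage(const Page &P) const {
    if (EofFrozen || !FirstWritePage)
      return true;
    return P.Index < FirstWritePage->Index;
  }

private:
  std::shared_ptr<Page> FirstReadPage;
  std::shared_ptr<Page> FirstWritePage;
  bool EofFrozen = false;
};
```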
