TY - GEN
T1 - Combining SIMD and many/multi-core parallelism for finite state machines with enumerative speculation
AU - Jiang, Peng
AU - Agrawal, Gagan
N1 - Funding Information:
This work was supported by NSF award CCF-1526386.
Publisher Copyright:
© 2017 ACM.
PY - 2017/1/26
Y1 - 2017/1/26
N2 - Finite State Machine (FSM) is the key kernel behind many popular applications, including regular expression matching, text tokenization, and Huffman decoding. Parallelizing FSMs is extremely difficult because of the strong dependencies and unpredictable memory accesses. Previous efforts have largely focused on multi-core parallelization, and used different approaches, including speculative and enumerative execution, both of which have been effective but also have limitations. With increasing width and improving flexibility in SIMD instruction sets, this paper focuses on combining SIMD and multi/many-core parallelism for FSMs. We have developed a novel strategy, called enumerative speculation. Instead of speculating on a single state as in speculative execution or enumerating all possible states as in enumerative execution, our strategy speculates transitions from several possible states, reducing the prediction overheads of the speculation approach and the large amount of redundant work in the enumerative approach. A simple lookback approach produces a set of guessed states to achieve high speculation success rates in our enumerative speculation. We evaluate our method with four popular FSM applications: Huffman decoding, regular expression matching, HTML tokenization, and Div7. We obtain up to 2.5x speedup using SIMD on one core and up to 95x combining SIMD with 60 cores of an Intel Xeon Phi. On a single core, we outperform the best single-state speculative execution version by an average of 1.6x, and in combining SIMD and many-core parallelism, outperform enumerative execution by an average of 2x.
AB - Finite State Machine (FSM) is the key kernel behind many popular applications, including regular expression matching, text tokenization, and Huffman decoding. Parallelizing FSMs is extremely difficult because of the strong dependencies and unpredictable memory accesses. Previous efforts have largely focused on multi-core parallelization, and used different approaches, including speculative and enumerative execution, both of which have been effective but also have limitations. With increasing width and improving flexibility in SIMD instruction sets, this paper focuses on combining SIMD and multi/many-core parallelism for FSMs. We have developed a novel strategy, called enumerative speculation. Instead of speculating on a single state as in speculative execution or enumerating all possible states as in enumerative execution, our strategy speculates transitions from several possible states, reducing the prediction overheads of the speculation approach and the large amount of redundant work in the enumerative approach. A simple lookback approach produces a set of guessed states to achieve high speculation success rates in our enumerative speculation. We evaluate our method with four popular FSM applications: Huffman decoding, regular expression matching, HTML tokenization, and Div7. We obtain up to 2.5x speedup using SIMD on one core and up to 95x combining SIMD with 60 cores of an Intel Xeon Phi. On a single core, we outperform the best single-state speculative execution version by an average of 1.6x, and in combining SIMD and many-core parallelism, outperform enumerative execution by an average of 2x.
UR - http://www.scopus.com/inward/record.url?scp=85014496180&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85014496180&partnerID=8YFLogxK
U2 - 10.1145/3018743.3018760
DO - 10.1145/3018743.3018760
M3 - Conference contribution
AN - SCOPUS:85014496180
T3 - Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
SP - 179
EP - 191
BT - PPoPP 2017 - Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
PB - Association for Computing Machinery
T2 - 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2017
Y2 - 4 February 2017 through 8 February 2017
ER -