Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation

Peng Jiang; Gagan Agrawal

doi:10.1145/3018743.3018760

Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation

Peng Jiang, Gagan Agrawal

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

Finite State Machine (FSM) is the key kernel behind many popular applications, including regular expression matching, text tokenization, and Huffman decoding. Parallelizing FSMs is extremely difficult because of the strong dependencies and unpredictable memory accesses. Previous efforts have largely focused on multi-core parallelization, and used different approaches, including {\em speculative} and {\em enumerative} execution, both of which have been effective but also have limitations. With increasing width and improving flexibility in SIMD instruction sets, this paper focuses on combining SIMD and multi/many-core parallelism for FSMs. We have developed a novel strategy, called {\em enumerative speculation}. Instead of speculating on a single state as in speculative execution or enumerating all possible states as in enumerative execution, our strategy speculates transitions from several possible states, reducing the prediction overheads of speculation approach and the large amount of redundant work in the enumerative approach. A simple lookback approach produces a set of guessed states to achieve high speculation success rates in our enumerative speculation. We evaluate our method with four popular FSM applications: Huffman decoding, regular expression matching, HTML tokenization, and Div7. We obtain up to 2.5x speedup using SIMD on one core and up to 95x combining SIMD with 60 cores of an Intel Xeon Phi. On a single core, we outperform the best single-state speculative execution version by an average of 1.6x, and in combining SIMD and many-core parallelism, outperform enumerative execution by an average of 2x.

Original language	English (US)
Pages (from-to)	179-191
Number of pages	13
Journal	ACM SIGPLAN Notices
Volume	52
Issue number	8
DOIs	https://doi.org/10.1145/3018743.3018760
State	Published - Jan 26 2017
Externally published	Yes

Keywords

enumerative speculation
finite state machines
simd

ASJC Scopus subject areas

General Computer Science

Access to Document

10.1145/3018743.3018760

Cite this

@article{b4c7777090f545488621cd646ec11054,

title = "Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation",

abstract = "Finite State Machine (FSM) is the key kernel behind many popular applications, including regular expression matching, text tokenization, and Huffman decoding. Parallelizing FSMs is extremely difficult because of the strong dependencies and unpredictable memory accesses. Previous efforts have largely focused on multi-core parallelization, and used different approaches, including {\em speculative} and {\em enumerative} execution, both of which have been effective but also have limitations. With increasing width and improving flexibility in SIMD instruction sets, this paper focuses on combining SIMD and multi/many-core parallelism for FSMs. We have developed a novel strategy, called {\em enumerative speculation}. Instead of speculating on a single state as in speculative execution or enumerating all possible states as in enumerative execution, our strategy speculates transitions from several possible states, reducing the prediction overheads of speculation approach and the large amount of redundant work in the enumerative approach. A simple lookback approach produces a set of guessed states to achieve high speculation success rates in our enumerative speculation. We evaluate our method with four popular FSM applications: Huffman decoding, regular expression matching, HTML tokenization, and Div7. We obtain up to 2.5x speedup using SIMD on one core and up to 95x combining SIMD with 60 cores of an Intel Xeon Phi. On a single core, we outperform the best single-state speculative execution version by an average of 1.6x, and in combining SIMD and many-core parallelism, outperform enumerative execution by an average of 2x.",

keywords = "enumerative speculation, finite state machines, simd",

author = "Peng Jiang and Gagan Agrawal",

note = "Publisher Copyright: {\textcopyright} 2017 ACM.",

year = "2017",

month = jan,

day = "26",

doi = "10.1145/3018743.3018760",

language = "English (US)",

volume = "52",

pages = "179--191",

journal = "ACM SIGPLAN Notices",

issn = "1523-2867",

publisher = "Association for Computing Machinery (ACM)",

number = "8",

}

TY - JOUR

T1 - Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation

AU - Jiang, Peng

AU - Agrawal, Gagan

PY - 2017/1/26

Y1 - 2017/1/26

N2 - Finite State Machine (FSM) is the key kernel behind many popular applications, including regular expression matching, text tokenization, and Huffman decoding. Parallelizing FSMs is extremely difficult because of the strong dependencies and unpredictable memory accesses. Previous efforts have largely focused on multi-core parallelization, and used different approaches, including {\em speculative} and {\em enumerative} execution, both of which have been effective but also have limitations. With increasing width and improving flexibility in SIMD instruction sets, this paper focuses on combining SIMD and multi/many-core parallelism for FSMs. We have developed a novel strategy, called {\em enumerative speculation}. Instead of speculating on a single state as in speculative execution or enumerating all possible states as in enumerative execution, our strategy speculates transitions from several possible states, reducing the prediction overheads of speculation approach and the large amount of redundant work in the enumerative approach. A simple lookback approach produces a set of guessed states to achieve high speculation success rates in our enumerative speculation. We evaluate our method with four popular FSM applications: Huffman decoding, regular expression matching, HTML tokenization, and Div7. We obtain up to 2.5x speedup using SIMD on one core and up to 95x combining SIMD with 60 cores of an Intel Xeon Phi. On a single core, we outperform the best single-state speculative execution version by an average of 1.6x, and in combining SIMD and many-core parallelism, outperform enumerative execution by an average of 2x.

AB - Finite State Machine (FSM) is the key kernel behind many popular applications, including regular expression matching, text tokenization, and Huffman decoding. Parallelizing FSMs is extremely difficult because of the strong dependencies and unpredictable memory accesses. Previous efforts have largely focused on multi-core parallelization, and used different approaches, including {\em speculative} and {\em enumerative} execution, both of which have been effective but also have limitations. With increasing width and improving flexibility in SIMD instruction sets, this paper focuses on combining SIMD and multi/many-core parallelism for FSMs. We have developed a novel strategy, called {\em enumerative speculation}. Instead of speculating on a single state as in speculative execution or enumerating all possible states as in enumerative execution, our strategy speculates transitions from several possible states, reducing the prediction overheads of speculation approach and the large amount of redundant work in the enumerative approach. A simple lookback approach produces a set of guessed states to achieve high speculation success rates in our enumerative speculation. We evaluate our method with four popular FSM applications: Huffman decoding, regular expression matching, HTML tokenization, and Div7. We obtain up to 2.5x speedup using SIMD on one core and up to 95x combining SIMD with 60 cores of an Intel Xeon Phi. On a single core, we outperform the best single-state speculative execution version by an average of 1.6x, and in combining SIMD and many-core parallelism, outperform enumerative execution by an average of 2x.

KW - enumerative speculation

KW - finite state machines

KW - simd

UR - http://www.scopus.com/inward/record.url?scp=85084185668&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85084185668&partnerID=8YFLogxK

U2 - 10.1145/3018743.3018760

DO - 10.1145/3018743.3018760

M3 - Article

AN - SCOPUS:85084185668

SN - 1523-2867

VL - 52

SP - 179

EP - 191

JO - ACM SIGPLAN Notices

JF - ACM SIGPLAN Notices

IS - 8

ER -

Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this