Over a million developers have joined DZone.

An Even Faster Java Expression Evaluator

Download Forrester’s “Vendor Landscape, Application Performance Management” report that examines the evolving role of APM as a key driver of customer satisfaction and business success, brought to you in partnership with BMC.

I’ve been reading the How to write one of the fastest expression evaluators in Java article (also published over at JCG) and thought to myself – there is an even faster way!

Thus I whipped up a Caliper benchmark which you can check out on my GitHub account.

The benchmark compiles the performance of the following libraries:

Before getting into the results, a couple of general words about the state of (open source) expression evaluator libraries in Java (I couldn’t test closed source ones because they impose restrictions – like “only 15 expressions evaluated” – which make it impossible to benchmark them):

  • None of them are in Maven central (or even in Maven for that matter – with the exception of Janino, but there you still need a different repo to get the latest version)
  • Their development is mostly stalled
  • They are lacking in features and/or are buggy

The benchmark consist in evaluating the 2 + (7-5) * 3.14159 * x + sin(0) expression (similar to the original article). Unfortunately the exponentiation part had to be eliminated because not all libraries support it. Also, x had to be fixed to 0 because parsii erroneously always considers it zero.

Most of the libraries (JEPLite and MathEval being the exception) separate the idea of “compiling” and “evaluating” the expression. This can be very useful if we wish to evaluate the same expression for a multitude of values of the contained variable. Thus the benchmark tests two cases: compile + evaluate and only evaluate (depending on your usecase one or the other is more relevant). Also, Jeval is by default excluded from the benchmark since it performs so poorly that it eclipses all the other results. If you wish to see just how poorly it performs, uncomment the relevant methods from BenchmarkExpressionEvaluation.

Without further ado, here are the results (smaller is better):


As you can see, the custom implementation beats parsii by a factor of 10x! So how does it work? You can always check out the source code, but I will also explain: It uses Janino to compile a class of the form:

import static java.lang.Math.*; // to get sin, cos, etc

public static final class JaninoCompiledFastexpr1234 implements UnaryDoubleFunction {
    public double evaluate(double x) {
        return (/* your expression here */);

which you can later invoke. The advantages of this approach are:

  • raw speed – as you can see from the benchmark, the Java parser / compiler is very well optimized and hard to beat, even when rolling your own parser for a subset of cases
  • raw speed the second time – the JIT helps us immensely and gives optimizations like constant folding or vectorization for “free”
  • brevity – the whole proof of concept is implemented in 60 lines, probably less lines than this article contains
  • “garbage free” – after the initial compilation no allocations are made in the heap, but rather all parameters are passed on the stack (since they are primitives)

Of course the solutions is not without its drawbacks:

  • depending on the source of the expression we could be compiling and executing arbitrary Java code sent to us by a malicious user. Not a pretty prospect. We try to mitigate this in two ways:
    • whitelist the set of characters allowed in the expression
    • execute all expressions under a special classloader which uses a ProtectionDomain that has no permissions
  • the above isn’t wholly satisfactory however:
    • the compiler itself can have bugs (it has happened in the past)
    • neither of the safety measures efficiently prevent the potential denial of service conditions (like creating infinite loops or consuming all the memory)
  • also, this is just a proof of concept – for a more complete solution one would need to add more interfaces (like BinaryDoubleFunction, TernaryDoubleFunction and also UnaryLongFunction, etc) as well as some settings related to the naming of the parameters.

Still, this is a great example as to the power of the Java ecosystem and what can be achieved by thinking  a little outside of the box.

If you run the benchmark for yourself, you might find a couple of interesting things:

  • the “Janino” test continuously warns about recompilation. This is expected if you think a little bit about it – since at every invocation it “compiles” the expression. The same is true for JaninoFastexpr
  • both Janino and JaninoPrecompiled complain about excessive garbage collection. This is because they use auto-boxing (varargs) to pass the parameters which isn’t very efficient (generates a lot of garbage)

This is it folks, enjoy the source code and hopefully this will inspire people to find better solutions to their problems!

See Forrester’s Report, “Vendor Landscape, Application Performance Management” to identify the right vendor to help IT deliver better service at a lower cost, brought to you in partnership with BMC.


Published at DZone with permission of Attila-Mihaly Balazs, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

The best of DZone straight to your inbox.

Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}