Apache Arrow seems to be targeting the use of SIMD which is a very JVM/Runtime dependent feature. If the runtime can't detect this out-of-the-box then create recognized method or some sort of intrinsic to coax the runtime to SIMD-ize the operation.
I understand the performance gains of this but why not add this functionality to existing projects like Parquet or HTable etc...
This just comes to mind: https://xkcd.com/927/
Here's someone else's blog post from 2010 on different categories of columnar store DBs:
http://dbmsmusings.blogspot.com/2010/03/distinguishing-two-m...
Nothing that end users will ever really have to know anything about.
This post explains the intention better than the project webpage:
http://blog.cloudera.com/blog/2016/02/introducing-apache-arr...