申请试用
HOT
登录
注册
 
Vectorized R Execution in Apache Spark
Vectorized R Execution in Apache Spark

Vectorized R Execution in Apache Spark

Spark开源社区
/
发布于
/
4327
人观看

Apache Spark already has a vectorization optimization in many operations, for instance, internal columnar format, Parquet/ORC vectorized read, Pandas UDFs, etc. Vectorization improves performance greatly in general. In this talk, the performance aspect of SparkR will be discussed and vectorization in SparkR will be introduced with technical details. SparkR vectorization allows users to use the existing codes as are but boost the performance around several thousand present faster when they execute R native functions or convert Spark DataFrame to/from R DataFrame.

11点赞
4收藏
0下载
确认
3秒后跳转登录页面
去登陆