Spark Component

Overview

The Spark Component is used in Semarchy xDI to integrate with this technology and produce integration flows.

Install the Spark Component

If it is not available yet in your Semarchy xDI Designer, install the Spark Component from Designer using the component installation process.

Supported Features

Spark 2

Feature Description

LOAD

Data can be loaded to Spark: HBase, HDFS, Hive, RBDMS, Vertica, Parquet, Elasticsearch

Data can also be loaded from Spark: Hive, RDBMS, Vertica, Parquet, Elasticsearch

INTEGRATE

Data can in integrated from Spark: HDFS , Hive, RDBMS

STAGE

Spark Metadata can be used as a stage (between loading and integration) to boost Hadoop Mappings.

Spark Stage can be: SQL, Java