[关闭]
@Arslan6and6 2016-10-10T06:52:51.000000Z 字数 2457 阅读 717

spark streaming集成flume、kafka

spark


集成flume

maven工程中添加依赖

  1. <!-- flume -->
  2. <dependency>
  3. <groupId>org.apache.spark</groupId>
  4. <artifactId>spark-streaming-flume_2.10</artifactId>
  5. <version>${spark.version}</version>
  6. <scope>compile</scope>
  7. </dependency>

可以在IDEA中运行

在命令行中运行需添加如下包到spark lib目录
spark源码包下 external/flume/target/spark-streaming-flume_2.10-1.6.1.jar
flume目录lib下 flume-avro-source-1.5.0-cdh5.3.6.jar 和 flume-ng-sdk-1.5.0-cdh5.3.6.jar

并添加到spark目录下自定义关联jar包目录 inputJars 就可以在shell命令行运行

  1. [beifeng@hadoop-senior spark-1.6.1-bin-2.5.0-cdh5.3.6]$ bin/spark-shell \
  2. > --master local[3] \
  3. > --jars /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/mysql-connector-java-5.1.27-bin.jar ,\
  4. > /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/flume-avro-source-1.5.0-cdh5.3.6.jar ,\
  5. > /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/flume-ng-sdk-1.5.0-cdh5.3.6.jar ,\
  6. > /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/spark-streaming-flume_2.10-1.6.1.jar

集成kafka

  1. <!-- kafka -->
  2. <dependency>
  3. <groupId>org.apache.spark</groupId>
  4. <artifactId>spark-streaming-kafka_2.10</artifactId>
  5. <version>${spark.version}</version>
  6. <scope>compile</scope>
  7. </dependency>

在命令行中运行需添加如下包
到spark目录下自定义关联jar包目录 inputJars 就可以在shell命令行运行

  1. [beifeng@hadoop-senior spark-1.6.1-bin-2.5.0-cdh5.3.6]$ bin/spark-shell \
  2. --master local[3] \
  3. --jars /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/mysql-connector-java-5.1.27-bin.jar ,\
  4. /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/kafka_2.10-0.8.2.1.jar ,\
  5. /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/kafka-clients-0.8.2.1.jar ,\
  6. /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/metrics-core-2.2.0.jar ,\
  7. /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/spark-streaming-kafka_2.10-1.6.1.jar ,\
  8. /opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/zkclient-0.3.jar

或是

  1. export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/mysql-connector-java-5.1.27-bin.jar
  2. export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/kafka_2.10-0.8.2.1.jar
  3. export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/kafka-clients-0.8.2.1.jar
  4. export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/metrics-core-2.2.0.jar
  5. export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/spark-streaming-kafka_2.10-1.6.1.jar
  6. export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/modules/spark-1.6.1-bin-2.5.0-cdh5.3.6/inputJars/zkclient-0.3.jar
添加新批注
在作者公开此批注前,只有你和作者可见。
回复批注