1.在windows下安裝maven
Eclipse開發(fā)工具
Jdk1.7
Apache maven
下載maven[http://maven.apache.org/download.cgi]
(1)Maven安裝
安裝起始很簡單,解壓壓縮包,配置環(huán)境變量
--解壓maven壓縮包,配置MAVEN_HOME和Path環(huán)境變量。
--打開命令窗口,執(zhí)行“mvn -v”,若出現(xiàn)如下界面,證明環(huán)境變量配置成功
(2)Eclipse下搭建maven開發(fā)環(huán)境并構(gòu)建項(xiàng)目
--Eclipse安裝maven插件
打開eclipse --》eclipse market --》輸入maven -->安裝

--要讓eclipse中的maven重新定位一下
Window -> Preference -> Maven -> Installation -> Add進(jìn)行設(shè)置
現(xiàn)在可以創(chuàng)建maven項(xiàng)目了
2.編寫代碼
生產(chǎn)者代碼
package com.chh.test;
import java.util.Properties;
import kafka.producer.KeyedMessage;
import kafka.producer.ProducerConfig;
public class KafkaProducer extends Thread
{
private final kafka.javaapi.producer.Producer<Integer, String> producer;
private final String topic;
private final Properties props = new Properties();
public KafkaProducer(String topic)
{
props.put("serializer.class", "kafka.serializer.StringEncoder");
props.put("metadata.broker.list", "master:9092,slave1:9092,slave2:9092");
producer = new kafka.javaapi.producer.Producer<Integer, String>(new ProducerConfig(props));
this.topic = topic;
}
@Override
public void run() {
int messageNo = 1;
while (true)
{
String messageStr = new String("Message_" + messageNo);
System.out.println("Send:" + messageStr);
producer.send(new KeyedMessage<Integer, String>(topic, messageStr));
messageNo++;
try {
sleep(3000);
} catch (InterruptedException e) {
// TODO Auto-generated catch block
e.printStackTrace();
}
}
}
}
消費(fèi)者代碼
package com.chh.test;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Properties;
import kafka.consumer.ConsumerConfig;
import kafka.consumer.ConsumerIterator;
import kafka.consumer.KafkaStream;
import kafka.javaapi.consumer.ConsumerConnector;
public class KafkaConsumer extends Thread
{
private final ConsumerConnector consumer;
private final String topic;
public KafkaConsumer(String topic)
{
consumer = kafka.consumer.Consumer.createJavaConsumerConnector(
createConsumerConfig());
this.topic = topic;
}
private static ConsumerConfig createConsumerConfig()
{
Properties props = new Properties();
props.put("zookeeper.connect", KafkaProperties.zkConnect);
props.put("group.id", KafkaProperties.groupId);
props.put("zookeeper.session.timeout.ms", "40000");
props.put("zookeeper.sync.time.ms", "200");
props.put("auto.commit.interval.ms", "1000");
return new ConsumerConfig(props);
}
@Override
public void run() {
Map<String, Integer> topicCountMap = new HashMap<String, Integer>();
topicCountMap.put(topic, new Integer(1));
Map<String, List<KafkaStream<byte[], byte[]>>> consumerMap = consumer.createMessageStreams(topicCountMap);
KafkaStream<byte[], byte[]> stream = consumerMap.get(topic).get(0);
ConsumerIterator<byte[], byte[]> it = stream.iterator();
while (it.hasNext()) {
System.out.println("receive:" + new String(it.next().message()));
try {
sleep(3000);
} catch (InterruptedException e) {
e.printStackTrace();
}
}
}
}
主程序
package com.chh.test;
public class KafkaDemo {
public static void main(String[] args)
{
KafkaProducer producerThread = new KafkaProducer(KafkaProperties.topic);
producerThread.start();
KafkaConsumer consumerThread = new KafkaConsumer(KafkaProperties.topic);
consumerThread.start();
}
}
配置文件
package com.chh.test;
public interface KafkaProperties {
final static String zkConnect = "master:2181,slave1:2181,slave2:2181";
final static String groupId = "group1";
final static String topic = "topic2";
final static int kafkaProducerBufferSize = 64 * 1024;
final static int connectionTimeOut = 20000;
final static int reconnectInterval = 10000;
final static String clientId = "SimpleConsumerDemoClient";
}
才坑指南
(1)總是報(bào)錯(cuò):kafka Failed to send messages after 3 tries
原因自己linux中用的都是hosts,沒用真是的ip,而程序中用了真是的ip。
首先程序會去server獲得zookeeper的地址,如master、slave2
然后再用slave2,master訪問,此時(shí)訪問不到zookper
http://www.tuicool.com/articles/eeIvyy3
(2)連接虛擬機(jī)的時(shí)候用wifi,不能直接用線!??!
原理講解
1.消費(fèi)者模型
(1)分區(qū)消費(fèi)模型


(2)組(Group)消費(fèi)模型


Consumer分配算法

2.消費(fèi)者
(1)同步生產(chǎn)模型

(2)異步生產(chǎn)模型


(3)兩種生產(chǎn)模型對比
同步生產(chǎn)模型:
(1)低消息丟失率;
(2)高消息重復(fù)率(由于網(wǎng)絡(luò)原因,回復(fù)確認(rèn)未收到);
(3)高延遲
異步生產(chǎn)模型:
(1)低延遲;
(2)高發(fā)送性能;
(3)高消息丟失率(無確認(rèn)機(jī)制,發(fā)送端隊(duì)列滿)