-
show-me-the-code Public
Forked from Yixiaohan/show-me-the-codePython 练习册,每天一个小程序
UpdatedJul 25, 2017 -
awesome-wechat-weapp Public
Forked from justjavac/awesome-wechat-weapp微信小程序开发资源汇总 💯
JavaScript GNU General Public License v3.0 UpdatedJul 24, 2017 -
-
maxwell Public
Forked from zendesk/maxwellMaxwell's daemon, a mysql-to-json kafka producer
Java Other UpdatedApr 26, 2017 -
papers-we-love Public
Forked from papers-we-love/papers-we-lovePapers from the computer science community to read and discuss.
UpdatedApr 21, 2017 -
kafka-offset-manager Public
Forked from foobaar/kafka-offset-managerMove Consumer offsets as you please
Java UpdatedApr 19, 2017 -
hbase-indexer Public
Forked from prazanna/hbase-indexerLily HBase Indexer - indexing HBase, one row at a time
Java Apache License 2.0 UpdatedApr 9, 2017 -
-
kafkaLowLevelConsumer Public
Forked from MOBX/kafkaLowLevelConsumerkafka low level consumer api
Java Apache License 2.0 UpdatedMar 30, 2017 -
wechat_sogou_crawl Public
Forked from pujinxiao/wechat_sogou_crawl基于搜狗微信的公众号文章爬虫
Python UpdatedMar 28, 2017 -
hive-third-functions Public
Forked from aaronshan/hive-third-functionsSome useful custom hive udf functions, especial array and json functions.
Java UpdatedMar 17, 2017 -
streamingpro Public
Forked from byzer-org/byzer-langBuild Spark Streaming Application by SQL
JavaScript UpdatedFeb 10, 2017 -
kafka-example-in-scala Public
Forked from smallnest/kafka-example-in-scalaa kafka producer and consumer example in scala and java
Java Apache License 2.0 UpdatedJan 12, 2017 -
dw_etl Public
Forked from JasonWiki/dw_etldw etl 工具 mysql 增量、全量抽取 to hive. 合并 hive 数据表, 等数据平台清洗工具
Python UpdatedDec 21, 2016 -
wechat_spider Public
Forked from CoolWell/wechat_spider基于搜狗微信入口的微信爬虫程序。 由基于phantomjs的python实现。 使用了收费的动态代理。 采集包括文章文本、阅读数、点赞数、评论以及评论赞数。 效率:500公众号/小时。 根据采集的公众号划分为多线程,可以实现并行采集。
Python UpdatedDec 6, 2016 -
FinancialNewsSearchEngine Public
Forked from nbro/FinancialNewsSearchEngineVery simple search engine "specialised" in searching financial news (written using Nutch, Hbase, Solr, SpringBoot, Bootstrap and AngularJS)
Shell Apache License 2.0 UpdatedDec 5, 2016 -
CDS Public
Forked from XavientInformationSystems/CDSContent Data Store (HDFS/HBase)
Java UpdatedDec 1, 2016 -
KafkaProducerTool Public
Forked from kevin00chen/KafkaProducerTool对kafka自定义producer进行封装
Java UpdatedOct 18, 2016 -
yugong Public
Forked from alibaba/yugong阿里巴巴去Oracle数据迁移同步工具(全量+增量,目标支持MySQL/DRDS)
Java GNU General Public License v2.0 UpdatedJun 8, 2016 -
hbase-increment-index Public
Forked from qindongliang/hbase-increment-indexhbase+solr实现hbase的二级索引
Java UpdatedMay 31, 2016 -
puppet-cdh Public
Forked from ottomata/puppet-cdh4Puppet module for Hadoop and the rest of Cloudera's CDH 5.
Puppet MIT License UpdatedMay 25, 2016 -
ThinkBayes Public
Forked from AllenDowney/ThinkBayesCode repository for Think Bayes.
TeX UpdatedMay 10, 2016 -
canal Public
Forked from alibaba/canal阿里巴巴mysql数据库binlog的增量订阅&消费组件
Java Apache License 2.0 UpdatedMay 10, 2016 -
-
clouderasizer Public
Forked from bkvarda/clouderasizerMultipurpose tool for discovering and collecting Cloudera Manager metrics.
Python UpdatedMay 9, 2016 -
reair Public
Forked from airbnb/reairReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
Java Apache License 2.0 UpdatedMay 6, 2016 -
CDHExample Public
Forked from dailong/CDHExampleCDH集群环境Hdfs、MapReduce、Hive、Hbase、Kafka、Solr、Spark、Zookeeper、Mahout示例代码
Java UpdatedApr 11, 2016 -
-
scrapyd Public
Forked from scrapy/scrapydA service daemon to run Scrapy spiders
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 23, 2016 -
django-dynamic-scraper Public
Forked from holgerd77/django-dynamic-scraperCreating Scrapy scrapers via the Django admin interface
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 19, 2016