Skip to content

Latest commit

 

History

History
221 lines (220 loc) · 63.3 KB

File metadata and controls

221 lines (220 loc) · 63.3 KB

KDD2014 Paper List

论文 作者 组织 摘要 翻译 代码 引用数
DeepWalk: online learning of social representations Bryan Perozzi, Rami AlRfou, Steven Skiena code 4455
Knowledge vault: a web-scale approach to probabilistic knowledge fusion Xin Dong, Evgeniy Gabrilovich, Geremy Heitz, Wilko Horn, Ni Lao, Kevin Murphy, Thomas Strohmann, Shaohua Sun, Wei Zhang code 763
Clustering and projected clustering with adaptive neighbors Feiping Nie, Xiaoqian Wang, Heng Huang code 399
GeoMF: joint geographical modeling and matrix factorization for point-of-interest recommendation Defu Lian, Cong Zhao, Xing Xie, Guangzhong Sun, Enhong Chen, Yong Rui code 388
Travel time estimation of a path using sparse trajectories Yilun Wang, Yu Zheng, Yexiang Xue code 333
Efficient mini-batch training for stochastic optimization Mu Li, Tong Zhang, Yuqiang Chen, Alexander J. Smola code 302
A dirichlet multinomial mixture model-based approach for short text clustering Jianhua Yin, Jianyong Wang code 278
Learning time-series shapelets Josif Grabocka, Nicolas Schilling, Martin Wistuba, Lars SchmidtThieme code 227
Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS) Qiming Diao, Minghui Qiu, ChaoYuan Wu, Alexander J. Smola, Jing Jiang, Chong Wang code 219
Inferring gas consumption and pollution emission of vehicles throughout a city Jingbo Shang, Yu Zheng, Wenzhu Tong, Eric Chang, Yong Yu code 166
Open question answering over curated and extracted knowledge bases Anthony Fader, Luke Zettlemoyer, Oren Etzioni code 152
Optimal real-time bidding for display advertising Weinan Zhang, Shuai Yuan, Jun Wang code 151
FastXML: a fast, accurate and stable tree-classifier for extreme multi-label learning Yashoteja Prabhu, Manik Varma code 136
Streaming submodular maximization: massive data summarization on the fly Ashwinkumar Badanidiyuru, Baharan Mirzasoleiman, Amin Karbasi, Andreas Krause code 131
Meta-path based multi-network collective link prediction Jiawei Zhang, Philip S. Yu, ZhiHua Zhou code 127
COM: a generative model for group recommendation Quan Yuan, Gao Cong, ChinYew Lin code 126
Focused clustering and outlier detection in large attributed graphs Bryan Perozzi, Leman Akoglu, Patricia Iglesias Sánchez, Emmanuel Müller code 125
Inferring user demographics and social strategies in mobile social networks Yuxiao Dong, Yang Yang, Jie Tang, Yang Yang, Nitesh V. Chawla code 124
Marble: high-throughput phenotyping from electronic health records via sparse nonnegative tensor factorization Joyce C. Ho, Joydeep Ghosh, Jimeng Sun code 121
Unsupervised learning of disease progression models Xiang Wang, David A. Sontag, Fei Wang code 121
A cost-effective recommender system for taxi drivers Meng Qu, Hengshu Zhu, Junming Liu, Guannan Liu, Hui Xiong code 118
Reducing the sampling complexity of topic models Aaron Q. Li, Amr Ahmed, Sujith Ravi, Alexander J. Smola code 116
'Beating the news' with EMBERS: forecasting civil unrest using open source indicators Naren Ramakrishnan, Patrick Butler, Sathappan Muthiah, Nathan Self, Rupinder Paul Khandpur, Parang Saraf, Wei Wang, Jose Cadena, Anil Vullikanti, Gizem Korkmaz, Chris J. Kuhlman, Achla Marathe, Liang Zhao, Ting Hua, Feng Chen, ChangTien Lu, Bert Huang, Aravind Srinivasan, Khoa Trinh, Lise Getoor, Graham Katz, Andy Doyle, Chris Ackermann, Ilya Zavorin, Jim Ford, Kristen Maria Summers, Youssef Fayed, Jaime Arredondo, Dipak Gupta, David Mares code 112
Mobile app recommendations with security and privacy awareness Hengshu Zhu, Hui Xiong, Yong Ge, Enhong Chen code 110
Log-based predictive maintenance Ruben Sipos, Dmitriy Fradkin, Fabian Mörchen, Zhuang Wang code 110
Seven rules of thumb for web site experimenters Ron Kohavi, Alex Deng, Roger Longbotham, Ya Xu code 110
Heat kernel based community detection Kyle Kloster, David F. Gleich code 102
Who to follow and why: link prediction with explanations Nicola Barbieri, Francesco Bonchi, Giuseppe Manco code 100
Prediction of human emergency behavior and their mobility following large-scale disaster Xuan Song, Quanshi Zhang, Yoshihide Sekimoto, Ryosuke Shibasaki code 99
Core decomposition of uncertain graphs Francesco Bonchi, Francesco Gullo, Andreas Kaltenbrunner, Yana Volkovich code 96
CatchSync: catching synchronized behavior in large directed graphs Meng Jiang, Peng Cui, Alex Beutel, Christos Faloutsos, Shiqiang Yang code 95
Guilt by association: large scale malware detection by mining file-relation graphs Acar Tamersoy, Kevin A. Roundy, Duen Horng Chau code 91
Graph sample and hold: a framework for big-graph analytics Nesreen K. Ahmed, Nick G. Duffield, Jennifer Neville, Ramana Rao Kompella code 89
Balanced graph edge partition Florian Bourse, Marc Lelarge, Milan Vojnovic code 87
Modeling human location data with mixtures of kernel densities Moshe Lichman, Padhraic Smyth code 86
Mining topics in documents: standing on the shoulders of big data Zhiyuan Chen, Bing Liu code 82
Non-parametric scan statistics for event detection and forecasting in heterogeneous social media graphs Feng Chen, Daniel B. Neill code 81
Differentially private network data release via structural inference Qian Xiao, Rui Chen, KianLee Tan code 80
Community membership identification from small seed sets Isabel M. Kloumann, Jon M. Kleinberg code 80
On the permanence of vertices in network communities Tanmoy Chakraborty, Sriram Srinivasan, Niloy Ganguly, Animesh Mukherjee, Sanjukta Bhowmick code 80
Large scale visual recommendations from street fashion images Vignesh Jagadeesh, Robinson Piramuthu, Anurag Bhardwaj, Wei Di, Neel Sundaresan code 79
Active semi-supervised learning using sampling theory for graph signals Akshay Gadde, Aamir Anis, Antonio Ortega code 79
Unfolding physiological state: mortality modelling in intensive care units Marzyeh Ghassemi, Tristan Naumann, Finale DoshiVelez, Nicole Brimmer, Rohit Joshi, Anna Rumshisky, Peter Szolovits code 79
From micro to macro: data driven phenotyping by densification of longitudinal electronic medical records Jiayu Zhou, Fei Wang, Jianying Hu, Jieping Ye code 77
MMRate: inferring multi-aspect diffusion networks with multi-pattern cascades Senzhang Wang, Xia Hu, Philip S. Yu, Zhoujun Li code 76
ClusCite: effective citation recommendation by information network-based clustering Xiang Ren, Jialu Liu, Xiao Yu, Urvashi Khandelwal, Quanquan Gu, Lidan Wang, Jiawei Han code 73
EARS (earthquake alert and report system): a real time decision support system for earthquake crisis management Marco Avvenuti, Stefano Cresci, Andrea Marchetti, Carlo Meletti, Maurizio Tesconi code 73
A case study: privacy preserving release of spatio-temporal density in paris Gergely Ács, Claude Castelluccia code 72
We know what you want to buy: a demographic-based system for product recommendation on microblogs Wayne Xin Zhao, Yanwei Guo, Yulan He, Han Jiang, Yuexin Wu, Xiaoming Li code 68
Large-scale high-precision topic modeling on twitter ShuangHong Yang, Alek Kolcz, Andy Schlaikjer, Pankaj Gupta code 68
Automated hypothesis generation based on mining scientific literature W. Scott Spangler, Angela D. Wilkins, Benjamin J. Bachman, Meena Nagarajan, Tajhal Dayaram, Peter J. Haas, Sam Regenbogen, Curtis R. Pickering, Austin Comer, Jeffrey N. Myers, Ioana Stanoi, Linda Kato, Ana Lelescu, Jacques J. Labrie, Neha Parikh, Andreas Martin Lisewski, Lawrence A. Donehower, Ying Chen, Olivier Lichtarge code 65
Exploiting geographic dependencies for real estate appraisal: a mutual perspective of ranking and clustering Yanjie Fu, Hui Xiong, Yong Ge, Zijun Yao, Yu Zheng, ZhiHua Zhou code 64
People on drugs: credibility of user statements in health communities Subhabrata Mukherjee, Gerhard Weikum, Cristian DanescuNiculescuMizil code 64
FAST-PPR: scaling personalized pagerank estimation for large graphs Peter Lofgren, Siddhartha Banerjee, Ashish Goel, Seshadhri Comandur code 62
Modeling delayed feedback in display advertising Olivier Chapelle code 60
Gradient boosted feature selection Zhixiang Eddie Xu, Gao Huang, Kilian Q. Weinberger, Alice X. Zheng code 59
On social event organization Keqian Li, Wei Lu, Smriti Bhagat, Laks V. S. Lakshmanan, Cong Yu code 59
Event detection in activity networks Polina Rozenshtein, Aris Anagnostopoulos, Aristides Gionis, Nikolaj Tatti code 58
Scalable diffusion-aware optimization of network topology Elias Boutros Khalil, Bistra Dilkina, Le Song code 57
Methods for ordinal peer grading Karthik Raman, Thorsten Joachims code 55
FUNNEL: automatic mining of spatially coevolving epidemics Yasuko Matsubara, Yasushi Sakurai, Willem G. van Panhuis, Christos Faloutsos code 54
Large margin distribution machine Teng Zhang, ZhiHua Zhou code 51
SigniTrend: scalable detection of emerging topics in textual streams by hashed significance thresholds Erich Schubert, Michael Weiler, HansPeter Kriegel code 49
Correlating events with time series for incident diagnosis Chen Luo, JianGuang Lou, Qingwei Lin, Qiang Fu, Rui Ding, Dongmei Zhang, Zhe Wang code 49
Effective global approaches for mutual information based feature selection Xuan Vinh Nguyen, Jeffrey Chan, Simone Romano, James Bailey code 47
Integrating spreadsheet data via accurate and low-effort extraction Zhe Chen, Michael J. Cafarella code 43
Detecting moving object outliers in massive-scale trajectory streams Yanwei Yu, Lei Cao, Elke A. Rundensteiner, Qin Wang code 43
Crowdsourced time-sync video tagging using temporal and personalized topic modeling Bin Wu, Erheng Zhong, Ben Tan, Andrew Horner, Qiang Yang code 42
Matching users and items across domains to improve the recommendation quality ChungYi Li, ShouDe Lin code 41
An empirical study of reserve price optimisation in real-time bidding Shuai Yuan, Jun Wang, Bowei Chen, Peter Mason, Sam Seljan code 40
TCS: efficient topic discovery over crowd-oriented service data Yongxin Tong, Caleb Chen Cao, Lei Chen code 39
Fast influence-based coarsening for large networks Manish Purohit, B. Aditya Prakash, Chanhyun Kang, Yao Zhang, V. S. Subrahmanian code 39
Inside the atoms: ranking on a network of networks Jingchao Ni, Hanghang Tong, Wei Fan, Xiang Zhang code 38
Grouping students in educational settings Rakesh Agrawal, Behzad Golshan, Evimaria Terzi code 38
Top-k frequent itemsets via differentially private FP-trees Jaewoo Lee, Christopher W. Clifton code 37
FEMA: flexible evolutionary multi-faceted analysis for dynamic behavioral pattern discovery Meng Jiang, Peng Cui, Fei Wang, Xinran Xu, Wenwu Zhu, Shiqiang Yang code 37
A system to grade computer programming skills using machine learning Shashank Srikant, Varun Aggarwal code 37
Exponential random graph estimation under differential privacy Wentian Lu, Gerome Miklau code 36
A hazard based approach to user return time prediction Komal Kapoor, Mingxuan Sun, Jaideep Srivastava, Tao Ye code 36
Dynamics of news events and social media reaction Mikalai Tsytsarau, Themis Palpanas, Malú Castellanos code 36
Minimizing seed set selection with probabilistic coverage guarantee in a social network Peng Zhang, Wei Chen, Xiaoming Sun, Yajun Wang, Jialin Zhang code 36
Utilizing temporal patterns for estimating uncertainty in interpretable early decision making Mohamed F. Ghalwash, Vladan Radosavljevic, Zoran Obradovic code 35
Identifying and labeling search tasks via query-based hawkes processes Liangda Li, Hongbo Deng, Anlei Dong, Yi Chang, Hongyuan Zha code 34
Prototype-based learning on concept-drifting data streams Junming Shao, Zahra Ahmadi, Stefan Kramer code 34
GLAD: group anomaly detection in social media analysis Qi Rose Yu, Xinran He, Yan Liu code 34
Efficient multi-task feature learning with calibration Pinghua Gong, Jiayu Zhou, Wei Fan, Jieping Ye code 33
Budget pacing for targeted online advertisements at LinkedIn Deepak Agarwal, Souvik Ghosh, Kai Wei, Siyu You code 33
Active learning for sparse bayesian multilabel classification Deepak Vasisht, Andreas C. Damianou, Manik Varma, Ashish Kapoor code 33
Modeling professional similarity by mining professional career trajectories Ye Xu, Zang Li, Abhishek Gupta, Ahmet Bugdayci, Anmol Bhasin code 32
Incremental and decremental training for linear classification ChengHao Tsai, ChiehYen Lin, ChihJen Lin code 32
Online multiple kernel regression Doyen Sahoo, Steven C. H. Hoi, Bin Li code 30
Correlation clustering in MapReduce Flavio Chierichetti, Nilesh N. Dalvi, Ravi Kumar code 30
Predicting student risks through longitudinal analysis Ashay Tamhane, Shajith Ikbal, Bikram Sengupta, Mayuri Duggirala, James Appleton code 30
Activity-edge centric multi-label classification for mining heterogeneous information networks Yang Zhou, Ling Liu code 29
Almost linear-time algorithms for adaptive betweenness centrality using hypergraph sketches Yuichi Yoshida code 29
Provable deterministic leverage score sampling Dimitris S. Papailiopoulos, Anastasios Kyrillidis, Christos Boutsidis code 28
Time-varying learning and content analytics via sparse factor analysis Andrew S. Lan, Christoph Studer, Richard G. Baraniuk code 28
Using strong triadic closure to characterize ties in social networks Stavros Sintos, Panayiotis Tsaparas code 28
Style in the long tail: discovering unique interests with latent variable models in large scale social E-commerce Diane Hu, Rob Hall, Josh Attenberg code 27
Large-scale adaptive semi-supervised learning via unified inductive and transductive model De Wang, Feiping Nie, Heng Huang code 27
Open-domain quantity queries on web tables: annotation, response, and consensus models Sunita Sarawagi, Soumen Chakrabarti code 27
Activity ranking in LinkedIn feed Deepak Agarwal, BeeChung Chen, Rupesh Gupta, Joshua Hartman, Qi He, Anand Iyer, Sumanth Kolar, Yiming Ma, Pannagadatta Shivaswamy, Ajit Singh, Liang Zhang code 26
Improving management of aquatic invasions by integrating shipping network, ecological, and environmental data: data mining for social good Jian Xu, Thanuka L. Wickramarathne, Nitesh V. Chawla, Erin K. Grey, Karsten Steinhaeuser, Reuben P. Keller, John M. Drake, David M. Lodge code 26
Predicting employee expertise for talent management in the enterprise Kush R. Varshney, Vijil Chenthamarakshan, Scott W. Fancher, Jun Wang, DongPing Fang, Aleksandra Mojsilovic code 26
LASTA: large scale topic assignment on multiple social networks Nemanja Spasojevic, Jinyun Yan, Adithya Rao, Prantik Bhattacharyya code 26
Scalable hands-free transfer learning for online advertising Brian Dalessandro, Daizhuo Chen, Troy Raeder, Claudia Perlich, Melinda Han Williams, Foster J. Provost code 25
FBLG: a simple and effective approach for temporal dependence discovery from time series data Dehua Cheng, Mohammad Taha Bahadori, Yan Liu code 24
Experiments with non-parametric topic models Wray L. Buntine, Swapnil Mishra code 24
Profit-maximizing cluster hires Behzad Golshan, Theodoros Lappas, Evimaria Terzi code 24
Scalable heterogeneous translated hashing Ying Wei, Yangqiu Song, Yi Zhen, Bo Liu, Qiang Yang code 23
Stability of influence maximization Xinran He, David Kempe code 23
Personalized search result diversification via structured learning Shangsong Liang, Zhaochun Ren, Maarten de Rijke code 22
Algorithms for interpretable machine learning Cynthia Rudin code 22
Detecting anomalies in dynamic rating data: a robust probabilistic model for rating evolution Stephan Günnemann, Nikou Günnemann, Christos Faloutsos code 22
Scalable near real-time failure localization of data center networks Herodotos Herodotou, Bolin Ding, Shobana Balakrishnan, Geoff Outhred, Percy Fitter code 22
Community detection in graphs through correlation Lian Duan, William Nick Street, Yanchi Liu, Haibing Lu code 20
Up next: retrieval methods for large scale related video suggestion Michael Bendersky, Lluis Garcia Pueyo, Jeremiah J. Harmsen, Vanja Josifovski, Dima Lepikhin code 19
Unveiling clusters of events for alert and incident management in large-scale enterprise it Derek Lin, Rashmi Raghu, Vivek Ramamurthy, Jin Yu, Regunathan Radhakrishnan, Joseph Fernandez code 19
Clinical risk prediction with multilinear sparse logistic regression Fei Wang, Ping Zhang, Buyue Qian, Xiang Wang, Ian Davidson code 19
Streamed approximate counting of distinct elements: beating optimal batch methods Daniel Ting code 18
Box drawings for learning with imbalanced data Siong Thye Goh, Cynthia Rudin code 18
Applying data mining techniques to address critical process optimization needs in advanced manufacturing Li Zheng, Chunqiu Zeng, Lei Li, Yexi Jiang, Wei Xue, Jingxuan Li, Chao Shen, Wubai Zhou, Hongtai Li, Liang Tang, Tao Li, Bing Duan, Ming Lei, Pengnian Wang code 18
Modeling impression discounting in large-scale recommender systems Pei Lee, Laks V. S. Lakshmanan, Mitul Tiwari, Sam Shah code 17
Knock it off: profiling the online storefronts of counterfeit merchandise Matthew F. Der, Lawrence K. Saul, Stefan Savage, Geoffrey M. Voelker code 17
Simultaneous feature and feature group selection through hard thresholding Shuo Xiang, Tao Yang, Jieping Ye code 17
Quantifying herding effects in crowd wisdom Ting Wang, Dashun Wang, Fei Wang code 17
Temporal skeletonization on sequential data: patterns, categorization, and visualization Chuanren Liu, Kai Zhang, Hui Xiong, Geoff Jiang, Qiang Yang code 16
The interplay between dynamics and networks: centrality, communities, and cheeger inequality Rumi Ghosh, ShangHua Teng, Kristina Lerman, Xiaoran Yan code 16
Corporate residence fraud detection Enric Junqué de Fortuny, Marija Stankova, Julie Moeyersoms, Bart Minnaert, Foster J. Provost, David Martens code 16
Spatially embedded co-offence prediction using supervised learning Mohammad A. Tayebi, Martin Ester, Uwe Glässer, Patricia L. Brantingham code 16
Entity profiling with varying source reliabilities Furong Li, MongLi Lee, Wynne Hsu code 15
Analyzing expert behaviors in collaborative networks Huan Sun, Mudhakar Srivatsa, Shulong Tan, Yang Li, Lance M. Kaplan, Shu Tao, Xifeng Yan code 14
Probabilistic latent network visualization: inferring and embedding diffusion networks Takeshi Kurashima, Tomoharu Iwata, Noriko Takaya, Hiroshi Sawada code 14
Representative clustering of uncertain data Andreas Züfle, Tobias Emrich, Klaus Arthur Schmid, Nikos Mamoulis, Arthur Zimek, Matthias Renz code 14
Proactive workflow modeling by stochastic processes with application to healthcare operation and management Chuanren Liu, Yong Ge, Hui Xiong, Keli Xiao, Wei Geng, Matt Perkins code 14
New algorithms for parking demand management and a city-scale deployment Onno Zoeter, Christopher R. Dance, Stéphane Clinchant, JeanMarc Andreoli code 14
ISIS: a networked-epidemiology based pervasive web app for infectious disease pandemic planning and response Richard J. Beckman, Keith R. Bisset, Jiangzhuo Chen, Bryan L. Lewis, Madhav V. Marathe, Paula Elaine Stretz code 14
LWI-SVD: low-rank, windowed, incremental singular value decompositions on time-evolving data sets Xilun Chen, K. Selçuk Candan code 13
Scaling out big data missing value imputations: pythia vs. godzilla Christos Anagnostopoulos, Peter Triantafillou code 13
Modeling mass protest adoption in social network communities using geometric brownian motion Fang Jin, Rupinder Paul Khandpur, Nathan Self, Edward R. Dougherty, Sheng Guo, Feng Chen, B. Aditya Prakash, Naren Ramakrishnan code 13
Identifying tourists from public transport commuters Mingqiang Xue, Huayu Wu, Wei Chen, Wee Siong Ng, Gin Howe Goh code 13
Shallow semantic parsing of product offering titles (for better automatic hyperlink insertion) Gabor Melli code 12
Class-distribution regularized consensus maximization for alleviating overfitting in model combination Sihong Xie, Jing Gao, Wei Fan, Deepak S. Turaga, Philip S. Yu code 12
Distance metric learning using dropout: a structured regularization approach Qi Qian, Juhua Hu, Rong Jin, Jian Pei, Shenghuo Zhu code 12
Learning with dual heterogeneity: a nonparametric bayes model Hongxia Yang, Jingrui He code 12
Sentiment expression conditioned by affective transitions and social forces Moritz Sudhof, Andrés Goméz Emilsson, Andrew L. Maas, Christopher Potts code 12
Learning multifractal structure in large networks Austin R. Benson, Carlos Riquelme, Sven Schmit code 12
Optimal recommendations under attraction, aversion, and social influence Wei Lu, Stratis Ioannidis, Smriti Bhagat, Laks V. S. Lakshmanan code 11
The setwise stream classification problem Charu C. Aggarwal code 11
Predicting long-term impact of CQA posts: a comprehensive viewpoint Yuan Yao, Hanghang Tong, Feng Xu, Jian Lu code 11
Scaling up deep learning Yoshua Bengio code 11
Targeting direct cash transfers to the extremely poor Brian Abelson, Kush R. Varshney, Joy Sun code 10
FoodSIS: a text mining system to improve the state of food safety in singapore Kiran Kate, Sneha Chaudhari, Andy Prapanca, Jayant Kalagnanam code 10
Efficient SimRank computation via linearizationPublication of this article pending inquiry Takanori Maehara, Mitsuru Kusumoto, Kenichi Kawarabayashi code 9
Semantic visualization for spherical representation Tuan M. V. Le, Hady Wirawan Lauw code 9
Networked bandits with disjoint linear payoffs Meng Fang, Dacheng Tao code 9
Supervised deep learning with auxiliary networks Junbo Zhang, Guangjian Tian, Yadong Mu, Wei Fan code 9
Recommendation in social media: recent advances and new frontiers Jiliang Tang, Jie Tang, Huan Liu code 8
Good-enough brain model: challenges, algorithms and discoveries in multi-subject experiments Evangelos E. Papalexakis, Alona Fyshe, Nicholas D. Sidiropoulos, Partha Pratim Talukdar, Tom M. Mitchell, Christos Faloutsos code 8
SMVC: semi-supervised multi-view clustering in subspace projections Stephan Günnemann, Ines Färber, Matthias Sebastian Rüdiger, Thomas Seidl code 8
Towards scalable critical alert mining Bo Zong, Yinghui Wu, Jie Song, Ambuj K. Singh, Hasan Çam, Jiawei Han, Xifeng Yan code 8
LaSEWeb: automating search strategies over semi-structured web data Oleksandr Polozov, Sumit Gulwani code 7
Fast flux discriminant for large-scale sparse nonlinear classification Wenlin Chen, Yixin Chen, Kilian Q. Weinberger code 7
Parallel gibbs sampling for hierarchical dirichlet processes via gamma processes equivalence Dehua Cheng, Yan Liu code 7
A bayesian framework for estimating properties of network diffusions Varun R. Embar, Rama Kumar Pasumarthi, Indrajit Bhattacharya code 7
Reducing gang violence through network influence based targeting of social programs Paulo Shakarian, Joseph Salmento, William R. Pulleyblank, John Bertetto code 7
Correlation clustering: from theory to practice Francesco Bonchi, David GarcíaSoriano, Edo Liberty code 7
Leveraging user libraries to bootstrap collaborative filtering Laurent Charlin, Richard S. Zemel, Hugo Larochelle code 6
Active collaborative permutation learning Jialei Wang, Nathan Srebro, James Evans code 6
Topic-factorized ideal point estimation model for legislative voting network Yupeng Gu, Yizhou Sun, Ning Jiang, Bingyu Wang, Ting Chen code 6
Active-transductive learning with label-adapted kernels Dan Kushnir code 6
Safe and efficient screening for sparse support vector machine Zheng Zhao, Jun Liu, James Cox code 6
Empirical glitch explanations Tamraparni Dasu, Ji Meng Loh, Divesh Srivastava code 6
Deep learning Ruslan Salakhutdinov code 6
Scalable noise mining in long-term electrocardiographic time-series to predict death following heart attacks ChihChun Chia, Zeeshan Syed code 5
Unifying learning to rank and domain adaptation: enabling cross-task document scoring Mianwei Zhou, Kevin ChenChuan Chang code 5
LUDIA: an aggregate-constrained low-rank reconstruction algorithm to leverage publicly released health data Yubin Park, Joydeep Ghosh code 5
Improved testing of low rank matrices Yi Li, Zhengyu Wang, David P. Woodruff code 5
User effort minimization through adaptive diversification Mahbub Hasan, Abhijith Kashyap, Vagelis Hristidis, Vassilis J. Tsotras code 5
Online chinese restaurant process ChienLiang Liu, TsungHsun Tsai, ChiaHoang Lee code 5
From labor to trader: opinion elicitation via online crowds as a market Caleb Chen Cao, Lei Chen, H. V. Jagadish code 5
Relevant overlapping subspace clusters on categorical data Xiao He, Jing Feng, Bettina Konte, Son T. Mai, Claudia Plant code 5
An efficient algorithm for weak hierarchical lasso Yashu Liu, Jie Wang, Jieping Ye code 5
Bringing structure to text: mining phrases, entities, topics, and hierarchies Jiawei Han, Chi Wang, Ahmed ElKishky code 5
Frontiers in E-commerce personalization Sri Subramaniam code 4
Improving the modified nyström method using spectral shifting Shusen Wang, Chao Zhang, Hui Qian, Zhihua Zhang code 4
Scalable histograms on large probabilistic data Mingwang Tang, Feifei Li code 4
Computational epidemiology Madhav V. Marathe, Anil Kumar S. Vullikanti code 4
Statistically sound pattern discovery Wilhelmiina Hämäläinen, Geoffrey I. Webb code 4
Distance queries from sampled data: accurate and efficient Edith Cohen code 3
A multi-class boosting method with direct optimization Shaodan Zhai, Tian Xia, Shaojun Wang code 3
Factorized sparse learning models with interpretable high order feature interactions Sanjay Purushotham, Martin Renqiang Min, C.C. Jay Kuo, Rachel Ostroff code 2
Fast DTT: a near linear algorithm for decomposing a tensor into factor tensors Xiaomin Fang, Rong Pan code 2
Who are experts specializing in landscape photography?: analyzing topic-specific authority on content sharing services Bin Bi, Ben Kao, Chang Wan, Junghoo Cho code 2
Data science through the lens of social science Drew Conway code 2
Mining text snippets for images on the web Anitha Kannan, Simon Baker, Krishnan Ramnath, Juliet Fiss, Dahua Lin, Lucy Vanderwende, Rizwan Ansary, Ashish Kapoor, Qifa Ke, Matt Uyttendaele, XinJing Wang, Lei Zhang code 2
Novel geospatial interpolation analytics for general meteorological measurements Bingsheng Wang, Jinjun Xiong code 2
Multi-task copula by sparse graph regression Tianyi Zhou, Dacheng Tao code 1
Does social good justify risking personal privacy? Raghu Ramakrishnan, Geoffrey I. Webb code 1
Product selection problem: improve market share by learning consumer behavior Silei Xu, John ChiShing Lui code 1
Batch discovery of recurring rare classes toward identifying anomalous samples Murat Dundar, Halid Ziya Yerebakan, Bartek Rajwa code 1
Sleep analytics and online selective anomaly detection Tahereh Babaie, Sanjay Chawla, Romesh G. Abeysuriya code 1
A data driven approach to diagnosing and treating disease Eric E. Schadt code 1
Network structural analysis via core-tree-decomposition Publication of this article pending inquiry Takuya Akiba, Takanori Maehara, Kenichi Kawarabayashi code 1
Network mining and analysis for social applications Feida Zhu, Huan Sun, Xifeng Yan code 1
Medicine in the age of electronic health records Nigam Shah code 0
Filling context-ad vocabulary gaps with click logs Yukihiro Tagami, Toru Hotta, Yusuke Tanaka, Shingo Ono, Koji Tsukamoto, Akira Tajima code 0
Management and analytic of biomedical big data with cloud-based in-memory database and dynamic querying: a hands-on experience with real-world data Mengling Feng, Mohammad M. Ghassemi, Thomas Brennan, John Ellenberger, Ishrar Hussain, Roger G. Mark code 0
Bugbears or legitimate threats?: (social) scientists' criticisms of machine learning? Sendhil Mullainathan code 0
The battle for the future of data mining Oren Etzioni code 0
Data, predictions, and decisions in support of people and society Eric Horvitz code 0
Dual beta process priors for latent cluster discovery in chronic obstructive pulmonary disease James C. Ross, Peter J. Castaldi, Michael H. Cho, Jennifer G. Dy code 0
Predictive modeling in practice: a case study from sprint Tracy De Poalo, Jeremy Howard code 0
Information environment security Rand Waltzman code 0
Big data for social good Nathan Eagle code 0
Bringing data science to the speakers of every language Robert Munro code 0
Large scale predictive modeling for micro-simulation of 3G air interface load Dejan Radosavljevik, Peter van der Putten code 0