Giter Club home page Giter Club logo

kdd2020's Introduction

KDD-2020 statistical information: papers,awards,workshops

avatar

1.NOTE

This repo is based on the KDD 2020 official website, to summarize the following aspects, for myself and everyone to quickly get the whole idea of this conference.

  • the key words of accepted papers
  • the topic of accepted papers
  • author information of accepted papers
  • KDD awards
  • KDD workshop, tutorial

Outline of this repo:

2.Summary:

  • Total num (RT+ADS), Research Track, and Applied Data Science (ADS) track of paper in KDD 2020 is : **338 217 121 **;
  • Total number of submission: 2035 (the highest in history,over 13% more than the second highest one)
  • Research track(long paper): 1279 submition, 216 accepted, 216 / 1279 = 16.9%;
  • Conference schedule: Click here
  • KDD recent 7 years submission and recept data:

avatar

[2012-ACM SIGKDD 数据挖掘及知识发现会议]

An introduction about the history of KDD, the topic of KDD, and the general issue about the 2012 KDD conference to be held in Beijing, China, by Prof. Jiangyong Wang from Tsinghua University.

3.Keywork cloud: Generate keyword cloud based on titles

avatar

4.Topic cloud: Classify papers by topics

topic num_papers
graph 74
convolutional neural networks 6
unsupervised 6
generative adversarial 6
reinforcement learning 6
time series 5
semi-supervised 5
real-time 5
GAN 4
GNN 4
multi-task 4
GCN 3
meta-learning 3
Federated 2
privacy 2
private 1
interpretability 1
Knowledge Distillation 1
co-training 1
CNN 1
GANs 1
transfer learning 1

avatar

5. KDD Awards

5.1 SIGKDD Dissertation Award

Dissertation Award Running Up: Jingbo Shang, Assistant professor in UCSD, PHD from UIUC with Prof.Jiawei Han;

In the speech, Prof.Shang introduced his work during PhD study about automatic learning from text: turning unstructured data into structured knowledge;

avatar

QA:

1)How to improve our research taste? How to choose a good topic?

I focused on three parts: real problem; real data; real solution.

**If an algorithm is very specific, if we release some condition, the method will not work that well. So we have the chance to improve.**For example, for my PHD study, supervised learning on text is well studied, but i move to unsupervised learning, to form my phd thesis.

2)What is your future research interest?

two things,

  • week supervision. Utilize huge amount of unlabelled data;
  • How to combine sturctured and unstructured data;

Dissertation Award: Rediet Abebe, Assistant Professor in UC Berkeley, PHD from Cornell University;

avatar

5.2 Test of time award - research--Victor S. Sheng

"Get Another Label? Improving Data Quality And Data Mining Using Multiple, Noisy Labelers."

Victor S. Sheng, Foster Provost and Panagiotis Ipeirotis

5.3 Test of time award - applied data science -- Jie Tang

"ArnetMiner: Extraction And Mining Of Academic Social Networks."

Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang and Zhong Su

5.4 ACM SIGKDD Rising Star Award

avatar

avatar

5.5 ACM SIGKDD Service Award

Mike Zeller, Head of AI strategy and solutions , Temask.

avatar

5.6 Best paper award

Research Track

Best Paper Award On Sampled Metrics for Item Recommendation Walid Krichene: Google; Steffen Rendle: Google

Best Paper Runner-up Malicious Attacks against Deep Reinforcement Learning Interpretations Mengdi Huai: University of Virginia; Jianhui Sun: University of Virginia; Renqin Cai: University of Virginia; Liuyi Yao: University of New York at Buffalo; Aidong Zhang: University of Virginia

Best Student Paper Award TIPRDC: Task-Independent Privacy-Respecting Data Crowdsourcing Framework for Deep Learning with Anonymized Intermediate Representations Ang Li: Duke University; Yixiao Duan: Beihang University; Huanrui Yang: Duke University; Yiran Chen: Duke University; Jianlei Yang: Beihang University

Applied Data Science Track

Best Paper Award

Temporal-Contextual Recommendation in Real-Time Yifei Ma; Murali Balakrishnan Narayanaswamy, Haibin Lin and Hao Ding from Amazon.

6.Some interesting activities in the conference (lecture-style tutorial, workshop, research paper)

6.1. Lecture-style Tutorial: Adversarial Attacks and Defenses: Frontiers, Advances and Practice

Presenter: Han Xu, Yaxin Li, Wei Jin, Jiliang Tang, from Michigan State University

tutorial website

tutorial slide

6.2. 2020 INTERNATIONAL WORKSHOP ON TALENT AND MANAGEMENT COMPUTING

workshop website

paper list:

  • Salary Prediction Problem as Ordinal Regression, Speaker: Tomoki Ohtsuki
  • Dynamic Non-negative Matrix Factorization with Temporal Smoothness and Its Application to Email Analysis, Speaker: Mandar Chaudhary
  • Large-Scale Talent Flow Embedding for Company Competitive Analysis, Speaker: Le Zhang
  • Fine-grained Job Salary Benchmarking with Nonparametric Dirichlet-process-based Latent Factor Model, Speaker: Qinxin Meng
  • Personalized Employee Tranining Course Recommendation with Career Development Awareness, Speaker: Chao Wang
  • Research on Medical Talent Evaluation through Applied Behavior Analysis, Speaker: Bo Jin

topics:

  • - Talent behavior modeling
  • - Talent personality and leadership
  • - Talent performance assessment
  • - Talent recruitment
  • - Talent retention and incentive
  • - Talent email sentiment analysis
  • - Job recommendation
  • - Fairness on recruitment
  • - Skill recommendation
  • - Person-job fit and job satisfaction
  • - Career development
  • - Professional social networks
  • - Team formation and task assignment
  • - Group-based decision-making
  • - Organizational change and stability
  • - Organizational culture and communication
  • - Organizational competition analysis
  • - Strategic management and planning

6.3. 1st ACM SIGKDD Workshop on Deep Learning for Spatiotemporal Data, Applications, and Systems

workshop website

topics:

1)Novel Deep Learning Techniques for Spatial and Spatio-Temporal Data: Spatial representation learning and deep neural networks for spatio-temporal data and geometric data Interpretable deep learning for spatial-temporal data Deep generative models for spatio-temporal data Deep reinforcement learning for spatio-temporal decision making problems

2)Novel Applications of Deep Learning Techniques to Spatio-temporal Computing Problems. : Geo-imagery and point cloud analysis (for remote sensing, Earth science, etc.) Deep learning for mobility and traffic data analytics Location-based social network data analytics, spatial event prediction and forecasting Learning for biological data with spatial structures (bio-molecule, brain networks, etc.)

3)Novel Deep Learning Systems for Spatio-temporal Applications: Real-time decision-making systems for traffic management, crime prediction, accident risk analysis, etc. GIS systems using deep learning (e.g., mapping, routing, or Smart city) Mobile computing systems using deep learning Interpretable deep learning systems for spatio-temporal temporal data

paper list:

6.4. Research track: Unsupervised learning 3

paper list:

  • CICLAD: A Fast and Memory-efficient Closed Itemset Miner for Streams
  • Parameterized Correlation Clustering in Hypergraphs and Bipartite Graphs
  • A Non-Iterative Quantile Change Detection Method in Mixture Model with Heavy-Tailed Components
  • Learning Based Distributed Tracking
  • Stable Learning via Differentiated Variable Decorrelation

These 5 papers with one sentence summary:

[CICLAD: A Fast and Memory-efficient Closed Itemset Miner for Streams]

2020 KDD, Tomas Martin, Guy Francoeur, Petko Valtchev from Centre de recherche en intelligence artificielle (CRIA), UQÀM, Canada;

summary: This paper proposes a new intersection-based sliding-window (frequent closed itemset )FCI miner, to mine the association rules.

[Parameterized Correlation Clustering in Hypergraphs and Bipartite Graphs]

2020 KDD, Nate Veldt (Cornell University), Anthony Wirth (U of Melbourne), David F. Gleich (Purdue);

summary: This paper solves clustering problem in Hypergraphs and Bipartite Graphs.

[A Non-Iterative Quantile Change Detection Method in Mixture Model with Heavy-Tailed Components]

2020 KDD,Yuantong Li from Purdue, Qi Ma and Sujit K. Ghosh from North Carolina State University;

summary: This is a proof-of-concept paper. This paper proposes a data-driven method for parameter estimation in mixture models with heavy-tailed component. While traditionally, most litereatures use iterative Expectation Maximization (EM) method, this paper proposes Non-Iterative Quantile Change Detection (NIQCD) by using change-point detection methods.

[Learning based distributed tracking]

2020 KDD, Hao Wu, Junhao Gan, Rui Zhang from U of Melbourne;

summary: This paper studies the fundamental problem called Distributed Tracking. WIth the popularity of machine learning, people starts to explore the theory of ML via data distribution. This paper follows this line of research and proposes two methods, i.e., w and w/o known of data distribution in advance, to minimize the communication cost in coordinator and players.

[Stable Learning via Differentiated Variable Decorrelation]

2020 KDD, Zheyean Shen: Tsinghua University; Peng Cui: Tsinghua University; Jiashuo Liu: Tsinghua University; Tong Zhang: Hong Kong University of Science and Technology; Bo Li: Tsinghua University; Zhitang Chen: Huawei Noah's Ark Lab

summary: This paper studies model robustness, i.e., the model could achieve similar performance in the chaning wild environment.

Method: This paper incorporates the unlabelled data from multiple environment in the variable decorrelation framework.

6.5. The 9th SIGKDD International Workshop on Urban Computing

official website

Topics of Interest:

Topics of interest include, but not limited to, the following aspects :

  • Data mining for urban planning and city configuration evaluation
  • Mining urban environmental, pollution, and ecological data
  • Knowledge discovery from sensor data for saving energy and resources
  • Data mining for sustainable and intelligent cities
  • Urban sensing and city dynamics sensing
  • Knowledge fusion from data across different domains
  • City-wide traffic modeling, visualization, analysis, and prediction
  • City-wide human mobility modeling, visualization, and understanding
  • City-wide intelligent transportation systems
  • Anomaly detection and event discovery in urban areas
  • Mining urban economics
  • Social behavior modeling, understanding, and patterns mining in urban spaces
  • City-wide mobile social applications in urban areas
  • Location-based social networks enabling urban computing scenarios
  • Smart recommendations in urban spaces
  • Intelligent delivery services and logistics industries in cities
  • Mining data from the Internet of Things in urban areas
  • Managing urban big data on the cloud
  • Interactive visual data analytics for urban computing
  • Federated learning for urban computing

Accepted papers:

  1. Towards Dynamic Urban Bike Usage Prediction for Station Network Reconfiguration [***PDF]*** Xi Yang and Suining He (The University of Connecticut)

  2. Leveraging Change Point Detection for Activity Transition Mining in the Context of Environmental Crowdsensing [***PDF]*** Hafsa El Hafyani(UVSQ-Université Paris-Saclay), Karine Zeitouni(UVSQ-Université Paris-Saclay), Yehia Taher(UVSQ-Université Paris-Saclay), Mohammad Abboud(Lebanese University)

  3. The Anatomy of the Daily Usage of Bike Sharing Systems: Elevation, Distance and Seasonality [***PDF]*** Injung Kim(University of Pittsburgh), Konstantinos Pelechrinis(University of Pittsburgh), Adam J. Lee(University of Pittsburgh)

  4. A Distributed Travel Time Estimation Capability for Metropolitan-sized Road Transportation Networks [***PDF]*** AArif Khan(Pacific Northwest National Laboratory), Arun V Sathanur(Pacific Northwest National Laboratory), Kelsey Maass(University of Washington), Robert Rallo(Pacific Northwest National Laboratory)

  5. An Interactive Data Visualization and Analytics Tool to Evaluate Mobility and Sociability Trends During COVID-19 [***PDF]*** Fan Zuo(New York University), Jingxing Wang(University of Washington), Jingqin Gao(New York University), Kaan Ozbay(New York University), Xuegang Jeff Ban(University of Washington), Yubin Shen(New York University), Hong Yang(Old Dominion University), Shri Iyer(New York University)

  6. Trends Drift Discovery for Individual Highway Drivers through Ensemble Learning [***PDF]*** Weilong Ding(North China University of Technology), Zhe Wang(North China University of Technology), Jianwu Wang(University of Maryland), and Yanbo Han(North China University of Technology)

  7. Cross-Domain Customer Profiling: Mining Passengers’ Food Ordering Patterns From Transportation Habits [***PDF]*** Xueou Wang(National University of Singapore), Yifang Yin(National University of Singapore), Bryan Hooi(National University of Singapore), See Kiong Ng(National University of Singapore), Wynne Hsu(National University of Singapore), Renrong Weng(Grab), Xiang Hui Nicholas Lim(Grab), Rui Tan(Grab)

  8. Speed Anomalies and Safe Departure Times from Uber Movement Data [***PDF]*** Nabil Al Nahin Ch(University of New Hampshire), John Krumm(Microsoft Research), Andrew Kun(University of New Hampshire)

  9. Bus Fuel Consumption Problem: An in-depth Analysis and Prediction [***PDF]*** Thanh-Nam Doan, Le Tuan Phan, Mina Sartipi. (University of Tennessee at Chattanooga)

  10. Online Weighted Bipartite Matching with Capacity Constraints [***PDF]*** Hao Wang, Zhenzhen Yan, Xiaohui Bei. (Nanyang Technological University)

  11. TESLA: A Centralized Taxi Dispatching Approach to Optimizing Revenue Efficiency with Global Fairness [***PDF]*** Huigui Rong(Hunan University), Qun Zhang(Hunan University), Xun Zhou(University of Iowa), Hongbo Jiang(Hunan University), Da Cao(Hunan University), Keqin Li(SUNY New Platz).

  12. RAD: Rapid Automobile Data Analytics Framework for Structured Data [***PDF]*** Nikhil Muralidhar(Virginia Polytechnic Institute and State University), Brian Mayer(Virginia Polytechnic Institute and State University), Nathan Self(Virginia Polytechnic Institute and State University), Naren Ramakrishnan(Virginia Polytechnic Institute and State University), Panduranga Kondoju(Ford Motor Company), Basavaraj Tonshal(Ford Motor Company), John Schmotzer(Ford Motor Company)

kdd2020's People

Contributors

rh20624 avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.