Below is the list of KDD-2004 accepted papers
http://www.acm.org/sigkdd/kdd2004/program/papers/
(1) RESEARCH TRACK ACCEPTED PAPERS
(A) FULL PAPERS
Interestingness of Frequent Itemsets Using Bayesian Networks
as Background Knowledge
Authors: Szymon Jaroszewicz, Dan Simovici
Rapid Detection of Significant Spatial Clusters
Authors: Daniel Neill, Andrew Moore
Clustering time series from ARMA models with clipped data
Authors: Anthony Bagnall, Gareth Janacek
Mining the Space of Graph Properties
Authors: Glen Jeh, Jennifer Widom
Fast Discovery of 'Connection Subgraphs'
Authors: Christos Faloutsos, Kevin McCurley, Andrew Tomkins
Regularized Multi-Task Learning
Authors: Theodoros Evgeniou, Massimiliano Pontil
Systematic Data Selection to Mine Concept-drifting Data Streams
Authors: Wei Fan
Cyclic Pattern Kernels for Predictive Graph Mining
Authors: Tamas Horvath, Thomas Gaertner, Stefan Wrobel
Incorporating Prior Knowledge with Weighted Margin Support Vector
Machines
Authors: Xiaoyun Wu, Rohini Srihari
Mining and Summarizing Customer Reviews
Authors: Minqing Hu, Bing Liu
Discovering Complex Matchings across Web Query Interfaces: A
Correlation Mining Approach
Authors: Bin He, Kevin Chang, Jiawei Han
Support Envelopes: A Technique for Exploring the Structure of
Association Patterns
Authors: Michael Steinbach, Pang-Ning Tan, Vipin Kumar
Towards Parameter-Free Data Mining
Authors: Eamonn Keogh, Stefano Lonardi, Chotirat Ann Ratanamahatana
Selection, Combination, and Evaluation of Effective Software
Sensors for Detecting Abnormal Computer Usage
Authors: Jude Shavlik, Mark Shavlik
Turning CARTwheels: An Alternating Algorithm for Mining Redescriptions
Authors: Naren Ramakrishnan, Deept Kumar, Bud Mishra, Malcolm
Potts, Richard Helm
The Complexity of Mining Maximal Frequent Itemsets and Maximal
Frequent Patterns
Authors: Guizhen Yang
Exploiting a Support-based Upper Bound of Pearson's Correlation
Coefficient for Efficiently Identifying Strongly Correlated Pairs
Authors: Hui Xiong, Shashi Shekhar, Pang-Ning Tan, Vipin Kumar
Fully Automatic Cross-Associations
Authors: Deepayan Chakrabarti, Spiros Papadimitriou, Dharmendra
Modha, Christos Faloutsos
Adversarial Classification
Authors: Nilesh Dalvi, Pedro Domingos, Mausam, Sumit Sanghai,
Deepak Verma
GPCA: An Efficient Dimension Reduction Scheme for Image Compression
and Retrieval
Authors: Jieping Ye, Ravi Janardan, Qi Li
IDR/QR: An Incremental Dimension Reduction Algorithm via QR Decomposition
Authors: Jieping Ye, Qi Li, Hui Xiong, Haesun Park, Ravi Janardan,
Vipin Kumar
Recovering Latent Time-Series from their Observed Sums: Network
Tomography with Particle Filters
Authors: Edoardo Airoldi, Christos Faloutsos
Mining, Indexing, and Querying Historical Spatiotemporal Data
Authors: Nikos Mamoulis, Huping Cao, George Kollios, Marios Hadjieleftheriou,
Yufei Tao, David W. L. Cheung
Web Usage Mining Based on Probabilistic Latent Semantic Analysis
Authors: Xin Jin, Yanzan Zhou, Bamshad Mobasher
Fast Mining of Spatial Collocations
Authors: Xin Zhang, Nikos Mamoulis, David W. L. Cheung, Yutao
Shou
Incremental Maintenance of Quotient Cube for Median
Authors: Cuiping Li, Gao Cong, Anthony K. H. Tung, Shan Wang
A Graph-Theoretic Approach to Extract Storylines from Search
Results
Authors: Ravi Kumar, Uma Mahadevan, D. Sivakumar
Probabilistic Author-Topic Models for Information Discovery
Authors: Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, Thomas
Griffiths
Mining Reference Tables for Automatic Text Segmentation
Authors: Eugene Agichtein, Venkatesh Ganti
On the Discovery of Significant Statistical Quantitative Rules
Authors: Hong Zhang, Balaji Padmanabhan, Alexander Tuzhilin
Approximating a Collection of Frequent Sets
Authors: Foto Afrati, Aristides Gionis, Heikki Mannila
Online Data Mining for Query Relaxation
Authors: Ion Muslea
A Probabilistic Framework for Semi-Supervised Clustering
Authors: Sugato Basu, Mikhail Bilenko, Raymond Mooney
Efficient Closed Pattern Mining in the Presence of Tough Block
Constraints
Authors: Krishna Gade, Jianyong Wang, George Karypis
Exploiting Dictionaries in Named Entity Extraction: Combining
SemiMarkov Extraction Processes and Data Integration Methods
Authors: William Cohen, Sunita Sarawagi
Scalable Mining of Large Disk-Based Graph Databases
Authors: Chen Wang, Wei Wang, Jian Pei, Yongtai Zhu, Baile Shi
A Bayesian Network Framework for Reject Inference
Authors: Andrew Smith, Charles Elkan
An Iterative Method for Multi-Class Cost-Sensitive Learning
Authors: Naoki Abe, Bianca Zadrozny
Data Mining in Metric Space: An Empirical Analysis of Supervised
Learning Performance Criteria
Authors: Richard Caruana, Alex Niculescu-Mizil
Fast Galactic Morphology via Eigenimages
Authors: Brigham Anderson, Andrew Moore, Andrew Connolly, Bob
Nichol
(B) POSTER PAPERS
Extending the Notion of Support
Authors: Michael Steinbach, Pang-Ning Tan, Hui Xiong, Vipin Kumar
When Do Data Mining Results Violate Privacy?
Authors: Murat Kantarcioglu, Jiashun Jin, Chris Clifton
A generative probabilistic approach to visualizing sets of symbolic
sequences
Authors: Peter Tino, Ata Kaban, Yi Sun
An Alpha-Semantics Graph Based Data Mining Approach to Retrieving
Image Database
Authors: Ruofei Zhang, Zhongfei (Mark) Zhang
Sleeved coClustering
Authors: Avraham Melkman, Eran Shaham
Discovering Additive Structure in Black Box Functions
Authors: Giles Hooker
A New Privacy Model and Association-Rule Mining Algorithm for
Large-Scale Distributed Environments
Authors: Bobi Gilburd, Assaf Schuster, Ran Wolff
The IOC algorithm: Efficient Many-Class Non-parametric Classification
for High-Dimensional Data
Authors: Ting Liu, Ke Yang, Andrew Moore
Visualization for Classification Problems, with Examples Using
Support Vector Machines
Authors: Dianne Cook, Doina Caragea, Vasant Honavar
IMMC: Incremental Maximum Margin Criterion
Authors: Jun Yang, Benyu Zhang, Shuicheng Yan, Zheng Chen, Weiguo
Fan, Qiang Yang, Wei-Ying Ma, Qiansheng Cheng
On Demand Classification of Data Streams
Authors: Charu Aggarwal, Jiawei Han, Jianyong Wang, Philip Yu
Dense Itemsets
Authors: Jouni K. Seppänen, Heikki Mannila
2PXMiner - Efficient Mining of Frequent XML Query Patterns with
Repeated Siblings
Authors: Liang-Huai Yang, Mong Li Lee, Wynne Hsu
On Detecting Space-Time Clusters
Authors: Vijay Iyengar
Cluster-based Concept Invention for Statistical Relational Learning
Authors: Alexandrin Popescul, Lyle Ungar
Mining Scale-free Networks using Geodesic Clustering
Authors: Andrew Wu, Michael Garland, Jiawei Han
Privacy-Preserving Bayesian Network Structure Computation on
Distributed Heterogeneous Data
Authors: Rebecca Wright, Zhiqiang Yang
Parallel Computation of High Dimensional Robust Correlation and
Covariance Matrices
Authors: Alan Wagner, James Chilson, Raymond Ng, Ruben Zamar
Automatic Multimedia Cross-modal Correlation Discovery
Authors: Jia-Yu Pan, Hyung-Jeong Yang, Christos Faloutsos, Pinar
Duygulu
Locating Secret Messages in Images
Authors: Ian Davidson, Goutam Paul
Diagnosing Extrapolation: Tree-Based Density Estimation
Authors: Giles Hooker
IncSpan: Incremental Mining of Sequential Patterns in Large Database
Authors: Hong Cheng, Xifeng Yan, Jiawei Han
Identifying Early Buyers from Purchase Data
Authors: Paat Rusmevichientong, Shenghuo Zhu, David Selinger
Optimal Randomization for Privacy Preserving Data Mining
Authors: Yu Zhu, Lei Liu
A DEA Approach for Model Combination
Authors: Eric Zheng, Balaji Padmanabhan
Learning Spatially Variant Dissimilarity (SVaD) Measures
Authors: Krishna Kummamuru, Raghu Krishnapuram, Rakesh Agrawal
Ordering Patterns by Combining Opinions from Multiple Sources
Authors: Pang-Ning Tan, Rong Jin
Semantic Representation, Search and Mining of Multimedia Content
Authors: Apostol Natsev, Milind Naphade, John R. Smith
SPIN: Mining Maximal Frequent Subgraphs from Graph Databases
Authors: Jun Huan, Wei Wang, Jan Prins, Jiong Yang
An Objective Evaluation Criterion for Clustering
Authors: Arindam Banerjee, John Langford
Clustering Moving Objects
Authors: Yifan Li, Jiawei Han, Jiong Yang
Rotation Invariant Measures for Trajectories
Authors: Michail Vlachos, Dimitrios Gunopulos, Gautam Das
A Framework for Ontology-Driven Subspace Clustering
Authors: Jinze Liu, Wei Wang, Jiong Yang
A Quickstart in Frequent Structure Mining can make a difference
Authors: Siegfried Nijssen, Joost N. Kok
Why Collective Inference Improves Relational Classification
Authors: David Jensen, Jennifer Neville, Brian Gallagher
Privacy Preserving Regression Modelling via Distributed Computation
Authors: Ashish Sanil, Alan Karr, Xiaodong Lin, Jerome Reiter
Column-Generation Boosting Methods for Mixture of Kernels
Authors: Jinbo Bi, Tong Zhang, Kristin Bennett
Redundancy Based Feature Selection for Microarray Data
Authors: Lei Yu, Huan Liu
Improved robustness of signature-based near-replica detection
via lexicon randomization
Authors: Aleksander Kolcz, Abdur Chowdhury, Joshua Alspector
Estimating the Size of the Telephone Universe: A Bayesian Mark-Recapture
Approach
Authors: David Poole
Belief State Approaches to Signaling Alarms in Surveillance Systems
Authors: Kaustav Das, Jeff Schneider
A Unified View of Kernel k-means, Spectral Clustering and Graph
Cuts
Authors: Brian Kulis, Yuqiang Guan, Inderjit Dhillon
A Cross-Collection Mixture Model for Comparative Text Mining
Authors: ChengXiang Zhai, Atulya Velivelli, Bei Yu
A Microeconomic Data Mining Problem: Customer-Oriented Catalog
Segmentation
Authors: Martin Ester, Rong Ge, Wen Jin, Zengjian Hu
A Generalized Maximum Entropy Approach to Bregman Co-clustering and
Matrix Approximation
Authors: Arindam Banerjee, Inderjit Dhillon, Joydeep Ghosh, Srujana
Merugu, Dharmendra Modha
(2) INDUSTRIAL TRACK ACCEPTED PAPERS
(A) FULL PAPERS
TiVo: Making Show Recommendations Using a Distributed Collaborative
Filtering Architecture
Authors: Kamal Ali, Wijnand Van Stam
V-Miner: Using Enhanced Parallel Coordinates to Mine Product
Design and Test Data
Authors: Kaidi Zhao, Bing Liu, Thomas Tirpak, Andreas Schaller
Early Detection of Insider Trading in Option Markets
Authors: Steve Donoho
Density-Based Spam Detector
Authors: Kenichi Yoshida, Fuminori Adachi, Takashi Washio, Hiroshi
Motoda, Teruaki Homma, Akihiro Nakashima, Hiromitsu Fujikawa,
Katsuyuki Yamazaki
Visually Mining and Monitoring Massive Time Series
Authors: Jessica Lin, Jeff Lankford, Eamonn Keogh, Stefano Lonardi
A Rank Sum Test Method for Informative Gene Discovery
Authors: Lin Deng, Jian Pei, Jinwen Ma, Dik Lun Lee
Learning to Detect Malicious Executables in the Wild
Authors: Jeremy Kolter, Marcus A. Maloof
Interactive Training of Advanced Classifiers for Mining Remote
Sensing Image Archives
Authors: Selim Aksoy, Krzysztof Koperski, Giovanni Marchisio,
Carsten Tusk
Predicting Customer Grocery Shopping Lists from POS Purchase
Data
Authors: Chad Cumby, Andy Fano, Rayid Ghani, Marko Krema
Eigenspace-based Anomaly Detection in Computer Systems
Authors: Tsuyoshi Ide, Hisashi Kashima
Mining Coherent Gene Clusters from Three-Dimensional Microarray
Data
Authors: Daxin Jiang, Jian Pei, Murali Ramanathan, Chun Tang,
Aidong Zhang
Predicting Prostate Cancer Recurrence via Maximizing the Concordance
Index
Authors: Lian Yan, David Verbel, Olivier Saidi
Effective Localized Regression for Damage Detection in Large
Complex Mechanical Structures
Authors: Aleksander Lazarevic, Ramdev Kanapady, Chandrika Kamath,
Vipin Kumar, Kumar Tamma
(B) POSTER PAPERS
A General Approach to Incorporate Data Quality Matrices into
Data Mining Algorithms
Authors: Ian Davidson, Ashish Grover, Ashwin Satyanarayana, Giri
Tayi
Tracking Dynamics of Topic Trends Using a Finite Mixture Model
Authors: Satoshi Morinaga, Kenji Yamanishi
ANN Quality Diagnostic Models for Packaging Manufacturing: An
Industrial Data Mining Case Study
Authors: Nicolas de Abajo, Alberto B. Diez, Vanesa Lobato, Sergio
R. Cuesta
Programming the K-means Clustering Algorithm in SQL
Authors: Carlos Ordonez
Document Preprocessing For Naive Bayes Classification and Clustering
with Mixture of Multinomials
Authors: Dmitry Pavlov, Ramnath Balasubramanyan, Byron Dom, Shyam
Kapur, Jignashu Parikh
Mining Traffic Data from Probe-Car System for Travel Time Prediction
Authors: Takayuki Nakata, Jun-ichi Takeuchi
1-Dimensional Splines as Building Blocks for Improving Accuracy
of Risk Outcomes Models
Authors: David Vogel, Morgan Wang
Learning a Complex Metabolomic Dataset Using Random Forests and
Support Vector Machines
Authors: Young Truong, Chris Beecher, Adele Cutler, Leanna House,
Xiaodong Lin, Stanley Young
Feature Selection in Scientific Applications
Authors: Erick Cantu-Paz, Shawn Newsam, Chandrika Kamath
Cross Channel Optimized Marketing by Reinforcement Learning
Authors: Naoki Abe, Naval Verma, Chidanand Apte, Robert Schroko
A System for Automated Mapping of Bill-of-Material Part Numbers
Authors: Jayant Kalagnanam, Moninder Singh, Sudhir Verma, Michael
Patek, Yuk Wah Wong
Analytical View of Business Data
Authors: Adam Yeh, Jonathan Tang, Youxuan Jin
Exploring the Community Structure of Newsgroups
Authors: Christian Borgs, Jennifer Chayes, Mohammad Mahdian,
Amin Saberi