Publications: F. Provost

← Back to Author Profile

  • Affiliation: None
  • Author ID: 1752722
  • Total Publications: 189

Download CSV for F. Provost

Title Year Citations Score
Data Science and its Relationship to Big Data and Data-Driven Decision Making
Big Data, 2013
View Details
2013 1372 99.9%
Get another label? improving data quality and data mining using multiple, noisy labelers
Knowledge Discovery and Data Mining, 2008
View Details
2008 1305 99.9%
E-Commerce Recommendation Applications
Data mining and knowledge discovery, 2004
View Details
2004 1825 99.9%
Robust Classification for Imprecise Environments
Machine-mediated learning, 2000
View Details
2000 1368 99.8%
The Case against Accuracy Estimation for Comparing Induction Algorithms
International Conference on Machine Learning, 1998
View Details
1998 1258 99.8%
Quality management on Amazon Mechanical Turk
AAAI Conference on Human Computation & Crowdsourcing, 2010
View Details
2010 1100 99.8%
Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction
Journal of Artificial Intelligence Research, 2003
View Details
2003 1009 99.7%
Adaptive Fraud Detection
Data mining and knowledge discovery, 1997
View Details
1997 935 99.7%
Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions
Knowledge Discovery and Data Mining, 1997
View Details
1997 899 99.7%
Network-Based Marketing: Identifying Likely Adopters Via Consumer Networks
2006
View Details
2006 618 99.5%
Machine Learning from Imbalanced Data Sets 101
2008
View Details
2008 522 99.5%
Tree Induction for Probability-Based Ranking
Machine-mediated learning, 2003
View Details
2003 605 99.4%
Explaining Data-Driven Document Classifications
MIS Q., 2014
View Details
2014 316 99.4%
Data science for business
2013
View Details
2013 293 99.2%
The effect of class distribution on classifier learning: an empirical study
2001
View Details
2001 528 99.2%
Activity monitoring: noticing interesting changes in behavior
Knowledge Discovery and Data Mining, 1999
View Details
1999 491 99.2%
Handling Missing Values when Applying Classification Models
Journal of machine learning research, 2007
View Details
2007 380 99.0%
Efficient progressive sampling
Knowledge Discovery and Data Mining, 1999
View Details
1999 406 98.8%
Tree Induction vs. Logistic Regression: A Learning-Curve Analysis
Journal of machine learning research, 2003
View Details
2003 390 98.8%
Repeated labeling using multiple noisy labelers
Data mining and knowledge discovery, 2013
View Details
2013 202 98.5%
In Pursuit of Enhanced Customer Retention Management: Review, Key Issues, and Future Directions
Customer Needs and Solutions, 2017
View Details
2017 162 98.4%
Guest Editors' Introduction: On Applied Research in Machine Learning
Machine-mediated learning, 1998
View Details
1998 315 98.3%
Predictive Modeling With Big Data: Is Bigger Really Better?
Big Data, 2013
View Details
2013 177 98.2%
Machine learning for targeted display advertising: transfer learning in action
Machine-mediated learning, 2013
View Details
2013 178 98.2%
A Simple Relational Classifier
2003
View Details
2003 302 98.1%
A Survey of Methods for Scaling Up Inductive Algorithms
Data mining and knowledge discovery, 1999
View Details
1999 283 98.0%
Bid optimizing and inventory scoring in targeted online advertising
Knowledge Discovery and Data Mining, 2012
View Details
2012 182 98.0%
Combining Data Mining and Machine Learning for Effective User Profiling
Knowledge Discovery and Data Mining, 1996
View Details
1996 231 97.6%
Toward intelligent assistance for a data mining process: an ontology-based approach for cost-sensitive classification
IEEE Transactions on Knowledge and Data Engineering, 2005
View Details
2005 224 97.4%
Mining Massive Fine-Grained Behavior Data to Improve Predictive Analytics
MIS Q., 2016
View Details
2016 123 97.4%
Active Sampling for Class Probability Estimation and Ranking
Machine-mediated learning, 2004
View Details
2004 224 97.3%
Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach
MIS Quarterly, 2020
View Details
2020 93 97.2%
A comparison of instance-level counterfactual explanation algorithms for behavioral and textual data: SEDC, LIME-C and SHAP-C
Advances in Data Analysis and Classification, 2019
View Details
2019 96 97.1%
Beat the Machine: Challenging Humans to Find a Predictive Model's “Unknown Unknowns”
JDIQ, 2015
View Details
2015 110 96.6%
Audience selection for on-line brand advertising: privacy-friendly social network targeting
Knowledge Discovery and Data Mining, 2009
View Details
2009 152 96.5%
Research Commentary - Information in Digital, Economic, and Social Networks
Information systems research, 2013
View Details
2013 107 95.9%
Active Feature-Value Acquisition
Management Sciences, 2009
View Details
2009 135 95.8%
Causally motivated attribution for online advertising
International Workshop on Data Mining for Online Advertising, 2012
View Details
2012 110 95.7%
Causal Decision Making and Causal Effect Estimation Are Not the Same... and Why It Matters
INFORMS Journal on Data Science, 2021
View Details
2021 56 95.6%
Applications of Data Mining to Electronic Commerce
Data mining and knowledge discovery, 2000
View Details
2000 172 95.5%
Robust Classification Systems for Imprecise Environments
AAAI/IAAI, 1998
View Details
1998 155 95.4%
Scaling Up: Distributed Machine Learning with Cooperation
AAAI/IAAI, Vol. 1, 1996
View Details
1996 149 95.4%
Inactive learning?: difficulties employing active learning in practice
SKDD, 2011
View Details
2011 105 94.8%
RL4: a tool for knowledge-based induction
[1990] Proceedings of the 2nd International IEEE Conference on Tools for Artificial Intelligence, 1990
View Details
1990 113 94.7%
Why label when you can search?: alternatives to active learning for applying human resources to build classification models under extreme class imbalance
Knowledge Discovery and Data Mining, 2010
View Details
2010 99 93.9%
Aggregation-based feature invention and relational concept classes
Knowledge Discovery and Data Mining, 2003
View Details
2003 128 93.6%
Distribution-based aggregation for relational learning with identifier attributes
Machine-mediated learning, 2006
View Details
2006 115 93.6%
On Applied Research in Machine Learning
1998
View Details
1998 118 93.4%
Beat the Machine: Challenging Workers to Find the Unknown Unknowns
Human Computation, 2011
View Details
2011 87 93.3%
The myth of the double-blind review?: author identification using only citations
SKDD, 2003
View Details
2003 123 93.2%
Active feature-value acquisition for classifier induction
Industrial Conference on Data Mining, 2004
View Details
2004 108 92.5%
Small Disjuncts in Action: Learning to Diagnose Errors in the Local Loop of the Telephone Network
International Conference on Machine Learning, 1993
View Details
1993 87 91.9%
Confidence Bands for ROC Curves: Methods and an Empirical Study
ROC Analysis in Artificial Intelligence, 2004
View Details
2004 99 91.6%
Inductive policy: The pragmatics of bias selection
Machine-mediated learning, 1995
View Details
1995 81 90.5%
Discovering Interesting Patterns for Investment Decision Making with GLOWER ☹—A Genetic Learner Overlaid with Entropy Reduction
Data mining and knowledge discovery, 2000
View Details
2000 92 90.0%
A Unified Approach to Active Dual Supervision for Labeling Features and Examples
ECML/PKDD, 2010
View Details
2010 66 89.8%
Causal Classification: Treatment Effect vs. Outcome Prediction
Social Science Research Network, 2019
View Details
2019 40 89.3%
An expected utility approach to active feature-value acquisition
Industrial Conference on Data Mining, 2005
View Details
2005 79 89.3%
Evaluating and Optimizing Online Advertising: Forget the Click, but There Are Good Proxies
Big Data, 2015
View Details
2015 49 89.2%
Cost-Effective Quality Assurance in Crowd Labeling
Information systems research, 2017
View Details
2017 45 89.2%
ROC confidence bands: an empirical evaluation
International Conference on Machine Learning, 2005
View Details
2005 78 89.2%
Finding Similar Mobile Consumers with a Privacy-Friendly Geosocial Design
Information systems research, 2015
View Details
2015 48 88.9%
Decision-Centric Active Learning of Binary-Outcome Models
Information systems research, 2007
View Details
2007 69 88.6%
Quality-Based Pricing for Crowdsourced Workers
2013
View Details
2013 49 88.0%
An Intelligent Assistant for the Knowledge Discovery Process
2001
View Details
2001 76 87.8%
Kdd-2001: Proceedings of the Seventh Acm Sigkdd International Conference on Knowledge Discovery and Data Mining : August 26-29, 2001 San Francisco, Ca, USA
2002
View Details
2002 75 87.7%
Scalable hands-free transfer learning for online advertising
Knowledge Discovery and Data Mining, 2014
View Details
2014 44 87.2%
Information in Digital, Economic and Social Networks
2012
View Details
2012 48 87.1%
Enhancing Transparency and Control When Drawing Data-Driven Inferences About Individuals
Big Data, 2016
View Details
2016 39 86.8%
Classification in Networked Data 0 : A toolkit and a univariate case study
2006
View Details
2006 62 86.6%
Beware of "Explanations" of AI
arXiv.org, 2025
View Details
2025 5 86.5%
AI Approaches to Fraud Detection and Risk Management
The AI Magazine, 1998
View Details
1998 61 85.8%
Active Learning for Class Probability Estimation and Ranking
International Joint Conference on Artificial Intelligence, 2001
View Details
2001 62 85.0%
Design principles of massive, robust prediction systems
Knowledge Discovery and Data Mining, 2012
View Details
2012 40 84.3%
Corporate residence fraud detection
Knowledge Discovery and Data Mining, 2014
View Details
2014 35 83.5%
Measuring Causal Impact of Online Actions via Natural Experiments: Application to Display Advertising
Knowledge Discovery and Data Mining, 2015
View Details
2015 32 82.5%
Economical active feature-value acquisition through Expected Utility estimation
UBDM '05, 2005
View Details
2005 48 82.3%
Distributed Machine Learning: Scaling Up with Coarse-grained Parallelism
Intelligent Systems in Molecular Biology, 1994
View Details
1994 41 81.8%
Suspicion scoring based on guilt-by-association, colle ctive inference, and focused data access 1
2005
View Details
2005 45 81.2%
Distributed Data Mining: Scaling up and beyond
2000
View Details
2000 49 81.2%
Social Network Collaborative Filtering
2008
View Details
2008 39 80.9%
Exploiting Background Knowledge in Automated Discovery
Knowledge Discovery and Data Mining, 1996
View Details
1996 41 80.6%
Pseudo-social network targeting from consumer transaction data
2011
View Details
2011 33 80.0%
Inductive Policy
AAAI Conference on Artificial Intelligence, 1992
View Details
1992 35 79.9%
Intelligent information triage
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001
View Details
2001 44 79.5%
A Survey of Methods for Scaling Up Inductive Learning Algorithms
1997
View Details
1997 40 79.5%
The Relational Vector-space Model and Industry Classification
2003
View Details
2003 43 79.2%
Fraud detection
2002
View Details
2002 43 79.2%
A Comparison of Methods for Treatment Assignment with an Application to Playlist Generation
Information systems research, 2020
View Details
2020 20 78.2%
Confidence Bands for Roc Curves
2003
View Details
2003 39 77.5%
Using co-visitation networks for detecting large scale online display advertising exchange fraud
Knowledge Discovery and Data Mining, 2013
View Details
2013 26 77.5%
Distributed Fault Tolerant Embedding of Binary Trees and Rings in Hypercubes
1989
View Details
1989 27 76.9%
Big Data, Data Science, and Civil Rights
arXiv.org, 2017
View Details
2017 22 76.6%
A benchmarking study of classification techniques for behavioral data
International Journal of Data Science and Analysis, 2019
View Details
2019 19 75.7%
Observational vs. Experimental Data When Making Automated Decisions Using Machine Learning
INFORMS Journal on Data Science, 2025
View Details
2025 3 75.4%
Suspicion scoring of networked entities based on guilt-by-association, collective inference, and focused data access 1
2005
View Details
2005 32 75.3%
Tree induction vs. logistic regression: a learning-curve analysis
2003
View Details
2003 34 75.1%
Authors' Response to Gong's, “Comment on Data Science and its Relationship to Big Data and Data-Driven Decision Making”
Big Data, 2014
View Details
2014 22 75.0%
Discovering Knowledge from Relational Data Extracted from Business News
2002
View Details
2002 33 74.4%
Towards Intelligent Assistance for a Data Mining Process
2005
View Details
2005 30 74.1%
Intelligent Assistance for the Data Mining Process: an Ontology-Based Approach
2002
View Details
2002 32 73.9%
Scaling Up Inductive Algorithms: An Overview
Knowledge Discovery and Data Mining, 1997
View Details
1997 29 73.5%
A Brief Survey of Machine Learning Methods for Classification in Networked Data and an Application to Suspicion Scoring
SNA@ICML, 2006
View Details
2006 28 73.3%
Proceedings of the ACM SIGKDD Workshop on Human Computation
Knowledge Discovery and Data Mining, 2009
View Details
2009 24 72.9%
Efficiently Constructing Relational Features from Background Knowledge for Inductive Machine Learning
KDD Workshop, 1994
View Details
1994 25 72.7%
Goal-Directed Inductive Learning: Trading off Accuracy for Reduced Error Cost
1994
View Details
1994 25 72.7%
Data acquisition and cost-effective predictive modeling: targeting offers for electronic commerce
International Conference on Evolutionary Computation, 2007
View Details
2007 26 72.6%
Increasing the Efficiency of Data Mining Algorithms with Breadth-First Marker Propagation
Knowledge Discovery and Data Mining, 1997
View Details
1997 27 72.1%
Scaling Up Inductive Learning with Massive Parallelism
Machine-mediated learning, 1996
View Details
1996 26 72.1%
Acora: Distribution-Based Aggregation for Relational Learning from Identifier Attributes
2005
View Details
2005 26 71.4%
Learning and Inference in Massive Social Networks
Mining and Learning with Graphs, 2007
View Details
2007 24 71.1%
Explaining Classification Models Built on High-Dimensional Sparse Data
arXiv.org, 2016
View Details
2016 16 69.9%
Analysis and Visualization of Classifier Performance with Nonuniform Class and Cost Distributions
1997
View Details
1997 23 68.9%
Unsupervised dimensionality reduction versus supervised regularization for classification from sparse data
Data mining and knowledge discovery, 2019
View Details
2019 14 68.9%
Iterative Weakening: Optimal and Near-Optimal Policies for the Selection of Search Bias
AAAI Conference on Artificial Intelligence, 1993
View Details
1993 20 68.6%
Pointwise ROC Confidence Bounds: An Empirical Evaluation
2005
View Details
2005 21 67.3%
Combining Data Mining and Machine Learning for Effective Fraud Detection
1997
View Details
1997 21 67.1%
Social Network Collaborative Filtering: Preliminary Results
2007
View Details
2007 18 65.6%
Proceedings of the First Workshop on Social Media Analytics
Knowledge Discovery and Data Mining, 2010
View Details
2010 15 64.8%
Deep Learning on Big, Sparse, Behavioral Data
Big Data, 2019
View Details
2019 11 63.3%
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, San Francisco, CA, USA, August 26-29, 2001
Knowledge Discovery and Data Mining, 2001
View Details
2001 17 61.5%
Guided Feature Labeling for Budget-Sensitive Learning Under Extreme Class Imbalance
2010
View Details
2010 12 60.5%
NetKit-SRL: A Toolkit for Network Learning and Inference
2005
View Details
2005 14 59.5%
Node classification over bipartite graphs through projection
Machine-mediated learning, 2020
View Details
2020 9 59.4%
Counterfactual Explanations for Data-Driven Decisions
International Conference on Interaction Sciences, 2019
View Details
2019 9 58.8%
A Distributed Algorithm for Embedding Trees in Hypercubes with Modifications for Run-Time Fault Tolerance
J. Parallel Distributed Comput., 1992
View Details
1992 12 58.2%
Relational Learning Problems and Simple Models
2003
View Details
2003 14 58.1%
Data-Driven Investment Strategies for Peer-to-Peer Lending: A Case Study for Teaching Data Science
Big Data, 2018
View Details
2018 9 58.1%
Wallenius Bayes
Machine-mediated learning, 2018
View Details
2018 8 55.5%
Geo-Social Targeting for Privacy-Friendly Mobile Advertising: Position Paper
2011
View Details
2011 9 55.0%
Toward economic machine learning and utility-based data mining
UBDM '05, 2005
View Details
2005 11 54.9%
Special Issue on Applications of data mining to electronic commerce
2001
View Details
2001 12 54.6%
Naive Algorithmic Collusion: When Do Bandit Learners Cooperate and When Do They Compete?
International Conference on Interaction Sciences, 2024
View Details
2024 3 54.5%
Scalable supervised dimensionality reduction using clustering
Knowledge Discovery and Data Mining, 2013
View Details
2013 8 53.9%
Explaining Documents' Classifications
2011
View Details
2011 8 52.7%
Inductive Strengthening: the Effects of a Simple Heuristic for Restricting Hypothesis Space Search
Analogical and Inductive Inference, 1992
View Details
1992 9 52.0%
Information Triage using Prospective Criteria
2001
View Details
2001 10 51.0%
Simple Models and Classification in Networked Data
2004
View Details
2004 9 50.3%
Telecommunications Network Diagnosis
2008
View Details
2008 7 48.7%
Matrix-Factorization-Based Dimensionality Reduction in the Predictive Modeling Process: A Design Science Perspective
2016
View Details
2016 5 45.6%
Hyperlocal: inferring location of IP addresses in real-time bid requests for mobile ads
Workshop on Location-based Social Networks, 2013
View Details
2013 5 44.8%
What If the Prompt Were Different? Counterfactual Explanations for the Characteristics of Generative Outputs
User Modeling, Adaptation, and Personalization, 2025
View Details
2025 1 44.5%
Tree Induction vs Logistic Regression A Learning Curve Analysis
2001
View Details
2001 7 44.3%
Selective Data Acquisition for Machine Learning Saar-Tsechansky
2011
View Details
2011 5 43.8%
Scaling up inductive learning with massive parallelism
Machine-mediated learning, 2004
View Details
2004 6 42.6%
Dimensionality Reduction via Matrix Factorization for Predictive Modeling from Large, Sparse Behavioral Data
2015
View Details
2015 4 40.7%
Viral Marketing: Identifying Likely Adopters Via Consumer Networks
2005
View Details
2005 5 40.3%
Active Inference and Learning for Classifying Streams
2010
View Details
2010 4 39.9%
Inductive Policy: The Pragmatics of Bias Selection
Machine-mediated learning, 2004
View Details
2004 5 39.3%
Predicting citation rates for physics papers: constructing features for an ordered probit model
SKDD, 2003
View Details
2003 5 38.9%
Tree Induction vs. Logistic Regression for Learning Rankings based on Likelihood of Class Membership
2002
View Details
2002 5 38.5%
Problem Definition, Data Cleaning, and Evaluation: A Classifier Learning Case Study
Informatica, 1999
View Details
1999 5 37.7%
A real-time expert system for trigger-logic monitoring
1990
View Details
1990 4 37.3%
Columbus: An Annotated Guide to the Scholarship on His Life and Writings, 1750 to 1988
1991
View Details
1991 4 36.3%
ClimBS: searching the bias space
Proceedings Fourth International Conference on Tools with Artificial Intelligence TAI '92, 1992
View Details
1992 4 35.7%
Measuring overlap of data bases in water supply and sanitation using sampling and the binomial probability distribution
Scientometrics, 1992
View Details
1992 4 35.7%
An exploratory study towards applying and demystifying deep learning classification on behavioral big data
2018
View Details
2018 3 35.5%
Pleasing the advertising oracle: Probabilistic prediction from sampled, aggregated ground truth
International Workshop on Data Mining for Online Advertising, 2014
View Details
2014 3 35.5%
Finding Mobile Consumers with a Privacy-Friendly Geo-Similarity Network *
2015
View Details
2015 3 35.4%
Combining Observational and Experimental Data to Improve Large-Scale Decision-Making
International Conference on Interaction Sciences, 2020
View Details
2020 3 35.0%
Active Learning for Decision Making
2004
View Details
2004 3 30.6%
Appears in User Modeling 2001 Workshop: Machine Learning, Information Retrieval and User Modeling Information Triage using Prospective Criteria
2001
View Details
2001 3 29.7%
Who's Watching TV?
Information systems research, 2016
View Details
2016 2 28.8%
Wallenius Naive Bayes
2013
View Details
2013 2 28.5%
Methods for Individual Treatment Assignment: An Application and Comparison for Playlist Generation
arXiv.org, 2020
View Details
2020 2 27.6%
Data Science for the Real Estate Industry
Knowledge Discovery and Data Mining, 2020
View Details
2020 2 27.6%
Toward optimal allocation of human resources for active learning withapplication to safe advertising
2009
View Details
2009 2 27.2%
Causal Post-Processing of Predictive Models
2024
View Details
2024 1 27.0%
Modeling complex networks for electronic commerce
ACM Conference on Economics and Computation, 2007
View Details
2007 2 26.2%
Roc Confidence Bands: An Empirical Study
2005
View Details
2005 2 25.3%
Aggregation for Predictive Modeling with Relational Data
2005
View Details
2005 2 25.3%
BINARY TREES AND RINGS IN HYPERCUBES
1989
View Details
1989 2 25.1%
On Shakespeare's Sonnet 116
1956
View Details
1956 1 22.1%
The Impact of Cloaking Digital Footprints on User Privacy and Personalization
Big Data, 2023
View Details
2023 1 21.3%
Annotated Bibliography of Edmund Spenser, 1937-1960
1964
View Details
1964 1 19.5%
Causal Classification: Treatment Effect Estimation vs. Outcome Prediction
Journal of machine learning research, 2022
View Details
2022 1 18.3%
A Data Scientist's Guide to Start-Ups
Big Data, 2014
View Details
2014 1 18.3%
Ethics and interventions: A commentary on how to “improve” prediction using behavior modification
International Journal of Forecasting, 2022
View Details
2022 1 18.3%
Data-Driven Investment Strategies for Peer-to-Peer Lending
2018
View Details
2018 1 18.0%
Societal Impact of Data Science and Artificial Intelligence
Knowledge Discovery and Data Mining, 2018
View Details
2018 1 18.0%
What Managers Need to Know About Big Data
2017
View Details
2017 1 17.9%
Node classification over bipartite graphs through projection
Machine-mediated learning, 2020
View Details
2020 1 17.2%
In memory of Tom Fawcett
Machine-mediated learning, 2020
View Details
2020 1 17.2%
E ciently Constructing Relational Features from Background Knowledge for Inductive Machine Learning Also appears in Proceedings AAAI-94 Workshop on Knowledge Discovery in Databases
2007
View Details
2007 1 16.8%
The Gift of Gab: Evidence TelE-Commerce Firms Can Profit from Viral Marketing
2005
View Details
2005 1 16.1%
Probability estimation in multi-relational domains
2005
View Details
2005 1 16.1%
Distributed Fault Tolerant Embeddings of Binary Trees in Hypercubes.
1988
View Details
1988 1 15.7%
Knowledge Discovery Using Concept-Class Taxonomies
Australian Conference on Artificial Intelligence, 2004
View Details
2004 1 15.5%
Discovering Interesting Patterns for Investment Decision Making with GLOWER-A Genetic Learner Overla
1996
View Details
1996 1 14.6%