Publications: João Sedoc
| Title | Year | Citations | Score |
|---|---|---|---|
|
Large language models could change the future of behavioral healthcare: a proposal for responsible development and evaluation
NPJ Mental Health Research 3 (1), 12, 2024 View Details |
2024 | 147 | 99.0% |
|
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
arXiv preprint arXiv:2102.01672, 2021 View Details |
2021 | 173 | 96.3% |
|
Health-focused conversational agents in person-centered care: a review of apps
NPJ digital medicine 5 (1), 1-9, 2022 View Details |
2022 | 80 | 94.6% |
|
Natural language processing methods are sensitive to sub-clinical linguistic differences in schizophrenia spectrum disorders
npj Schizophrenia 7 (1), 1-8, 2021 View Details |
2021 | 110 | 93.4% |
|
Overview of the dialogue breakdown detection challenge 4
Increasing Naturalness and Flexibility in Spoken Dialogue Interaction: 10th …, 2021 View Details |
2021 | 103 | 92.9% |
|
Comparison of Diverse Decoding Methods from Conditional Language Models
Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019 View Details |
2019 | 154 | 91.7% |
|
Large language models could change the future of behavioral healthcare: A proposal for responsible development and evaluation. npj Mental Health Research, 3 (1): 1–12, April 2024
URL https://doi. org/10.1038/s44184-024-00056-z, 2024 View Details |
2024 | 24 | 91.6% |
|
Linear Connectivity Reveals Generalization Strategies
arXiv preprint arXiv:2205.12411, 2022 View Details |
2022 | 54 | 91.3% |
|
Empathic Conversations: A Multi-level Dataset of Contextualized Conversations
arXiv preprint arXiv:2205.12698, 2022 View Details |
2022 | 51 | 90.6% |
|
Findings of WASSA 2023 Shared Task on Empathy, Emotion and Personality Detection in Conversation and Reactions to News Articles
Proceedings of the 13th Workshop on Computational Approaches to Subjectivity …, 2023 View Details |
2023 | 29 | 90.3% |
|
MimicNet: fast performance estimates for data center networks with machine learning
Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 287-304, 2021 View Details |
2021 | 69 | 88.5% |
|
WASSA 2022 Shared Task: Predicting Empathy, Emotion and Personality in Reaction to News Stories
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity …, 2022 View Details |
2022 | 42 | 88.3% |
|
An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives
arXiv preprint arXiv:2310.17017, 2023 View Details |
2023 | 24 | 88.0% |
|
Modeling Empathy and Distress in Reaction to News Stories
Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018 View Details |
2018 | 135 | 87.7% |
|
Findings of wassa 2024 shared task on empathy and personality detection in interactions
Proceedings of the 14th Workshop on Computational Approaches to Subjectivity …, 2024 View Details |
2024 | 17 | 87.5% |
|
ChatEval: A Tool for Chatbot Evaluation
Proceedings of the 2019 Conference of the North American Chapter of the …, 2019 View Details |
2019 | 99 | 86.4% |
|
Usability and Credibility of a COVID-19 Vaccine Chatbot for Young Adults and Health Workers in the United States: Formative Mixed Methods Study
JMIR Human Factors 10, e40533, 2023 View Details |
2023 | 20 | 85.3% |
|
Complexity-weighted loss and diverse reranking for sentence simplification
arXiv preprint arXiv:1904.02767, 2019 View Details |
2019 | 83 | 83.6% |
|
A Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 View Details |
2023 | 14 | 79.1% |
|
Artificial Intelligence Will Change the Future of Psychotherapy: A Proposal for Responsible, Psychologist-led Development
View Details |
2023 | 13 | 77.5% |
|
Continual Learning for Sentence Representations Using Conceptors
arXiv preprint arXiv:1904.09187, 2019 View Details |
2019 | 57 | 76.2% |
|
Multi-Emotion Classification for Song Lyrics
Proceedings of the Eleventh Workshop on Computational Approaches to …, 2021 View Details |
2021 | 34 | 75.8% |
|
Automatic evaluation and moderation of open-domain dialogue systems
arXiv preprint arXiv:2111.02110, 2021 View Details |
2021 | 31 | 73.6% |
|
Piloting a COVID-19 vaccine chatbot with young adults and health workers in the US to validate usability, credibility, and intention to use.
JMIR Human Factors, 2022 View Details |
2022 | 19 | 73.6% |
|
Large Language Models Show Human-like Social Desirability Biases in Survey Responses
arXiv preprint arXiv:2405.06058, 2024 View Details |
2024 | 8 | 73.0% |
|
Incremental Neural Coreference Resolution in Constant Memory
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 View Details |
2020 | 40 | 72.1% |
|
“This show hits really close to home on so many levels”: An analysis of Reddit comments about HBO’s Euphoria to understand viewers’ experiences of and reactions to substance use and mental illness
Drug and alcohol dependence 220, 108468, 2021 View Details |
2021 | 29 | 72.0% |
|
Learning Word Ratings for Empathy and Distress from Document-Level User Responses
Proceedings of the Twelfth Language Resources and Evaluation Conference …, 2020 View Details |
2020 | 39 | 71.5% |
|
Human-Centered Metrics for Dialog System Evaluation
arXiv preprint arXiv:2305.14757, 2023 View Details |
2023 | 10 | 71.4% |
|
Item Response Theory for Efficient Human Evaluation of Chatbots
Proceedings of the First Workshop on Evaluation and Comparison of NLP …, 2020 View Details |
2020 | 38 | 70.8% |
|
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
arXiv preprint arXiv:2206.11249, 2022 View Details |
2022 | 17 | 70.8% |
|
Using LLMs to Animate Interactive Story Characters with Emotions and Personality
2024 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and …, 2024 View Details |
2024 | 7 | 69.5% |
|
Large language models display human-like social desirability biases in Big Five personality surveys
PNAS nexus 3 (12), pgae533, 2024 View Details |
2024 | 7 | 69.5% |
|
Gendered Information in Resumes and its Role in Algorithmic and Human Hiring Bias
Academy of Management Proceedings 2022 (1), 17133, 2022 View Details |
2022 | 16 | 69.3% |
|
Large Human Language Models: A Need and the Challenges
arXiv preprint arXiv:2312.07751, 2023 View Details |
2023 | 9 | 68.7% |
|
Conceptor Debiasing of Word Representations Evaluated on WEAT
arXiv preprint arXiv:1906.05993, 2019 View Details |
2019 | 39 | 66.8% |
|
Decoding Methods for Neural Narrative Generation
arXiv preprint arXiv:2010.07375, 2020 View Details |
2020 | 31 | 65.5% |
|
Overview of Robust and Multilingual Automatic Evaluation Metrics for Open-Domain Dialogue Systems at DSTC 11 Track 4
arXiv preprint arXiv:2306.12794, 2023 View Details |
2023 | 8 | 65.5% |
|
On the Role of Summary Content Units in Text Summarization Evaluation
arXiv preprint arXiv:2404.01701, 2024 View Details |
2024 | 6 | 65.1% |
|
Gendered Information in Resumes and Hiring Bias: A Predictive Modeling Approach
Available at SSRN 4074976, 2022 View Details |
2022 | 12 | 61.0% |
|
WASSA 2021 Shared Task: Predicting Empathy and Emotion in Reaction to News Stories
Proceedings of the Eleventh Workshop on Computational Approaches to …, 2021 View Details |
2021 | 19 | 60.5% |
|
Item Response Theory for Natural Language Processing
Proceedings of the 18th Conference of the European Chapter of the …, 2024 View Details |
2024 | 5 | 59.3% |
|
Evaluating generative AI responses to real-world drug-related questions
Psychiatry Research 339, 116058, 2024 View Details |
2024 | 5 | 59.3% |
|
The 2024 GEM shared task on multilingual data-to-text generation and summarization: Overview and preliminary results
Proceedings of the 17th International Natural Language Generation Conference …, 2024 View Details |
2024 | 5 | 59.3% |
|
Measuring the Language of Self-Disclosure across Corpora
Findings of the Association for Computational Linguistics: ACL 2022, 1035-1047, 2022 View Details |
2022 | 11 | 58.4% |
|
Automatic Document Selection for Efficient Encoder Pretraining
arXiv preprint arXiv:2210.10951, 2022 View Details |
2022 | 10 | 55.5% |
|
Benchmark Data and Evaluation Framework for Intent Discovery Around COVID-19 Vaccine Hesitancy
arXiv preprint arXiv:2205.11966, 2022 View Details |
2022 | 10 | 55.5% |
|
Collecting Verified COVID-19 Question Answer Pairs
View Details |
2020 | 21 | 54.8% |
|
Semantic word clusters using signed spectral clustering
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017 View Details |
2017 | 33 | 54.8% |
|
The Role of Protected Class Word Lists in Bias Identification of Contextualized Word Representations
Proceedings of the First Workshop on Gender Bias in Natural Language …, 2019 View Details |
2019 | 25 | 54.4% |
|
Socially Responsible Data for Large Multilingual Language Models
arXiv preprint arXiv:2409.05247, 2024 View Details |
2024 | 4 | 51.6% |
|
Common Law Annotations: Investigating the Stability of Dialog System Output Annotations
Findings of the Association for Computational Linguistics: ACL 2023, 12315-12349, 2023 View Details |
2023 | 5 | 51.6% |
|
Overview of the Tenth Dialog System Technology Challenge: DSTC10
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 View Details |
2023 | 5 | 51.6% |
|
Conditioning on Dialog Acts improves Empathy Style Transfer
Findings of the Association for Computational Linguistics: EMNLP 2023, 13254 …, 2023 View Details |
2023 | 5 | 51.6% |
|
Conceptor-Aided Debiasing of Large Language Models
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 View Details |
2023 | 5 | 51.6% |
|
Unsupervised post-processing of word vectors via conceptor negation
Proceedings of the AAAI Conference on Artificial Intelligence 33, 6778-6785, 2019 View Details |
2019 | 22 | 51.0% |
|
Predicting emotional word ratings using distributional representations and signed clustering
Proceedings of the 15th Conference of the European Chapter of the …, 2017 View Details |
2017 | 28 | 50.4% |
|
Clustering Examples in Multi-Dataset Benchmarks with Item Response Theory
Proceedings of the Third Workshop on Insights from Negative Results in NLP …, 2022 View Details |
2022 | 8 | 48.6% |
|
Who says like a style of Vitamin: Towards Syntax-Aware DialogueSummarization using Multi-task Learning
arXiv preprint arXiv:2109.14199, 2021 View Details |
2021 | 11 | 44.2% |
|
Gendered Language in Resumes and its Implications for Algorithmic Bias in Hiring
arXiv preprint arXiv:2112.08910, 2021 View Details |
2021 | 11 | 44.2% |
|
COD3S: Diverse Generation with Discrete Semantic Signatures
arXiv preprint arXiv:2010.02882, 2020 View Details |
2020 | 14 | 43.3% |
|
Domain Aware Neural Dialog System
arXiv preprint arXiv:1708.00897, 2017 View Details |
2017 | 21 | 43.1% |
|
Fast Network Simulation Through Approximation or: How Blind Men Can Describe Elephants
Proceedings of the 17th ACM Workshop on Hot Topics in Networks, 141-147, 2018 View Details |
2018 | 19 | 42.9% |
|
Usability, Engagement, and Report Usefulness of Chatbot-Based Family Health History Data Collection: Mixed Methods Analysis
Journal of Medical Internet Research 26, e55164, 2024 View Details |
2024 | 3 | 41.0% |
|
Topic Modeling for Maternal Health Using Reddit
Proceedings of the 12th International Workshop on Health Text Mining and …, 2021 View Details |
2021 | 8 | 35.1% |
|
Measuring the ‘I don’t know’Problem through the Lens of Gricean Quantity
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 View Details |
2021 | 8 | 35.1% |
|
Lived Experience Matters: Automatic Detection of Stigma toward People Who Use Substances on Social Media
arXiv preprint arXiv:2302.02064, 2023 View Details |
2023 | 3 | 35.0% |
|
SIG
View Details |
2022 | 5 | 34.6% |
|
SMRT Chatbots: Improving Non-Task-Oriented Dialog with Simulated Multi-Reference Training
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 View Details |
2020 | 10 | 34.6% |
|
Enterprise to Computer: Star Trek chatbot
arXiv preprint arXiv:1708.00818, 2017 View Details |
2017 | 13 | 32.5% |
|
ChatEval: A Tool for the Systematic Evaluation of Chatbots
Proceedings of the Workshop on Intelligent Interactive Systems and Language …, 2018 View Details |
2018 | 12 | 32.3% |
|
An Evaluation Protocol for Generative Conversational Systems
arXiv preprint arXiv:2010.12741, 2020 View Details |
2020 | 8 | 29.5% |
|
Degendering Resumes for Fair Algorithmic Resume Screening
arXiv preprint arXiv:2112.08910, 2021 View Details |
2021 | 6 | 27.9% |
|
Correcting the Common Discourse Bias in Linear Representation of Sentences using Conceptors
arXiv preprint arXiv:1811.11002, 2018 View Details |
2018 | 9 | 27.0% |
|
Explicit and Implicit Large Language Model Personas Generate Opinions but Fail to Replicate Deeper Perceptions and Biases
arXiv preprint arXiv:2406.14462, 2024 View Details |
2024 | 2 | 25.7% |
|
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
arXiv preprint arXiv:2412.16882, 2024 View Details |
2024 | 2 | 25.7% |
|
From Text to Context: Contextualizing Language with Humans, Groups, and Communities for Socially Aware NLP
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024 View Details |
2024 | 2 | 25.7% |
|
Learning Neural Emotion Analysis from 100 Observations: The Surprising Effectiveness of Pre-Trained Word Representations
arXiv preprint arXiv:1810.10949, 2018 View Details |
2018 | 8 | 24.9% |
|
Gendered Language in Resumes--An Empirical Analysis of Gender Norm Violation and Hiring Outcomes
View Details |
2021 | 5 | 23.8% |
|
An Analysis of BERT FAQ Retrieval Models for COVID-19 Infobot
View Details |
2020 | 6 | 23.5% |
|
Getting in shape: word embedding subspaces
Proceedings of the 28th International Joint Conference on Artificial …, 2019 View Details |
2019 | 6 | 22.0% |
|
How to Choose How to Choose Your Chatbot: A Massively Multi-System MultiReference Data Set for Dialog Metric Evaluation
arXiv preprint arXiv:2305.14533, 2023 View Details |
2023 | 2 | 21.8% |
|
Overview of robust and multilingual automatic evaluation metricsfor open-domain dialogue systems at DSTC 11 track 4
Proceedings of The Eleventh Dialog System Technology Challenge, 260-273, 2023 View Details |
2023 | 2 | 21.8% |
|
Psychological Metrics for Dialog System Evaluation
arXiv preprint arXiv:2305.14757, 2023 View Details |
2023 | 2 | 21.8% |
|
Automatic Reflection Generation for Peer-to-Peer Counseling
Proceedings of the Third Workshop on Natural Language Generation, Evaluation …, 2023 View Details |
2023 | 2 | 21.8% |
|
Revisiting Memory-Efficient Incremental Coreference Resolution
arXiv preprint arXiv:2005.00128, 2020 View Details |
2020 | 5 | 20.1% |
|
Using the Poly-encoder for a COVID-19 Question Answering System
Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020, 2020 View Details |
2020 | 4 | 16.6% |
|
Deriving Verb Predicates By Clustering Verbs with Arguments
arXiv preprint arXiv:1708.00416, 2017 View Details |
2017 | 4 | 14.5% |
|
VIRATrustData: A Trust-Annotated Corpus of Human-Chatbot Conversations About COVID-19 Vaccines
arXiv preprint arXiv:2205.12240, 2022 View Details |
2022 | 2 | 12.1% |
|
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity …, 2022 View Details |
2022 | 2 | 12.1% |
|
Harnessing Artificial Intelligence to Improve Food Assistance: A Scoping Review of Machine Learning Tools
Preprints, 2022 View Details |
2022 | 2 | 12.1% |
|
Anonymization of Sensitive Information in Medical Health Records.
IberLEF@ SEPLN, 647-653, 2019 View Details |
2019 | 2 | 7.3% |
|
Multiscale Hidden Markov Models For Covariance Prediction
View Details |
2018 | 2 | 6.9% |
|
Trees in transformers: a theoretical analysis of the Transformer's ability to represent trees
arXiv preprint arXiv:2112.11913, 2021 View Details |
2021 | 1 | 0.0% |
|
Inducing Generalizable and Interpretable Lexica
Findings of the Association for Computational Linguistics: EMNLP 2022, 4430-4448, 2022 View Details |
2022 | 1 | 0.0% |
|
Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems
Proceedings of the 3rd Workshop on Human Evaluation of NLP Systems, 2023 View Details |
2023 | 1 | 0.0% |
|
Decreased Speech Coherence Captured by Novel Natural Language Processing Methods in Two Cohorts of Individuals With Schizophrenia
Biological Psychiatry 87 (9), S379-S380, 2020 View Details |
2020 | 1 | 0.0% |
|
Common Law Annotations: Investigating the Stability of Dialog Annotations
View Details |
2022 | 1 | 0.0% |
|
Towards Authoring Open-Ended Behaviors for Narrative Puzzle Games with Large Language Model Support
Proceedings of the 19th International Conference on the Foundations of …, 2024 View Details |
2024 | 1 | 0.0% |
|
From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research
arXiv preprint arXiv:2412.14461, 2024 View Details |
2024 | 1 | 0.0% |
|
Neural Tree Transducers for Tree to Tree Learning
View Details |
2018 | 1 | 0.0% |
|
The Fourth Workshop on Insights from Negative Results in NLP
The Fourth Workshop on Insights from Negative Results in NLP, 2023 View Details |
2023 | 1 | 0.0% |
|
The Illusion of Empathy: How AI Chatbots Shape Conversation Perception
arXiv preprint arXiv:2411.12877, 2024 View Details |
2024 | 1 | 0.0% |
|
Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models
arXiv preprint arXiv:2502.12825, 2025 View Details |
2025 | 1 | 0.0% |
|
The INLG 2024 Tutorial on Human Evaluation of NLP System Quality: Background, Overall Aims, and Summaries of Taught Units
Proceedings of the 17th International Natural Language Generation Conference …, 2024 View Details |
2024 | 1 | 0.0% |