Jun Yuan

Department of Data Science

New Jersey Institute of Technology

Newark, NJ 07102

I am a PhD candidate in Data Science, with a background in Mathematics and Statistics. My research interest lies in the intersection of human-centered AI and information visualization.

Currently, I am working on designing visual-conversational analytic interfaces for ranking interpretation on high-stake decision-making (e.g., student course evaluation, AI hiring practice); adapting Learning-to-rank for fairness, accountability, and transparency; adversarial attacks and detection on explainable AI methods; information-seeking in crisis.

news

Sep 11, 2024	Our submission has been accepted to the VISxAI workshop at IEEEVIS24! The title is “TalkToRanker: A Conversational Interface for Ranking-based Decision-Making.”
Dec 16, 2023	Presented poster at the “XAI in action workshop at NeurIPS’23,” on the paper titled “A Simple Scoring Function to Fool SHAP: Stealing from the One Above.”
Nov 20, 2023	Awarded 3rd place at NJIT Research Day Competition 2023.
Sep 23, 2023	Presented poster at the “2023 Asian American Engineer of the Year Award Annual Convention” with the title “Ranking Al Visualization for Transparent Decision-making.”

selected publications

A Simple Scoring Function to Fool SHAP: Stealing from the One Above

Jun Yuan, and Aritra Dasgupta

In NeurIPS23 Workshop of XAI in Action: Past, Present, and Future Applications , 2023

Abs Bib PDF

Explainable Al (XAl) methods such as SHAP can help discover unfairness in black-box models. If the XAl method reveals a significant impact from a "protected attribute" (e.g., gender, race) on the model output, the model is considered unfair. However, adversarial models can subvert the detection of XAl methods. Previous approaches to constructing such an adversarial model require access to underlying data distribution. We propose a simple rule that does not require access to the underlying data or data distribution. It can adapt any scoring function to fool XAl methods, such as SHAP. Our work calls for more attention to scoring functions besides classifiers in XAl research and reveals the limitations of XAl methods for explaining behaviors of scoring functions.
@inproceedings{yuan2023simple, title = {A Simple Scoring Function to Fool SHAP: Stealing from the One Above}, author = {Yuan, Jun and Dasgupta, Aritra}, booktitle = {NeurIPS23 Workshop of XAI in Action: Past, Present, and Future Applications}, year = {2023}, }
Introducing Contextual Transparency for Automated Decision Systems

Mona Sloane, Ian René Solano-Kamaiko, Jun Yuan, Aritra Dasgupta, and Julia Stoyanovich

Nature Machine Intelligence, 2023

Abs Bib PDF

As automated decision systems (ADS) get more deeply embedded into business processes worldwide, there is a growing need for practical ways to establish meaningful transparency. Here we argue that universally perfect transparency is impossible to achieve. We introduce the concept of contextual transparency as an approach that integrates social science, engineering and information design to help improve ADS transparency for specific professions, business processes and stakeholder groups. We demonstrate the applicability of the contextual transparency approach by using it for a well-established ADS transparency tool: nutritional labels that display specific information about an ADS. Empirically, it focuses on the profession of recruiting. Presenting data from an ongoing study about ADS use in recruiting alongside a typology of ADS nutritional labels, we suggest a nutritional label prototype for ADS-driven rankers such as LinkedIn Recruiter before closing with directions for future work.
@article{sloane2023introducing, title = {Introducing Contextual Transparency for Automated Decision Systems}, author = {Sloane, Mona and Solano-Kamaiko, Ian Ren{\'e} and Yuan, Jun and Dasgupta, Aritra and Stoyanovich, Julia}, journal = {Nature Machine Intelligence}, volume = {5}, number = {3}, pages = {187--195}, year = {2023}, publisher = {Nature Publishing Group UK London}, }
TRIVEA: Transparent Ranking Interpretation Using Visual Explanation of Black-box Algorithmic Rankers

Jun Yuan, Kaustav Bhattacharjee, Akm Zahirul Islam, and Aritra Dasgupta

The Visual Computer, 2024

Abs Bib PDF

Ranking schemes drive many real-world decisions, like, where to study, whom to hire, what to buy, etc. Many of these decisions often come with high consequences. For example, a university can be deemed less prestigious if not featured in a top-k list, and consumers might not even explore products that do not get recommended to buyers. At the heart of most of these decisions are opaque ranking schemes, which dictate the ordering of data entities, but their internal logic is inaccessible or proprietary. Drawing inferences about the ranking differences is like a guessing game to the stakeholders, like, the rankees (i.e., the entities who are ranked, like product companies) and the decision-makers (i.e., who use the rankings, like buyers). In this paper, we aim to enable transparency in ranking interpretation by using algorithmic rankers that learn from available data and by enabling human reasoning about the learned ranking differences using explainable AI (XAI) methods. To realize this aim, we leverage the exploration–explanation paradigm of human–data interaction to let human stakeholders explore subsets and groupings of complex multi-attribute ranking data using visual explanations of model fit and attribute influence on rankings. We realize this explanation paradigm for transparent ranking interpretation in TRIVEA, a visual analytic system that is fueled by: (i) visualizations of model fit derived from algorithmic rankers that learn the associations between attributes and rankings from available data and (ii) visual explanations derived from XAI methods that help abstract important patterns, like, the relative influence of attributes in different ranking ranges. Using TRIVEA, end users not trained in data science have the agency to transparently reason about the global and local behavior of the rankings without the need to open black-box ranking models and develop confidence in the resulting attribute-based inferences. We demonstrate the efficacy of TRIVEA using multiple usage scenarios and subjective feedback from researchers with diverse domain expertise.
@article{yuan2024trivea, title = {TRIVEA: Transparent Ranking Interpretation Using Visual Explanation of Black-box Algorithmic Rankers}, author = {Yuan, Jun and Bhattacharjee, Kaustav and Islam, Akm Zahirul and Dasgupta, Aritra}, journal = {The Visual Computer}, volume = {40}, number = {5}, pages = {3615--3631}, year = {2024}, publisher = {Springer}, }