Wednesday, June 4, 2025
LBNN
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • Documentaries
No Result
View All Result
LBNN

Mitigating spurious correlations for self-supervised recommendation

Simon Osuji by Simon Osuji
August 4, 2023
in Sports
0
Mitigating spurious correlations for self-supervised recommendation
0
SHARES
2
VIEWS
Share on FacebookShare on Twitter


by Beijing Zhongke Journal Publising Co.

Related posts

Bill Belichick on the Patriots getting Tom Brady a statue: ‘Give him whatever he wants’

Bill Belichick on the Patriots getting Tom Brady a statue: ‘Give him whatever he wants’

August 22, 2023
56% of USWNT fans surveyed say team has fallen behind, 55% disagree with Carli Lloyd

56% of USWNT fans surveyed say team has fallen behind, 55% disagree with Carli Lloyd

August 22, 2023
Mitigating spurious correlations for self-supervised recommendation
An example of spurious correlations in job recommendation, where the positive interactions with the full-time jobs show strong correlations with the users′ 4 and 6 years of experience. Credit: Beijing Zhongke Journal Publising Co. Ltd.

Recent years have witnessed the great success of self-supervised learning (SSL) in recommendation systems. However, SSL recommender models are likely to suffer from spurious correlations, leading to poor generalization. To mitigate spurious correlations, existing work usually pursues ID-based SSL recommendation or utilizes feature engineering to identify spurious features.

Nevertheless, ID-based SSL approaches sacrifice the positive impact of invariant features, while feature engineering methods require high-cost human labeling. In a paper published in Machine Intelligence Research, a team of researchers aim to automatically mitigate the effect of spurious correlations to address the problems. This objective requires to automatically mask spurious features without supervision, and block the negative effect transmission from spurious features to other features during SSL.

Self-supervised learning (SSL) approaches have recently become state-of-the-art (SOTA) for personalized recommendation. The core idea of SSL in recommendation is to learn better user and item representations via an additional self-discrimination task, which contrasts the augmentations over user-item features or user-item interaction graphs to discover the correlation relationships among features and interactions.

Despite the great success, SSL-based recommender models are vulnerable to spurious correlations due to fitting the correlations from the input features to interactions. Because of the selection bias in the data collection process, spurious correlations inevitably exist in the training data, where some spurious features show strong correlations with users’ positive interactions. By the self-discrimination task, SSL models tend to capture these spurious correlations, resulting in poor generalization ability.

To alleviate the harmful effect of spurious correlations on SSL models, existing solutions mainly fall into three categories: First is ID-based SSL methods, which only utilize IDs of users and items for collaborative filtering, and thus can avoid the harmful influence of some spurious features.

However, the user and item features are still useful in the recommendation, especially for users with sparse interactions. It is necessary to consider some invariant features that causally affect the interactions. Second is feature engineering methods, which are able to identify a set of spurious features manually or using human-machine hybrid approaches. Thereafter, they can train the SSL recommender models by discarding the identified features.

Nevertheless, feature engineering methods require extensive human-labeling work and thus are not applicable to large-scale recommendations with extensive user and item features. Third is informative feature selection methods, which are capable of automatically recognizing the informative cross features and removing the redundant ones in the training process. Nevertheless, spurious features might be very informative for the interaction prediction in the training data, and thus degrade the generalization ability.

To solve the problems, researchers require the SSL models to automatically mitigate the effect of spurious correlations. In order to achieve this objective, there exist two essential challenges: The first one is that it is non-trivial to mask spurious features without supervision. The other challenge is blocking the effect transmission from spurious features to other features is of vital importance.

To address the two challenges, researchers consider learning a feature mask mechanism from multiple environments to estimate the probabilities of spurious features and then adopt the mask mechanism to guide the feature augmentation in SSL models. Specifically, researchers can cluster the interactions into multiple environments, where each environment has similar feature distributions, but the distributions shift between environments.

The distribution shifts will guide the mask mechanism to capture invariant features across environments and exclude spurious features. Besides, they can utilize the mask mechanism to drop the spurious features as the augmented sample and then maximize the mutual information between the invariant features in the augmented sample and all the input features in the factual sample, pushing SSL models to ignore the spurious features and cut off the negative effect transmission from spurious features to invariant features.

To this end, researchers propose an invariant feature learning (IFL) framework for SSL recommender models to mitigate spurious correlations. In particular, IFL clusters the training interactions into multiple environments and leverages a masking mechanism with learnable parameters in [0, 1] to shield spurious correlations. To optimize the mask parameters, IFL adopts a variance loss to identify invariant features and achieve robust predictions across environments.

As for the self-discrimination task, researchers drop the spurious features based on the mask parameters as the augmented sample, and then maximize the mutual information between the factual and augmented samples via contrastive loss, which pushes the SSL model to ignore spurious features. They instantiate IFL on a SOTA SSL model, and extensive experiments on two real-world datasets validate the effectiveness of the proposed IFL in mitigating spurious correlations.

The contributions of this paper are summarized as follows: Firstly, researchers point out the spurious correlations in SSL recommendation and consider learning invariant features from multiple environments.

Secondly, they propose a model-agnostic IFL framework, which leverages a feature mask mechanism and mask-guided contrastive learning to reduce spurious correlations for SSL models.

Thirdly, empirical results on two public datasets verify the superiority of their proposed IFL in masking spurious features and enhancing the generalization ability of SSL models.

More information:
Xin-Yu Lin et al, Mitigating Spurious Correlations for Self-supervised Recommendation, Machine Intelligence Research (2023). DOI: 10.1007/s11633-022-1374-8

Provided by
Beijing Zhongke Journal Publising Co.

Citation:
Mitigating spurious correlations for self-supervised recommendation (2023, August 4)
retrieved 4 August 2023
from https://techxplore.com/news/2023-08-mitigating-spurious-self-supervised.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

BRICS Eyeing Increased Strength and Global Role After Summit

Next Post

South Africa and the limits of having it both ways

Next Post
South Africa and the limits of having it both ways

South Africa and the limits of having it both ways

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

Esri Partners with TechSoup, Providing Nonprofit Organizations Easier Access to GIS Technology

Esri Partners with TechSoup, Providing Nonprofit Organizations Easier Access to GIS Technology

1 year ago
How US tariffs could impact GCC banks and economy

How US tariffs could impact GCC banks and economy

2 months ago
Central banks’ green push hits a hurdle as Fed drops out

Central banks’ green push hits a hurdle as Fed drops out

4 months ago
Stand-up comedians test ability of LLMs to write jokes

Stand-up comedians test ability of LLMs to write jokes

12 months ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • When Will SHIB Reach $1? Here’s What ChatGPT Says

    0 shares
    Share 0 Tweet 0
  • Matthew Slater, son of Jackson State great, happy to see HBCUs back at the forefront

    0 shares
    Share 0 Tweet 0
  • Dolly Varden Focuses on Adding Ounces the Remainder of 2023

    0 shares
    Share 0 Tweet 0
  • US Dollar Might Fall To 96-97 Range in March 2024

    0 shares
    Share 0 Tweet 0
  • Privacy Policy
  • Contact

© 2023 LBNN - All rights reserved.

No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • Documentaries
  • Quizzes
    • Enneagram quiz
  • Newsletters
    • LBNN Newsletter
    • Divergent Capitalist

© 2023 LBNN - All rights reserved.