Friday, May 16, 2025
LBNN
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • Documentaries
No Result
View All Result
LBNN

Alignment efficient image-sentence retrieval considering transferable cross-modal representation learning

Simon Osuji by Simon Osuji
February 27, 2024
in Artificial Intelligence
0
Alignment efficient image-sentence retrieval considering transferable cross-modal representation learning
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter


Alignment efficient image-sentence retrieval considering transferable cross-modal representation learning
The processing flow of AEIR. Credit: Yang Yang, Jinyi Guo, Guangyu Li, Lanyu Li, Wenjie Li, Jian Yang

Image-sentence retrieval task aims to search images for given sentences and retrieve sentences from image queries. The current retrieval methods are all supervised methods that require a large number of annotations for training. However, considering the labor cost, it is difficult to re-align large amounts of multimodal data in many applications (e.g., medical retrieval), which results in unsupervised multimodal data.

Related posts

Congress pushes GPS tracking for every exported semiconductor

Congress pushes GPS tracking for every exported semiconductor

May 16, 2025
Does Your City Use Chlorine or Chloramine to Treat Its Water?

Does Your City Use Chlorine or Chloramine to Treat Its Water?

May 16, 2025

A research team led by Yang Yang published their new research in Frontiers of Computer Science.

To solve the problem the team strive to take a step towards non-parallel image-sentence retrieval by designing the alignment transfer, and propose a novel Alignment Efficient Image-Sentence Retrieval method (AEIR).

In the research, AEIR use other auxiliary parallel data with multimodal consistency as the source domain and non-parallel data with missing consistency as the target domain. Unlike unimodal transfer learning, AEIR transfers semantic representations and modal consistency relations together from the source domain to the target domain.

Firstly, AEIR learns cross-modal consistency representations using cross-modal parallel data in the source domain. Then AEIR jointly optimizes adversarial learning-based semantic transfer constraints and metric learning-based structural transfer constraints to learn cross-domain cross-modal consistency representations to achieve transfer of consistency knowledge from the source domain to the target domain.

A large number of experimental experiments conducted in different transfer scenarios show that semantic transfer and structural transfer can effectively learn invariant features across modalities across domains. The proposed efficient alignment-based image-sentence retrieval network verifies that AEIR is more advantageous than current cross-modal retrieval methods, semi-supervised cross-modal retrieval methods and cross-modal transfer methods.

Future work can focus on the conduction of positive cross-modal transfer considering the domain discrepancy.

More information:
Yang Yang et al, Alignment efficient image-sentence retrieval considering transferable cross-modal representation learning, Frontiers of Computer Science (2023). DOI: 10.1007/s11704-023-3186-6

Provided by
Higher Education Press

Citation:
Alignment efficient image-sentence retrieval considering transferable cross-modal representation learning (2024, February 26)
retrieved 26 February 2024
from https://techxplore.com/news/2024-02-alignment-efficient-image-sentence-modal.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

The Navy is trying to use quantum computers to task spy satellites

Next Post

Top Men’s Fashion Trends From Spring

Next Post
Top Men’s Fashion Trends From Spring

Top Men’s Fashion Trends From Spring

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

Ambassador Zhang Lizhong attended the Opening ceremony of Uganda-China Tourism and Cultural Symposium

Ambassador Zhang Lizhong attended the Opening ceremony of Uganda-China Tourism and Cultural Symposium

4 months ago
Ransomware Attacks Are Getting Worse

Ransomware Attacks Are Getting Worse

11 months ago
Ivy Wangeci’s Killer Naftali Kinuthia Jailed For 40 Years

Ivy Wangeci’s Killer Naftali Kinuthia Jailed For 40 Years

1 year ago
Seattle man sentenced to two years of probation for selling fake Native American art

Seattle man sentenced to two years of probation for selling fake Native American art

2 years ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • When Will SHIB Reach $1? Here’s What ChatGPT Says

    0 shares
    Share 0 Tweet 0
  • Matthew Slater, son of Jackson State great, happy to see HBCUs back at the forefront

    0 shares
    Share 0 Tweet 0
  • Dolly Varden Focuses on Adding Ounces the Remainder of 2023

    0 shares
    Share 0 Tweet 0
  • US Dollar Might Fall To 96-97 Range in March 2024

    0 shares
    Share 0 Tweet 0
  • Privacy Policy
  • Contact

© 2023 LBNN - All rights reserved.

No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • Documentaries
  • Quizzes
    • Enneagram quiz
  • Newsletters
    • LBNN Newsletter
    • Divergent Capitalist

© 2023 LBNN - All rights reserved.