• Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Intelligence
    • Policy Intelligence
    • Security Intelligence
    • Economic Intelligence
    • Fashion Intelligence
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • LBNN Blueprints

Single-stream model enhances image translation efficiency

Simon Osuji by Simon Osuji
December 17, 2024
in Artificial Intelligence
0
Single-stream model enhances image translation efficiency
0
SHARES
2
VIEWS
Share on FacebookShare on Twitter


Single-stream image-to-image translation (SSIT): a more efficient approach to image translation
The SSIT model uses a single encoder to extract spatial features from the content image and DAdaINP to capture features from the style image. The decoder then combines these features to generate a new image with the desired style. Credit: Rina Oh / Sophia University, Japan

Among the many artificial intelligence and machine learning models available today for image translation, image-to-image translation models using Generative Adversarial Networks (GANs) can change the style of images.

Related posts

Our Favorite Pixel Phone Is $100 Off

Our Favorite Pixel Phone Is $100 Off

February 4, 2026
Governance and data readiness enable the agentic enterprise

Governance and data readiness enable the agentic enterprise

February 4, 2026

These models work by using two input images: a content image, which is altered to match the style of a reference image. The models are used for tasks like transforming images into different artistic styles, simulating weather changes, improving satellite video resolution, and helping autonomous vehicles recognize different lighting conditions, like day and night.

Now, researchers from Sophia University have developed a model which can reduce the computational requirements needed to run these models, making it possible to run them on a wide range of devices, including smartphones.

In a study published in the IEEE Open Journal of the Computer Society on 25 September 2024, Project Assistant Professor Rina Oh and Professor Tad Gonsalves from the Department of Information and Communication Sciences at Sophia University proposed a “single-stream image-to-image translation (SSIT)” model that uses only a single encoder to carry out this transformation.

Typically, image-to-image translation models require two encoders—one for the content image and one for the style image—to “understand” the images.

These encoders convert the content and style images into numerical values (feature space) that represent key aspects of the image, such as color, objects, and other features. The decoder then takes the combined content and style features and reconstructs the final image with the desired content and style.

In contrast, SSIT uses a single encoder to extract spatial features such as the shapes, object boundaries, and layouts of the content image.

For the style image, the model uses Direct Adaptive Instance Normalization with Pooling (DAdaINP), which captures key style details like colors and textures while focusing on the most prominent features to improve efficiency. A decoder then takes the combined content and style features and reconstructs the final image with the desired content and style.

Prof. Oh says, “We implemented a guided image-to-image translation model that performs style transformation with reduced GPU computational costs while referencing input style images.

“Unlike previous related models, our approach utilizes Pooling and Deformable Convolution to efficiently extract style features, enabling high-quality style transformation with both reduced computational cost and preserved spatial features in the content images.”

Single-stream image-to-image translation (SSIT): a more efficient approach to image translation
The SSIT model outperformed five existing models in image translation tasks such as seasonal changes (e.g., summer-to-winter), artistic style transformations (e.g., Monet and anime), and time/weather translations (e.g., day-to-night). Credit: R. Oh and T. Gonsalves / Sophia University, Japan. computer.org/csdl/journal/oj/2024/01/10694773/20wCWTplz7W

The model is trained using adversarial training, where the generated images are evaluated by a Discriminator with a Vision Transformer, which captures patterns in images. The discriminator assesses whether the generated images are real or fake by comparing them to the target images, while the generator learns to create images that can fool the discriminator.

Using the model, the researchers performed three types of image transformation tasks. The first involved seasonal transformation, where landscape photos were converted from summer to winter and vice versa.

The second task was photo-to-art conversion, in which landscape photos were transformed into famous artistic styles, such as those of Picasso, Monet, or anime.

The third task focused on time and weather translation for driving, where images captured from the front of a car were altered to simulate different conditions, such as changing from day to night or from sunny to rainy weather.

In all these tasks, the model performed better than five other GAN models (namely NST, CNNMRF, MUNIT, GDWCT, and TSIT), with lower Fréchet Inception Distance and Kernel Inception Distance scores. This demonstrates that the generated images were similar to the target styles and did a better job of replicating colors and artistic details.

“Our generator was able to reduce the computational cost and FLOPs compared to the other models because we employed a single encoder that consists of multiple convolution layers only for content image and placed pooling layers for extracting style features at different angles instead of convolution layers,” says Prof. Oh.

In the long run, the SSIT model has the potential to democratize image transformation, making it deployable on devices like smartphones or personal computers.

It enables users across various fields, including digital art, design, and scientific research, to create high-quality image transformations without relying on expensive hardware or cloud services.

More information:
Rina Oh et al, Photogenic Guided Image-to-Image Translation With Single Encoder, IEEE Open Journal of the Computer Society (2024). DOI: 10.1109/OJCS.2024.3462477

Provided by
Sophia University

Citation:
Single-stream model enhances image translation efficiency (2024, December 16)
retrieved 16 December 2024
from https://techxplore.com/news/2024-12-stream-image-efficiency.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

They Grew a Business to $200M+ Annual Revenue: Boll & Branch

Next Post

Ethena Labs launches stablecoin backed by BlackRock’s tokenized fund shares

Next Post
Ethena Labs launches stablecoin backed by BlackRock’s tokenized fund shares

Ethena Labs launches stablecoin backed by BlackRock’s tokenized fund shares

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

Akeso claims another PD-1/VEGF win; FDA requests more data from Novavax

Eli Lilly extends Purdue alliance; EMA investigates Valneva shot

9 months ago
AI chatbots are supposed to improve health care. But research says some are perpetuating racism

AI chatbots are supposed to improve health care. But research says some are perpetuating racism

2 years ago
Google’s AI-powered search expands outside U.S. to India and Japan

Google’s AI-powered search expands outside U.S. to India and Japan

2 years ago
Five African private equity and venture capital moves in January 2025

Five African private equity and venture capital moves in January 2025

12 months ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • The world’s top 10 most valuable car brands in 2025

    0 shares
    Share 0 Tweet 0
  • Top 10 African countries with the highest GDP per capita in 2025

    0 shares
    Share 0 Tweet 0
  • Global ranking of Top 5 smartphone brands in Q3, 2024

    0 shares
    Share 0 Tweet 0
  • When Will SHIB Reach $1? Here’s What ChatGPT Says

    0 shares
    Share 0 Tweet 0

Get strategic intelligence you won’t find anywhere else. Subscribe to the Limitless Beliefs Newsletter for monthly insights on overlooked business opportunities across Africa.

Subscription Form

© 2026 LBNN – All rights reserved.

Privacy Policy | About Us | Contact

Tiktok Youtube Telegram Instagram Linkedin X-twitter
No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • LBNN Blueprints
  • Quizzes
    • Enneagram quiz
  • Fashion Intelligence

© 2023 LBNN - All rights reserved.