Friday, May 16, 2025
LBNN
  • Business
  • Markets
  • Politics
  • Crypto
  • Finance
  • Energy
  • Technology
  • Taxes
  • Creator Economy
  • Wealth Management
  • Documentaries
No Result
View All Result
LBNN

Helping machine learning models identify objects in any pose

Simon Osuji by Simon Osuji
December 17, 2024
in Artificial Intelligence
0
Helping machine learning models identify objects in any pose
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter


Helping machine learning models identify objects in any pose
In the joint semantic-pose embedding, images are clustered by semantics (left) and within each cluster images form a mini-cluster by pose (right). Credit: Wang et al., 2024.

A new visual recognition approach improved a machine learning technique’s ability to both identify an object and how it is oriented in space, according to a study presented in October at the European Conference on Computer Vision in Milan, Italy.

Related posts

Does Your City Use Chlorine or Chloramine to Treat Its Water?

Does Your City Use Chlorine or Chloramine to Treat Its Water?

May 16, 2025
Alibaba’s ZeroSearch method uses simulated search results to slash LLM training costs

Alibaba’s ZeroSearch method uses simulated search results to slash LLM training costs

May 16, 2025

Self-supervised learning is a machine learning approach that trains on unlabeled data, extending generalizability to real-world data. While it excels at identifying objects, a task called semantic classification, it may struggle to recognize objects in new poses.

This weakness quickly becomes a problem in situations like autonomous vehicle navigation, where an algorithm must assess whether an approaching car is a head-on collision threat or side-oriented and just passing by.

“Our work helps machines perceive the world more like humans do, paving the way for smarter robots, safer self-driving cars and more intuitive interactions between technology and the physical world,” said Stella Yu, a University of Michigan professor of computer science and engineering and senior author of the study.

To help machines learn both object identities and poses, the research team developed a new self-supervised learning benchmark with problem setting, training and evaluation protocols along with a dataset of unlabeled image triplets for pose-aware representation learning.

The image triplets involve capturing three adjacent shots of the same object with slight camera pose changes, known as a smooth viewpoint trajectory. However, neither object labels (e.g. “car”) nor pose labels (e.g., frontal view) are provided.

This mimics robotic vision where the robot pans a camera as it moves around the environment. While the robot understands it is viewing the same object, it does not know what the object is or its pose.

Previous approaches typically managed regularization by mapping different views of the same object to the same feature at the final layer of a deep neural network. The new approach uses the mid-layer feature and imposes viewpoint trajectory regularization, which instead maps three consecutive views of an object to a straight line in the feature space. The first strategy boosts pose estimation performance by 10–20%, whereas the second strategy further improves pose estimation by 4% without reducing semantic classification.

“More importantly, we map an image to a feature that encodes not only object identities but also object poses, and such a feature map can generalize better to images of novel objects the robot has never seen before,” said Jiayun Wang, a University of California Berkeley doctoral graduate of vision science and the Berkeley AI research lab and first author of the study.

This concept can be applied to uncover meaningful patterns in various types of related data, such as multichannel audio or time series. For instance, each snapshot of audio at a specific moment can be assigned a unique feature, while the entire sequence is mapped to a smooth feature trajectory that captures how things change continuously over time.

More information:
Jiayun Wang et al, Pose-Aware Self-supervised Learning with Viewpoint Trajectory Regularization, Computer Vision – ECCV 2024 (2024). DOI: 10.1007/978-3-031-72664-4_2

Provided by
University of Michigan College of Engineering

Citation:
Helping machine learning models identify objects in any pose (2024, December 17)
retrieved 17 December 2024
from https://techxplore.com/news/2024-12-machine-pose.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.





Source link

Previous Post

Africa must chart its own course in the global mineral landscape

Next Post

US Air Force Gears Up for CBRN Warfare

Next Post
US Air Force Gears Up for CBRN Warfare

US Air Force Gears Up for CBRN Warfare

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

RECOMMENDED NEWS

Phyllis Wambui’s Unwavering Support for Deborah Kituyi

Phyllis Wambui’s Unwavering Support for Deborah Kituyi

3 months ago
Dangote unfazed by Trump tariff’s impact on urea export, cites advantage over Algerian rivals

Dangote unfazed by Trump tariff’s impact on urea export, cites advantage over Algerian rivals

2 weeks ago
Ukraine Says It Struck Russian Oil Terminal

Ukraine Says It Struck Russian Oil Terminal

5 months ago
2023 a year of ‘good’ mixed with ‘bad’ and some ‘ugly’ for the SA defence industry

2023 a year of ‘good’ mixed with ‘bad’ and some ‘ugly’ for the SA defence industry

1 year ago

POPULAR NEWS

  • Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    Ghana to build three oil refineries, five petrochemical plants in energy sector overhaul

    0 shares
    Share 0 Tweet 0
  • When Will SHIB Reach $1? Here’s What ChatGPT Says

    0 shares
    Share 0 Tweet 0
  • Matthew Slater, son of Jackson State great, happy to see HBCUs back at the forefront

    0 shares
    Share 0 Tweet 0
  • Dolly Varden Focuses on Adding Ounces the Remainder of 2023

    0 shares
    Share 0 Tweet 0
  • US Dollar Might Fall To 96-97 Range in March 2024

    0 shares
    Share 0 Tweet 0
  • Privacy Policy
  • Contact

© 2023 LBNN - All rights reserved.

No Result
View All Result
  • Home
  • Business
  • Politics
  • Markets
  • Crypto
  • Economics
    • Manufacturing
    • Real Estate
    • Infrastructure
  • Finance
  • Energy
  • Creator Economy
  • Wealth Management
  • Taxes
  • Telecoms
  • Military & Defense
  • Careers
  • Technology
  • Artificial Intelligence
  • Investigative journalism
  • Art & Culture
  • Documentaries
  • Quizzes
    • Enneagram quiz
  • Newsletters
    • LBNN Newsletter
    • Divergent Capitalist

© 2023 LBNN - All rights reserved.