Advertisement Banner
  • Home
  • News
    • PRESS RELEASE
  • Shop
  • BUSINESS
    • CRYPTO
    • ECONOMY
    • FINANCE
    • MARKET
    • MONEY
  • TECH
    • APPS
    • GADGET
    • MOBILE
    • SCIENCE
  • SOCIAL MEDIA
  • ENTERTAINMENT
    • ARTS & THEATER
    • GAMING
    • GAMBLING
    • MOVIE
    • MUSIC
    • SHOWS
    • SPORTS
  • LIFESTYLE
    • CELEBRITY
    • CULTURE
    • Education
    • FASHION
    • FOOD
    • HEALTH
    • HISTORY
    • Nature
    • Religion
    • Shopping
    • TRAVEL
  • REAL ESTATE
  • Blog
  • Classifieds
No Result
View All Result

No products in the cart.

  • Home
  • News
    • PRESS RELEASE
  • Shop
  • BUSINESS
    • CRYPTO
    • ECONOMY
    • FINANCE
    • MARKET
    • MONEY
  • TECH
    • APPS
    • GADGET
    • MOBILE
    • SCIENCE
  • SOCIAL MEDIA
  • ENTERTAINMENT
    • ARTS & THEATER
    • GAMING
    • GAMBLING
    • MOVIE
    • MUSIC
    • SHOWS
    • SPORTS
  • LIFESTYLE
    • CELEBRITY
    • CULTURE
    • Education
    • FASHION
    • FOOD
    • HEALTH
    • HISTORY
    • Nature
    • Religion
    • Shopping
    • TRAVEL
  • REAL ESTATE
  • Blog
  • Classifieds
No Result
View All Result
No Result
View All Result
Home GADGET

Meta’s open-source ImageBind AI aims to mimic human perception

North Dakota Digital News by North Dakota Digital News
May 9, 2023
in GADGET
38 1
0
Meta’s open-source ImageBind AI aims to mimic human perception
32
SHARES
356
VIEWS
Share on TwitterShare on Facebook


Meta is open-sourcing an AI tool called ImageBind that predicts connections between data similar to how humans perceive or imagine an environment. While image generators like Midjourney, Stable Diffusion and DALL-E 2 pair words with images, allowing you to generate visual scenes based only on a text description, ImageBind casts a broader net. It can link text, images / videos, audio, 3D measurements (depth), temperature data (thermal), and motion data (from inertial measurement units) — and it does this without having to first train on every possibility. It’s an early stage of a framework that could eventually generate complex environments from an input as simple as a text prompt, image or audio recording (or some combination of the three).

You could view ImageBind as moving machine learning closer to human learning. For example, if you’re standing in a stimulating environment like a busy city street, your brain (largely unconsciously) absorbs the sights, sounds and other sensory experiences to infer information about passing cars and pedestrians, tall buildings, weather and much more. Humans and other animals evolved to process this data for our genetic advantage: survival and passing on our DNA. (The more aware you are of your surroundings, the more you can avoid danger and adapt to your environment for better survival and prosperity.) As computers get closer to mimicking animals’ multi-sensory connections, they can use those links to generate fully realized scenes based only on limited chunks of data.

So, while you can use Midjourney to prompt “a basset hound wearing a Gandalf outfit while balancing on a beach ball” and get a relatively realistic photo of this bizarre scene, a multimodal AI tool like ImageBind may eventually create a video of the dog with corresponding sounds, including a detailed suburban living room, the room’s temperature and the precise locations of the dog and anyone else in the scene. “This creates distinctive opportunities to create animations out of static images by combining them with audio prompts,” Meta researchers said today in a developer-focused blog post. “For example, a creator could couple an image with an alarm clock and a rooster crowing, and use a crowing audio prompt to segment the rooster or the sound of an alarm to segment the clock and animate both into a video sequence.”

Series of two graphs with the title
Meta’s graph showing ImageBind’s accuracy outperforming single-mode models.

Meta

As for what else one could do with this new toy, it points clearly to one of Meta’s core ambitions: VR, mixed reality and the metaverse. For example, imagine a future headset that can construct fully realized 3D scenes (with sound, movement, etc.) on the fly. Or, virtual game developers could perhaps eventually use it to take much of the legwork out of their design process. Similarly, content creators could make immersive videos with realistic soundscapes and movement based on only text, image or audio input. It’s also easy to imagine a tool like ImageBind opening new doors in the accessibility space, generating real-time multimedia descriptions to help people with vision or hearing disabilities better perceive their immediate environments.

“In typical AI systems, there is a specific embedding (that is, vectors of numbers that can represent data and their relationships in machine learning) for each respective modality,” said Meta. “ImageBind shows that it’s possible to create a joint embedding space across multiple modalities without needing to train on data with every different combination of modalities. This is important because it’s not feasible for researchers to create datasets with samples that contain, for example, audio data and thermal data from a busy city street, or depth data and a text description of a seaside cliff.”

Meta views the tech as eventually expanding beyond its current six “senses,” so to speak. “While we explored six modalities in our current research, we believe that introducing new modalities that link as many senses as possible — like touch, speech, smell, and brain fMRI signals — will enable richer human-centric AI models.” Developers interested in exploring this new sandbox can start by diving into Meta’s open-source code.



Source link

Tweet8Share13Share3Share
Previous Post

https://apps.apple.com/us/app/bobo-group-voice-chat-rooms/id1489566586?l=ar

Next Post

Boise Cascade Expands Birmingham, AL, Distribution Center – Business Wire

North Dakota Digital News

North Dakota Digital News

Next Post
Ten People Charged for their Involvement in the Illegal Trafficking of … – Department of Justice

Boise Cascade Expands Birmingham, AL, Distribution Center - Business Wire

Discussion about this post

Bismarck
◉
70°
Fair
6:15 am9:03 pm CDT
Feels like: 70°F
Wind: 15mph ESE
Humidity: 58%
Pressure: 29.94"Hg
UV index: 6
WedThuFri
79/55°F
70/55°F
66/50°F
Weather forecast Bismarck, North Dakota ▸
Plant Kween’s Leafy Brooklyn Apartment Is Rooted in Love
LIFESTYLE

Plant Kween’s Leafy Brooklyn Apartment Is Rooted in Love

by North Dakota Digital News
May 9, 2023
Ten People Charged for their Involvement in the Illegal Trafficking of … – Department of Justice
PRESS RELEASE

Caroline Sweeney joins Gov. Lujan Grisham’s communications team … – Office of the Governor

by North Dakota Digital News
May 9, 2023
The End of the Covid Emergency Is a Warning
SCIENCE

The End of the Covid Emergency Is a Warning

by North Dakota Digital News
May 9, 2023
The two iPhone 16 Pro models will have taller displays
MOBILE

The two iPhone 16 Pro models will have taller displays

by North Dakota Digital News
May 9, 2023
Ten People Charged for their Involvement in the Illegal Trafficking of … – Department of Justice
PRESS RELEASE

Boise Cascade Expands Birmingham, AL, Distribution Center – Business Wire

by North Dakota Digital News
May 9, 2023
Meta’s open-source ImageBind AI aims to mimic human perception
GADGET

Meta’s open-source ImageBind AI aims to mimic human perception

by North Dakota Digital News
May 9, 2023
https://apps.apple.com/us/app/bobo-group-voice-chat-rooms/id1489566586?l=ar
APPS

https://apps.apple.com/us/app/bobo-group-voice-chat-rooms/id1489566586?l=ar

by North Dakota Digital News
May 9, 2023
MARKET

House Speaker nixes possibility of short-term debt-limit extensin – report

by North Dakota Digital News
May 9, 2023
2:00PM Water Cooler 7/27/2022 | naked capitalism
ECONOMY

2:00PM Water Cooler 5/9/2023 | naked capitalism

by North Dakota Digital News
May 9, 2023
5 ways AI is helping to improve customer service in e-commerce
CRYPTO

5 ways AI is helping to improve customer service in e-commerce

by North Dakota Digital News
May 9, 2023
Joana Silva – Art – Pro – Pole Theatre UK 2015
ARTS & THEATER

Joana Silva – Art – Pro – Pole Theatre UK 2015

by North Dakota Digital News
May 9, 2023
Survivor Is Already A GOTY Contender
GAMING

Survivor Is Already A GOTY Contender

by North Dakota Digital News
May 9, 2023

About Us

North Dakota Digital News

Category

  • APPS
  • ARTS & THEATER
  • BUSINESS
  • CELEBRITY
  • CRYPTO
  • CULTURE
  • ECONOMY
  • Education
  • ENTERTAINMENT
  • FASHION
  • FINANCE
  • FOOD
  • GADGET
  • Gambling
  • GAMING
  • HEALTH
  • HISTORY
  • LIFESTYLE
  • MARKET
  • MOBILE
  • MONEY
  • MOVIE
  • MUSIC
  • Nature
  • News
  • PRESS RELEASE
  • REAL ESTATE
  • Religion
  • SCIENCE
  • Shopping
  • SHOWS
  • SPORTS
  • TECH
  • TRAVEL
LIFESTYLE

Plant Kween’s Leafy Brooklyn Apartment Is Rooted in Love

May 9, 2023
PRESS RELEASE

Caroline Sweeney joins Gov. Lujan Grisham’s communications team … – Office of the Governor

May 9, 2023
SCIENCE

The End of the Covid Emergency Is a Warning

May 9, 2023

© 2023 northdakotadigitalnews.com

No Result
View All Result
  • Home
  • News
    • PRESS RELEASE
  • Shop
  • BUSINESS
    • CRYPTO
    • ECONOMY
    • FINANCE
    • MARKET
    • MONEY
  • TECH
    • APPS
    • GADGET
    • MOBILE
    • SCIENCE
  • SOCIAL MEDIA
  • ENTERTAINMENT
    • ARTS & THEATER
    • GAMING
    • GAMBLING
    • MOVIE
    • MUSIC
    • SHOWS
    • SPORTS
  • LIFESTYLE
    • CELEBRITY
    • CULTURE
    • Education
    • FASHION
    • FOOD
    • HEALTH
    • HISTORY
    • Nature
    • Religion
    • Shopping
    • TRAVEL
  • REAL ESTATE
  • Blog
  • Classifieds

© 2023 northdakotadigitalnews.com

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In