Monday, 18 November 2024
26.8 C
Singapore

Apple unveils revolutionary AI image editing model

Discover Apple's groundbreaking AI image editing model, revolutionizing editing with simple text commands and superior performance.

Apple has stepped into image editing by releasing a groundbreaking multimodal AI model. Developed in collaboration with researchers from the University of California, Santa Barbara, the MLLM-Guided Image Editing (MGIE) model promises to revolutionize how images are edited.

Simplifying image editing with natural language commands

The MGIE model enables users to edit images using simple text commands, akin to giving instructions to a Photoshop expert. Unlike traditional methods, which often struggle to comprehend brief instructions, MGIE leverages multimodal large language models (MLLMs) to simultaneously understand natural language prompts and image data. This innovative approach eliminates the need for detailed descriptions, resulting in more accurate and intuitive editing.

According to the research paper accompanying the release, existing AI image editing tools frequently fall short when interpreting human instructions. However, MGIE’s ability to grasp the context of expressive instructions sets it apart. For instance, when presented with an image of a pepperoni pizza and instructed to “make this more healthy,” MGIE intelligently infers that “this” refers to the pizza and interprets “more healthy” as adding vegetables. The result? A pepperoni pizza adorned with nutritious green toppings, transforming it into a healthier option.

Superior performance demonstrated

In comparative tests, MGIE outperformed other models by accurately implementing complex editing commands. For example, when instructed to “add lightning and make the water reflect the lightning” to an image of a forested shoreline, MGIE seamlessly incorporated the lightning and its reflection on the water’s surface. In contrast, competing models failed to capture the intricate details of the command, highlighting MGIE’s exceptional capabilities.

Apple’s foray into AI image editing marks a significant step forward in the field, demonstrating the company’s commitment to advancing AI technologies. While the release of MGIE is a notable development, industry observers speculate that this may be just the beginning for Apple in the realm of AI.

Hot this week

T-Mobile network infiltrated by hackers linked to China

China-linked hackers breached T-Mobile, accessing officials' data. T-Mobile says customers' data remains largely unaffected.

YouTube’s new AI music remixer could let you transform songs with ease

YouTube’s experimental AI remixer lets creators transform tracks into new genres, adding personal flair to Shorts with AI-powered custom soundtracks.

Alibaba’s quarterly profit rises 58% on cloud and international growth

Despite a challenging domestic market, Alibaba's quarterly profit jumped 58%, driven by cloud computing and international e-commerce growth.

18 states challenge SEC over crypto regulation enforcement

18 US states filed a lawsuit challenging the SEC’s authority over crypto regulation, seeking state-level control. The legal battle could reshape oversight.

Related Articles

Popular Categories