Physics artwork

10/5/2023

Ask DALL·E 2, an image generation system created by OpenAI, to paint a picture of “goldfish slurping Coca-Cola on a beach,” and it will spit out surreal images of exactly that. The program would have encountered images of beaches, goldfish and Coca-Cola during training, but it’s highly unlikely it would have seen one in which all three came together. Yet DALL·E 2 can assemble the concepts into something that might have made Dalí proud.

DALL·E 2 is a type of generative model - a system that attempts to use training data to generate something new that’s comparable to the data in terms of quality and variety. This is one of the hardest problems in machine learning, and getting to this point has been a difficult journey.

The first important generative models for images used an approach to artificial intelligence called a neural network - a program composed of many layers of computational units called artificial neurons. But even as the quality of their images got better, the models proved unreliable and hard to train. Meanwhile, a powerful generative model - created by a postdoctoral researcher with a passion for physics - lay dormant, until two graduate students made technical breakthroughs that brought the beast to life.

DALL·E 2 is such a beast. The key insight that makes DALL·E 2’s images possible - as well as those of its competitors Stable Diffusion and Imagen - comes from the world of physics. The system that underpins them, known as a diffusion model, is heavily inspired by nonequilibrium thermodynamics, which governs phenomena like the spread of fluids and gases. “There are a lot of techniques that were initially invented by physicists and now are very important in machine learning,” said Yang Song, a machine learning researcher at OpenAI.

The power of these models has rocked industry and users alike. “This is an exciting time for generative models,” said Anima Anandkumar, a computer scientist at the California Institute of Technology and senior director of machine learning research at Nvidia. And while the realistic-looking images created by diffusion models can sometimes perpetuate social and cultural biases, she said, “we have demonstrated that generative models are useful for downstream tasks [that] improve the fairness of predictive AI models.”

High Probabilities

To understand how creating data works for images, let’s start with a simple image made of just two adjacent grayscale pixels. We can fully describe this image with two values, based on each pixel’s shade (from zero being completely black to 255 being completely white). You can use these two values to plot the image as a point in 2D space.
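As a minimal sketch of the two-pixel idea described above, the following Python snippet treats each pixel’s shade as one coordinate, so that every two-pixel grayscale image becomes a single point in 2D space. The function name `image_to_point` and the sample images are illustrative assumptions, not something taken from the article.

```python
from typing import Sequence, Tuple

# Shade range from the text: 0 is completely black, 255 is completely white.
MIN_SHADE, MAX_SHADE = 0, 255


def image_to_point(pixels: Sequence[int]) -> Tuple[int, int]:
    """Map a two-pixel grayscale image to an (x, y) point in 2D space.

    Hypothetical helper: x is the first pixel's shade, y is the second's.
    """
    if len(pixels) != 2:
        raise ValueError("expected exactly two pixels")
    for shade in pixels:
        if not MIN_SHADE <= shade <= MAX_SHADE:
            raise ValueError(f"shade {shade} is outside 0..255")
    first, second = pixels
    return (first, second)


# Made-up sample images, each given as [first pixel shade, second pixel shade].
examples = {
    "both black": [0, 0],
    "both white": [255, 255],
    "black then white": [0, 255],
}

# Each image collapses to one point that could be drawn on a 2D plot.
points = {name: image_to_point(px) for name, px in examples.items()}
for name, point in points.items():
    print(f"{name}: {point}")
```

Plotting many such points from a collection of images would show where the images cluster in this 2D space, which is the picture the two-pixel example is meant to convey.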
