![]() It’s an open legal question as to whether GenAI vendors like Google can train a model on publicly available - even copyrighted - data and then turn around and commercialize that model. Google didn’t reveal the data that it used to train Imagen 2, which - while disappointing - doesn’t exactly come as a surprise. But as policymakers express concern over the growing volume of AI-generated disinformation on the web, it’ll perhaps allay some fears. Of course, detecting these watermarks - which Google claims are resilient to image edits including compression, filters and color adjustments - requires a Google-provided tool that’s not available to third parties. Imagen 2 leverages SynthID, an approach developed by DeepMind, to apply invisible watermarks to images created by it. These techniques also enhance Imagen 2’s multilingual understanding, Google says - allowing the model to translate a prompt in one language to an output (e.g. Thanks to “novel training and modeling techniques,” Imagen 2 can also understand more descriptive, long-form prompts and provide “detailed answers” to questions about elements in an image. “Imagen 2 can generate … emblems, lettermarks and abstract logos … has the ability to overlay these logos onto products, clothing, business cards and other surfaces,” Vishy Tirumalasetty, head of generative media products at Google, explains in a blog post provided to TechCrunch ahead of today’s announcement. In two possible points of differentiation, though, Imagen 2 can render text in multiple languages - specifically Chinese, Hindi, Japanese, Korean, Portuguese, English and Spanish, with more to come sometime in 2024 - and overlay logos in existing images. Text and logo generation brings Imagen in line with other leading image-generating models, like OpenAI’s DALL-E 3 and Amazon’s recently launched Titan Image Generator. “If you want to create images with a text overlay - for example, advertising - you can do that,” Google Cloud CEO Thomas Kurian said during a press briefing on Tuesday. Compared to the first-gen Imagen, it’s “significantly” improved in terms of image quality, Google claims (the company bizarrely refused to share image samples prior to this morning), and introduces new capabilities, including the ability to render text and logos. Google’s making the second generation of Imagen, its AI model that can create and edit images given a text prompt, more widely available - at least to Google Cloud customers using Vertex AI who’ve been approved for access.īut the company isn’t disclosing which data it used to train the new model - nor introducing a way for creators who might’ve inadvertently contributed to the dataset to opt out or apply for compensation.Ĭalled Imagen 2, Google’s enhanced model - which was quietly launched in preview at the tech giant’s I/O conference in May - was developed using technology from Google DeepMind, Google’s flagship AI lab.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |