

Cloudinary announced significant advancements in its generative AI portfolio, including AI Vision, which gives developers and brands an unprecedented level of control and insight into their visual media.
Cloudinary also announced several enhancements to its most popular generative AI tools including Generative Enhance, Generative Fill, Generative Restore, and Generative Upscale. Thousands of developers rely on Cloudinary’s generative AI capabilities to edit, optimize and transform media at scale with ease and efficiency.
Introducing AI Vision - Media Intelligence Powered by Generative AI
Designed to address the growing complexity of managing large-scale visual media libraries while prioritizing brand safety, AI Vision is a specialized AI feature that brings the power of generative AI to Cloudinary’s intelligent digital asset management (DAM) platform. By automating media management and enabling precise, scalable, brand-specific content workflows, AI Vision empowers brands with capabilities that go beyond basic automation. Key use cases include:
- Custom Taxonomy and Image Classification: More easily categorize, search and find assets based on detailed criteria such as background color or subject orientation without needing to train or fine-tune tagging models. Demographic criteria can also be built into automated workflows that analyze images at scale.
- Content Moderation and Compliance: Advanced image analysis provides unparalleled accuracy and detailed insights into your content, from detecting the presence of specific branding elements to identifying sensitive or inappropriate content.
- Visual Question Answering (VQA): Ask complex, image-specific questions and receive actionable, precise responses that streamline and improve your media workflows, such as generating SEO-ready metadata or descriptive alt text.
Cloudinary's Generative AI Features Receive Major Upgrades
Since launching its first set of generative AI tools in 2023, Cloudinary has continued to serve the most pressing, real-world needs of developers and brands. Its latest generative AI innovations further enable users to unlock creative possibilities while reducing complexity and costs in their visual media workflows. Key enhancements to Cloudinary’s suite of generative AI tools include:
- Generative Fill: Powered by a more advanced, fine-tuned model, Cloudinary’s most popular generative AI feature now delivers greater contextual accuracy when filling the whitespace created by expanding the image canvas to fit new aspect ratios.
- Background Removal and Background Replacement: These features now provide more precise outputs for removing and replacing backgrounds based on the image’s foreground for even the most complex assets. They also offer more streamlined interfaces that empower users to create and manage content with minimal effort in less time.
- Generative Extract: This new feature intelligently isolates specific elements like products, objects, or people from images to create layered, dynamic content optimized for any channel, which is especially powerful when used in combination with Cloudinary’s overlay feature.
- Generative Enhance and Restore: Remove noise and imperfections from any image, automatically sharpen details, correct issues, and enhance image quality. Effortlessly revive old or damaged images while preserving critical details for professional results.
- Generative Upscale: This highly practical feature seamlessly expands image resolution without compromising quality or introducing artifacts, enabling brands to use high-impact images for any use case, regardless of original asset quality.
- AI Video Transcription and Chaptering: Cloudinary further streamlines video management tasks at scale with its new video transcription and chaptering tools, which automatically generate transcripts and chapters upon upload through Cloudinary’s Video API and the Video Player Studio of its intelligent DAM.
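For developers, the Generative Fill behavior described above (expanding the canvas to a new aspect ratio and filling the added space) can be sketched as a delivery URL. The snippet below is a minimal illustration rather than Cloudinary's official example: it assumes the transformation parameters `ar_`, `c_pad`, `b_gen_fill` and `w_` as they appear in Cloudinary's transformation URL documentation, and the cloud name `demo`, the asset `sample.jpg` and the helper name `gen_fill_url` are placeholders chosen for illustration.

```python
def gen_fill_url(cloud_name: str, public_id: str, aspect_ratio: str, width: int) -> str:
    """Build a delivery URL that pads an image to a new aspect ratio
    and asks for the padded area to be filled generatively.

    Assumed parameters (not confirmed by the announcement itself):
      ar_<ratio>  target aspect ratio
      c_pad       pad the canvas instead of cropping
      b_gen_fill  fill the padding with generated content
      w_<pixels>  output width
    """
    transformation = ",".join([
        f"ar_{aspect_ratio}",
        "b_gen_fill",
        "c_pad",
        f"w_{width}",
    ])
    return (
        f"https://res.cloudinary.com/{cloud_name}/image/upload/"
        f"{transformation}/{public_id}"
    )


# Placeholder account and asset names, for illustration only.
print(gen_fill_url("demo", "sample.jpg", "16:9", 1500))
```

Because the transformation lives in the URL, the same source asset can be requested at several aspect ratios without storing separate copies.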
“By applying these tools to our most popular use cases we are able to edit and deliver more dynamic experiences, creating multiple assets for different markets in a fraction of time,” said Javier Acón, Audiovisual Documentalist, Creative Marketing, Fever. “They’ve been a game-changer for us.”
“Managing images at scale isn’t just about speed—it’s about ensuring accuracy, brand compliance, and efficiency across thousands of assets, teams, and touchpoints,” said Nadav Soferman, co-founder and Chief Product Officer at Cloudinary. “AI Vision brings automation and intelligence to these critical workflows, allowing brands to instantly tag, moderate, and transform images with confidence. Combined with our suite of generative AI tools, this means faster go-to-market times, less manual work, and seamless delivery of optimized visuals everywhere.”
With the growing demand for fast and flawless visual content on every channel and device, Cloudinary remains dedicated to providing both code-based and no-code tools, empowering businesses to deliver engaging visual experiences in today’s competitive digital landscape.