Clipdrop

Clipdrop is an AI-powered image editing and creation platform developed by the French startup Init ML. Founded in 2020 by Cyril Diagne, Damien Henry, and Jonathan Blanchet, the platform grew out of a viral augmented reality prototype called AR Cut Paste that demonstrated the ability to capture real-world objects with a smartphone and paste them directly into desktop applications. Over the following years, Clipdrop evolved into a comprehensive suite of AI-driven visual tools, including background removal, object cleanup, image upscaling, relighting, text removal, uncropping, and text-to-image generation powered by Stable Diffusion. The platform accumulated over 15 million users before being acquired by Stability AI in March 2023 and subsequently sold to Jasper AI in February 2024.

Origins and Founding

Clipdrop traces its origins to the work of Cyril Diagne, a French creative coder and former Artist in Residence at the Google Arts & Culture Lab (2015 to 2020). Diagne studied at Gobelins, l'ecole de l'image in Paris and co-founded the interactive art collective LAB212 after graduating in 2008. He later served as Professor and Head of Media & Interaction Design at ECAL in Lausanne, Switzerland.

In May 2020, Diagne published a prototype called AR Cut Paste on GitHub. The tool used a combination of BASNet (Boundary-Aware Segmentation Network), a deep learning model for salient object detection, and OpenCV SIFT (Scale-Invariant Feature Transform) to enable users to point a smartphone camera at a real-world object, automatically segment it from the background, and paste it into a desktop application such as Adobe Photoshop. The prototype took approximately 2.5 seconds to cut an object and about 4 seconds to transfer it to the desktop. The demo went viral on Twitter, attracting attention from Adobe's official Photoshop account and generating widespread media coverage.

The success of AR Cut Paste motivated Diagne to formalize the concept into a product. In July 2020, he co-founded Init ML alongside Damien Henry and Jonathan Blanchet, both of whom had backgrounds at Google. Henry had worked in operations and product roles, while Blanchet brought expertise in engineering and machine learning. The company was headquartered in Paris, France.

The founding vision was to simplify a workflow that traditionally required multiple steps: photographing an object, transferring the file to a computer, opening an image editor, and manually removing the background. Clipdrop compressed this entire process into a single "point, capture, drop" interaction.

Y Combinator and Early Funding

Init ML was accepted into Y Combinator's Winter 2021 (W21) batch, giving the startup access to Silicon Valley mentorship, network connections, and global visibility. During its time in the accelerator program, the team refined Clipdrop's core product and expanded its feature set beyond the original AR copy-paste functionality.

In August 2021, Init ML closed a seed funding round of approximately 1.2 million euros (roughly $1.5 million) led by Air Street Capital, a venture firm focused on artificial intelligence. The round included participation from 11 investors, among them Kima Ventures and SOMA. Several prominent technology executives participated as angel investors:

Investor	Role / Background
Maria Raga	Former CEO of Depop (acquired by Etsy)
Kaj Drobin	Co-founder of TicTail (now VP at Shopify)
Siavash Ghorbani	Co-founder of TicTail
Carl Riviera	Co-founder of TicTail
Edward Miller	CEO of Scape (acquired by Facebook)
Namrata Ganatra	Former CTO of Lambda School
Ian Hogarth	CEO of SongKick
Matt Robinson	CEO of Nested

The seed capital supported the expansion of Clipdrop's product line and its growing user base. By mid-2021, users had already created over 5 million "clips" (object captures and edits) within six months of the platform's initial launch.

Product Evolution and Feature Set

Clipdrop launched initially as a focused AR-based object capture tool but rapidly expanded into a full suite of AI image editing tools. By 2023, the platform offered nine primary tools, each powered by distinct AI models.

Remove Background

The background removal tool was Clipdrop's flagship feature from the start. It uses image segmentation models to identify and isolate the foreground subject from the rest of the image. The tool supports a wide range of subjects including portraits, products, and complex scenes. Clipdrop marketed it as "the most accurate background removal solution available on the market," though independent benchmarks from Photoroom and others showed that all background removal tools (including Clipdrop, Remove.bg, and Meta's Segment Anything Model) struggled with particularly fine details such as individual hair strands, bicycle spokes, and lace textures.

Cleanup

The Cleanup tool, also available at the standalone domain cleanup.pictures, allows users to remove unwanted objects, people, text, and defects from images. Unlike traditional clone stamp or content-aware fill tools found in conventional image editors, Clipdrop's Cleanup feature uses AI to intelligently predict what should appear behind the removed element. Users simply brush over the area they want removed, and the AI fills the region with contextually appropriate content. Pro subscribers gained access to an HD mode for higher-resolution output.

Image Upscaler

Clipdrop's upscaling tool enhances image resolution by 2x or 4x while reducing noise and recovering fine details. The technology uses super-resolution algorithms trained on large datasets of image pairs to predict high-frequency details that are absent in lower-resolution originals. Pro plan subscribers could access upscaling up to 16x the original resolution.

Relight

Relight is a tool that allows users to adjust the lighting in a photograph after it has been taken. The tool uses AI-based depth estimation to understand the three-dimensional structure of the scene, then applies virtual light sources that cast realistic shadows and highlights. Users can position light sources interactively within the interface, and the AI recalculates the shading accordingly. The feature proved popular with product photographers, portrait editors, and digital artists who needed to adjust lighting conditions without re-shooting.

Text Remover

The Text Remover tool detects and removes text from images. It uses optical character recognition (OCR) and inpainting models to first identify text regions and then fill those regions with content that matches the surrounding area. Common use cases include removing watermarks, cleaning up photographs of signage, and preparing images for localization where text needs to be replaced in different languages.

Uncrop

Uncrop extends the boundaries of an image by generating new content beyond the original frame. Built on foundation models from Stability AI, the tool predicts what should exist outside the visible edges of a photograph. Users upload an image and select a target aspect ratio, and the AI generates the missing portions. Practical applications include converting square Instagram photos to widescreen formats, restoring cropped areas in old photographs, and adapting images for social media platforms that require specific dimensions.

Replace Background

Replace Background combines Clipdrop's background removal capability with generative AI. After isolating the foreground subject, the tool generates a new background based on a text prompt or selected scene description. This feature was released in beta and enabled users to place products or people into entirely new environments without manual compositing.

Text to Image

Clipdrop's text-to-image tool generates images from written prompts. The feature is built on top of Stable Diffusion and was optimized for speed within the Clipdrop interface. Users can choose from multiple artistic styles, including photographic, anime, digital art, comic book, fantasy art, analog film, neon punk, isometric, low poly, origami, line art, cinematic, and 3D model. Available aspect ratios range from ultrawide (21:9) to vertical (9:16). Each prompt generates four image variations for the user to choose from.

Universal Resizer

The Universal Resizer adapts images automatically for social media formats. Rather than simply cropping or stretching, the tool intelligently adjusts composition and fills in missing areas to fit the target dimensions for platforms such as Instagram, Facebook, Twitter, LinkedIn, and YouTube.

Stable Diffusion Integration

After Stability AI acquired Clipdrop in 2023, the platform became a primary consumer-facing showcase for Stability AI's generative models.

Stable Diffusion XL

Clipdrop served as the official demo platform for Stable Diffusion XL (SDXL), released in July 2023. SDXL represented a major upgrade over previous Stable Diffusion versions, generating images at 1024x1024 resolution with a two-stage pipeline (base model and optional refiner). Through Clipdrop, users gained free access to SDXL without needing to install software or manage GPU resources locally.

Reimagine XL

In late May 2023, Stability AI launched Reimagine XL on Clipdrop. This tool generates multiple variations of an uploaded image, preserving the overall composition and subject matter while changing specific details, colors, and artistic elements. The technology is based on an open-source Stable unCLIP model that replaces Stable Diffusion's text encoder with an image encoder. Instead of conditioning generation on a text prompt, Reimagine XL encodes the source image into a latent representation, adds controlled noise, and then generates new images that are visually similar but not pixel-identical to the original. This approach differs from standard image-to-image processing because the source image is fully encoded first, meaning the generated output does not reuse any pixels from the original.

The initial version, Stable Diffusion Reimagine, launched in March 2023. The XL upgrade in May 2023 brought higher-resolution output and improved quality.

SDXL Turbo

In November 2023, Clipdrop became the demo platform for SDXL Turbo, a distilled version of SDXL 1.0 trained using Adversarial Diffusion Distillation (ADD). SDXL Turbo could generate images in a single network evaluation step, enabling near-real-time text-to-image synthesis. Users could type a prompt and see results appear almost instantly, a significant departure from the multi-second generation times of standard diffusion models.

Platform Availability

Clipdrop was designed as a cross-platform product from its earliest days. The platform was available through multiple channels:

Platform	Description
Web application	Full-featured browser-based access at clipdrop.co
iOS app	Mobile application for iPhone and iPad
Android app	Mobile application for Android devices
macOS application	Native desktop app for Mac
Windows application	Native desktop app for Windows
Photoshop plugin	Direct integration with Adobe Photoshop, allowing users to drop captured objects as new layers with editable masks
Figma plugin	Integration with Figma for background removal, object removal, text removal, and automatic image decomposition into layers

The Figma plugin was free to use with unlimited image processing and required no subscription. The Photoshop plugin allowed users to insert captured objects directly as new layers with editable masks, fitting naturally into existing design workflows.

Clipdrop also supported integration with other applications including Microsoft PowerPoint, Google Docs, Canva, and Pitch.

API

Clipdrop offered a REST API that provided programmatic access to its core image processing capabilities. The API endpoints included:

API Endpoint	Function
Remove Background	Segment and extract foreground subjects
Cleanup	Remove unwanted objects, text, or people
Image Upscaling	Enhance resolution up to 16x
Text to Image	Generate images from text prompts using Stable Diffusion
Remove Text	Detect and remove text from images
Uncrop	Extend image boundaries with AI-generated content
Replace Background	Combine background removal with generative fill

The API was used by developers and businesses to integrate Clipdrop's capabilities into their own applications, e-commerce platforms, and content management systems. After the Jasper acquisition, the API was migrated to the Jasper API platform.

Pricing

Clipdrop operated on a freemium model with three tiers:

Plan	Price	Features
Free	$0	Limited daily usage, watermarked results, standard resolution, 100 images per month
Pro	$9/month ($7/month billed annually)	Unlimited access, HD resolution, no watermarks, batch processing (up to 10 images), 16x upscaling, Relight templates, priority processing
API / Business	$29/month	API access, 5,000 image allowance, developer documentation, commercial integration support

All plans permitted commercial use of generated and edited images.

Awards and Recognition

Clipdrop earned significant recognition in the product and technology community:

Won the 2020 Golden Kitty Award from Product Hunt for Augmented Reality Product of the Year
Named Product Hunt Product of the Day six times across different feature launches
Accepted into Y Combinator W21, one of the most competitive startup accelerator programs

Acquisition by Stability AI

On March 7, 2023, Stability AI announced the acquisition of Init ML, the company behind Clipdrop. The financial terms of the deal were not disclosed. At the time of the acquisition, Clipdrop had amassed more than 15 million users.

Init ML became a wholly owned independent subsidiary of Stability AI, with all employees retained. The three co-founders took on new roles within the Stability AI organization:

Co-founder	Previous Role	New Role at Stability AI
Cyril Diagne	CEO, Init ML	Vice President of Innovation, Clipdrop
Damien Henry	COO, Init ML	Senior Vice President of Product, Clipdrop
Jonathan Blanchet	CTO, Init ML	Vice President of Engineering, Clipdrop

The acquisition gave Clipdrop access to Stability AI's generative AI models, including Stable Diffusion XL and its successors. For Stability AI, Clipdrop provided a polished consumer-facing platform to showcase its models to a broad audience. During the Stability AI ownership period, Clipdrop launched several new features built on Stability AI technology, including Reimagine XL, SDXL-powered text-to-image generation, and SDXL Turbo real-time synthesis.

Acquisition by Jasper

On February 20, 2024, Jasper AI, an AI marketing platform, announced it had completed the acquisition of Clipdrop from Stability AI. The deal closed less than a year after Stability AI's original purchase of Init ML. Financial terms were again not disclosed.

The acquisition was part of Jasper's strategy to build a comprehensive "marketing copilot" that combined text generation with image creation and editing capabilities. Jasper CEO Timothy Young stated: "Marketing is visual. The addition of Clipdrop to Jasper will advance our vision to be the most comprehensive end-to-end marketing copilot in the industry, powering all the formats, channels, and functions enterprise marketing teams need."

Emad Mostaque, then CEO of Stability AI, expressed support for the deal, noting that Jasper had been "a valued partner, leveraging Stability AI models for many years" and that Stability AI would "continue to partner with Clipdrop on research and deliver our cutting-edge models to their platform."

Clipdrop co-founder Damien Henry took on an expanded role at Jasper, leading research and innovation across the entire Jasper product line. The Clipdrop platform continued to operate as a standalone product at clipdrop.co, while its API was integrated into the Jasper API for business customers.

Underlying Technology

Clipdrop's tools rely on several categories of AI models:

Image Segmentation

The background removal and object isolation features use image segmentation models that classify each pixel in an image as either foreground or background. The original AR Cut Paste prototype used BASNet (Boundary-Aware Segmentation Network), a predict-refine architecture that combines a densely supervised encoder-decoder network with a residual refinement module. Later versions of Clipdrop likely incorporated more advanced segmentation architectures as the field progressed.

Inpainting

The Cleanup and Text Remover tools use inpainting models that fill in masked regions of an image with content that is contextually consistent with the surrounding pixels. These models are trained on large datasets of images with artificially masked regions, learning to reconstruct the missing portions in a way that appears natural.

Super-Resolution

The Image Upscaler uses super-resolution neural networks that learn mappings between low-resolution and high-resolution image pairs. These networks predict high-frequency details (textures, edges, fine patterns) that cannot be recovered through simple interpolation methods like bicubic scaling.

Depth Estimation

The Relight tool uses monocular depth estimation models to infer the three-dimensional structure of a scene from a single two-dimensional image. This depth map enables the placement of virtual light sources that produce physically plausible shadows and highlights.

Diffusion Models

The text-to-image, Reimagine, Uncrop, and Replace Background features all rely on diffusion models, specifically variants of Stable Diffusion. These models work by learning to reverse a gradual noising process: during training, they learn to remove noise from progressively noisier versions of images, and during inference, they start from pure noise and iteratively denoise it into a coherent image guided by a conditioning signal (text prompt, image encoding, or partial image).

Current Status

As of early 2026, Clipdrop continues to operate at clipdrop.co under Jasper's ownership. The platform maintains its suite of AI image editing tools, with pricing starting at a free tier (100 images per month) and paid plans beginning at $9 per month. The Clipdrop API has been incorporated into Jasper's broader API platform, though Stability AI has continued to provide its generative models to the Clipdrop platform as part of an ongoing partnership.

The competitive landscape for AI image editing has intensified since Clipdrop's founding, with tools such as Adobe Firefly, Canva AI, Photoroom, and Remove.bg offering overlapping capabilities. The broader trend in the market has moved toward integrating AI editing tools directly into existing creative software platforms rather than offering them as standalone web applications.

References

PR Newswire. "Stability AI Acquires Init ML, Makers of Clipdrop Application." March 7, 2023.
PR Newswire. "Jasper Expands by Acquiring Image Platform Clipdrop from Stability AI." February 20, 2024.
Damien Henry. "Clipdrop." damien-henry.com.
PetaPixel. "This Guy Figured Out How to 'Cut and Paste' the Real World Into Photoshop." May 6, 2020.
Cyril Diagne. GitHub repository: ar-cutpaste. github.com/cyrildiagne/ar-cutpaste.
EU-Startups. "Paris-based ClipDrop raises 1.2 million euros for its 'cut & paste your surroundings' photo editing app." August 2021.
Nordic 9. "ClipDrop to raise a 1.2 million euro seed round led by Air Street Capital." 2021.
Y Combinator. "ClipDrop (Init ML)." ycombinator.com/companies/clipdrop.
VentureBeat. "Jasper acquires Stability AI's Clipdrop to strengthen marketing copilot." February 2024.
WinBuzzer. "Stability AI Debuts Reimagine XL for its Clipdrop Tool." May 29, 2023.
MarkTechPost. "Stability AI Launches Stable Diffusion Reimagine." March 21, 2023.
Stability AI. "Stable Diffusion Reimagine." stability.ai/news/stable-diffusion-reimagine.
Product Hunt. "Announcing the 2020 Golden Kitty Award Winners." 2020.
Clipdrop. "APIs Documentation." clipdrop.co/apis.
Clipdrop. "Pricing." clipdrop.co/pricing.
Clipdrop. "Tools." clipdrop.co/tools.
Photoroom. "Remove Background Competitors Benchmark." docs.photoroom.com.
Hackster.io. "Cyril Diagne's AR Copy Paste App Pulls Objects From Reality and Pastes Them Into Your Laptop." 2020.
Google Experiments. "Runway Palette by Cyril Diagne." experiments.withgoogle.com.
Hugging Face. "stabilityai/sdxl-turbo." huggingface.co/stabilityai/sdxl-turbo.

Origins and Founding

Y Combinator and Early Funding

Product Evolution and Feature Set

Remove Background

Cleanup

Image Upscaler

Relight

Text Remover

Uncrop

Replace Background

Text to Image

Universal Resizer

Stable Diffusion Integration

Stable Diffusion XL

Reimagine XL

SDXL Turbo

Platform Availability

API

Pricing

Awards and Recognition

Acquisition by Stability AI

Acquisition by Jasper

Underlying Technology

Image Segmentation

Inpainting

Super-Resolution

Depth Estimation

Diffusion Models

Current Status

References

Improve this article

Related Articles

Magnific AI

Magnific AI

Topaz Labs

Playground AI

Photoroom

Unstable Diffusion

Origins and Founding

Y Combinator and Early Funding

Product Evolution and Feature Set

Remove Background

Cleanup

Image Upscaler

Relight

Text Remover

Uncrop

Replace Background

Text to Image

Universal Resizer

Stable Diffusion Integration

Stable Diffusion XL

Reimagine XL

SDXL Turbo

Platform Availability

API

Pricing

Awards and Recognition

Acquisition by Stability AI

Acquisition by Jasper

Underlying Technology

Image Segmentation

Inpainting

Super-Resolution

Depth Estimation

Diffusion Models

Current Status

References

Related Articles

Magnific AI

Magnific AI

Topaz Labs

Playground AI

Photoroom

Unstable Diffusion