Method Overview
DIPSY generates synthetic training data for few-shot classification using a novel dual IP-Adapter approach. The method leverages IP-Adapter and Stable Diffusion to create highly discriminative images without requiring model fine-tuning or external tools.

Key Innovations
Extended Classifier-Free Guidance: We extend CFG to independently control text, positive image, and negative image conditioning. This provides fine-grained control over the generation process, allowing us to simultaneously enhance class-specific features while suppressing features from related classes.
Class Similarity-Based Sampling: Our strategy selects effective negative prompts from related classes, enhancing the discriminative power of generated images. By identifying semantically similar classes, we create stronger contrastive examples that improve classifier training.
Training-Free Pipeline: DIPSY requires no model fine-tuning, external captioning, or filtering, making it practical for real-world applications. The entire pipeline operates using pre-trained models without any additional training steps.