Supported Models for Image Analysis
Currently, AICamp supports image analysis through:- OpenAI’s GPT-4o
- OpenAI’s GPT-4o mini
- Anthropic’s Claude 3 Opus
- Anthropic’s Claude 3.5 Sonnet
- Anthropic’s Claude 3 Haiku
- Anthropic’s Claude 3 Sonnet
- Google’s Gemini 1.5 Pro
- Google’s Gemini 1.5 Flash
Model availability depends on your organization’s settings and access permissions.
What You Can Do
- Get detailed descriptions of images
Example: Upload a product design and get a description of its layout and color scheme. - Analyze charts, graphs, or screenshots
Example: Upload a sales dashboard screenshot and ask for a performance summary. - Extract information from documents or forms
Example: Upload a scanned receipt and extract the total amount. - Ask questions about visual content
Example: “What does this flowchart show?” or “Summarize the data shown in this graph.”
Structuring Your Input for Better Results
Here’s how to frame your prompts after uploading an image:Instead of | Ask Like This |
---|---|
What is this? | Describe the main elements shown in this product design image. |
Analyze this chart. | Summarize the sales trend over the last six months shown in this bar chart. |
Tell me about this picture. | Identify the key objects and setting shown in this outdoor photo. |
- Be clear about what you want analyzed (overall summary, specific details, etc.)
- Mention if you want structured outputs like bullet points.
Best Practices
- Choose models that support image input.
- Upload clear, high-quality images for best results.
- Ask specific questions about the image if you need targeted insights.
- Refine your prompts to dive deeper if needed.