Secrets AI Video Generator: How It Works, Quality, and Cost
Video generation from AI companion images is Secrets AI's clearest competitive differentiator. Character.AI does not have it. CrushOn AI does not have it. Janitor AI does not have it. Most platforms in this category are text and image only. Secrets AI generates actual video clips from companion images — animated, realistic, character-consistent clips that most competing platforms simply cannot produce. This page covers the mechanics, the quality reality, the Moments economics, and who this feature actually serves. Start with the full platform review if you want overall context first.
What the Video Generator Actually Is
The Secrets AI video generator is a feature that converts a static companion image into a short animated video clip based on a text prompt describing the desired motion or action. You provide the source image (a companion-generated photo) and write a prompt; the AI processes both and returns a video clip approximately two minutes later.
This is not a template animation system. The output reflects the specific character's appearance, expression, and visual context from the source image, combined with the motion described in the prompt. The results vary — simple, natural movements tend to produce the best output; complex or physics-intensive prompts are less predictable.
The feature is available starting on the Lite tier ($5.99/month). Free accounts cannot access video generation regardless of Moments balance. The feature is gated at the tier level, not just the Moments level.
How Video Generation Works: Step by Step
- Generate or select a companion image. You can use any image from your character's existing library or generate a new one (25-50 Moments). The quality of the video output correlates directly with the quality and composition of the source image — choose a well-lit, clearly composed image.
- Write a motion prompt. Describe the action or movement you want. Examples: "smiling and slowly turning toward camera," "hair blowing in wind while looking out a window," "laughing softly and looking down." Specific, natural descriptions work better than complex or acrobatic directions.
- Submit the request. The generation process typically takes approximately two minutes. The AI processes the source image and the motion prompt together. You can continue using the platform during the wait.
- Review and save. The completed clip is returned to the conversation. You can download or save it. If the result does not meet expectations, you can regenerate with a refined prompt — at the same Moments cost.
Clip lengths vary by tier: Lite produces 3-second clips. Plus and above enable longer clips. The longer the clip, the higher the Moments cost.
Quality Assessment
Video quality is rated 4.1/5 by independent reviewers at aigirlfriendscout — a strong score that reflects genuine capability while acknowledging limitations. The honest assessment from direct testing:
What works well:
- Natural facial expressions (smiling, laughing, looking away) are the strongest output type
- Character visual consistency — the video subject looks like the source image
- Smooth motion on simple actions — a character turning, adjusting hair, or shifting posture
- Realistic rendering quality that goes beyond obvious AI-generated aesthetic
Where quality varies:
- Complex body movements are less consistent
- Full-body action sequences (walking, gesturing broadly) are less reliable than face/upper-body shots
- Prompt complexity correlates inversely with reliability — simpler prompts produce more consistent results
- Quality noticeably improves on the Premium generation model tier
The 4.1/5 score situates the feature correctly: it is genuinely good and functional, not a gimmick, but it is not cinematic-quality production. For companion-context use — short clips of a character in natural situations — it delivers what most users want.
Moments Costs: The Full Economics
Video is the most Moments-intensive feature on the platform. Understanding the costs before using it prevents budget surprises.
| Video Type | Moments Cost | Generation Time |
|---|---|---|
| Short clip (3 seconds) | ~50 Moments | ~2 minutes |
| Full-length clip | Up to 600 Moments | ~2 minutes |
For comparison with other features:
| Feature | Cost | Output |
|---|---|---|
| Text message | 1-2 Moments | Single text response |
| Manual memory save | 10 Moments | Saved memory anchor |
| Image generation | 25-50 Moments | Single static image |
| Short video (3 sec) | ~50 Moments | Brief motion clip |
| Full video | ~600 Moments | Longer clip |
| Voice call | 100 Moments/minute | Real-time audio |
With 600 Moments, you get: 1 full video OR 12-24 images OR 6 minutes of voice. The video cost is significant in relative terms.
Monthly video budget by subscription tier:
| Tier | Monthly Moments | Short Clips (50M each) | Full Clips (600M each) |
|---|---|---|---|
| Lite | 1,000 | ~20 | ~1-2 |
| Plus | 3,000 | ~60 | ~5 |
| Premium | 8,000 | ~160 | ~13 |
| Ultimate | 15,000 | ~300 | ~25 |
The math is stark for heavy video users. If full-length video generation is your primary use case, Lite and Plus will constrain you quickly. Premium ($19.99/month) gives approximately 13 full videos per month. Ultimate ($39.99/month) approximately 25. Neither is unlimited — Moments are a real constraint regardless of tier.
For a comprehensive overview of Moments costs across all features, see the Moments costs page.
Video vs Images vs Voice: Which Is Worth It?
Different use patterns suggest different feature priorities:
Image generation (25-50 Moments) is the most cost-efficient visual output. For users who want visual content without spending heavily on Moments, images provide a high volume of companion content at modest cost. The quality of Secrets AI's image output is strong, though not as high-fidelity as dedicated platforms like Candy AI.
Video generation (50-600 Moments) is the distinctive feature — no comparable platform offers it. The cost-per-unit is high, but the output is unique. For users who specifically value animated companion content, there is no competitive alternative. The feature justifies Secrets AI's position for this specific use case.
Voice calls (100 Moments/minute) sit in between. Natural voice quality (4.3/5) at a significant Moments rate. Useful for immersive interaction but expensive for extended sessions.
The recommended approach for new users: start with images to build up a character library at low cost, then selectively convert the best images to video. This minimizes Moments spend while producing the highest-quality video output (since you are selecting the best source images).
Who Should Use the Video Generator?
Use it if:
- Visual companion content is a priority alongside text chat
- You want unique media that no competing platform can produce
- You are on Plus tier or above with enough Moments to sustain regular video use
- Short animated clips of realistic companions have specific appeal for you
Skip it (or use sparingly) if:
- You are primarily a text conversation user
- You are on a tight Moments budget where every 600-Moment spend is significant
- You need high volume — the Moments math makes regular full-video generation expensive at every tier below Ultimate
Best tiers for video:
- Casual video use: Plus ($9.99/month) — 5 full videos per month is enough for occasional use
- Regular video use: Premium ($19.99/month) — 13 full videos per month covers most users
- Heavy video creation: Ultimate ($39.99/month) — 25 full videos per month maximum
The video access by tier breakdown has the full Moments comparison across all tiers.
Platforms That Also Offer Video Generation
The video generation landscape in AI companion platforms is sparse:
| Platform | Video Generation | Notes |
|---|---|---|
| Secrets AI | Yes | From companion images; 50-600 Moments |
| Candy AI | Limited | Less developed than Secrets AI |
| Character.AI | No | No video capability |
| CrushOn AI | No | Image generation only |
| Janitor AI | No | Text-focused, no media generation |
| SweetDream AI | Limited | Competitor with partial video capability |
| Xotic AI | Yes | 4K, 15-second clips — different product segment |
Secrets AI's video feature is genuinely distinctive in the mainstream AI companion market. The closest competitor with comparable video capability (Xotic AI) targets a different price point and use case. For the specific niche of AI girlfriend platforms with video generation, Secrets AI currently occupies the space with minimal direct competition. Try the video generator →
FAQ
Video length depends on the subscription tier and the generation settings used. Lite tier produces 3-second clips. Plus and above unlock longer clip generation. Maximum clip length is not officially published with a specific second count — the Moments cost for "full clips" runs up to 600 Moments per clip. The generation time is consistent at approximately two minutes regardless of clip length.
No. Video generation is tier-gated, not just Moments-gated. Free accounts cannot access video generation even if they have unspent Moments from the 200 starting credit. Lite tier ($5.99/month) is the minimum subscription required to generate any video clips. Once on Lite, you can generate 3-second clips at approximately 50 Moments each.
It depends entirely on your subscription tier and Moments allocation. On Plus (3,000 Moments), you can generate approximately 5 full-length clips per month if you spend all Moments on video. On Premium (8,000 Moments), approximately 13. On Ultimate (15,000 Moments), approximately 25. These numbers assume all Moments go to full-length video; if you also generate images or make voice calls, the video budget decreases proportionally. Additional Moments can be purchased starting at $5.99 for 1,980 Moments.
Yes, at a 4.1/5 quality level. Videos show realistic character movement with consistent facial expressions and natural motion on simple actions. The character in the video looks like the source companion image. Complex motion prompts are less reliable. The best results come from natural, face-focused actions (expressions, subtle movements, looking at camera) rather than full-body or high-energy motion. Premium generation models produce higher quality output than the standard generation model.