Final keyframe was a woman and I image2imaged backwards to the man using the BigAsp 2.0 checkpoint. I was very pleased with how well this worked overall. Still, I wouldn't consider these optimal keyframes and I was very impressed with how well Wan2.1 handled them.
I used the model on Fal.ai via the API because you can't disable the safety checker otherwise. It might be possible to improve on the results using a comfyui workflow, but I need to figure out the best hyperparameters. I'll be testing that shortly!
I joined two clips, but the prompt for the first clip was "一段逼真的手机视频,一位男士低头看着手中的文件,他的胸部越来越丰满,也越来越女性化。他留着一头黑色的短发,逐渐变长。他坐在床上,身后是一堵白墙。镜头慢慢拉远,展现出他日益女性化的身影。特写,低头"
I used the standard negative in Chinese: "色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走"
I read somewhere that the "first-to-last-frame" model was trained on Chinese, and it did appear to work better than with English (based on some quick comparisons I did). Of course, I'm still learning how to prompt this model so I may be prompting the wrong way entirely.
W0RR13Z
2025-04-28 00:18:45 +0000 UTCBlankage
2025-04-27 23:51:25 +0000 UTCW0RR13Z
2025-04-27 22:43:34 +0000 UTC