Text-to-video model

Text-to-Video is a state of art technology which needs only text as input for outcome as video.The inspiration came from Text-to-image model which delivers images as output for text as input by CogVideo ^[1] .

Video prediction on making objects realistic in stable background by using Recurrent neural network for sequence to sequence model with connector Convolutional neural network encoding/decoding each frame pixel by pixel ^[2] , creating video using Deep learning^[3] .

Methodology

Data collection and data set preparation using clear video from kinetic human action video.

Training the Convolutional neural network for making video. Keywords extraction from text using Natural-language programming .

Testing of Data set in conditional generative model for existing static and dynamic information from text by Variational autoencoder and Generative adversarial network

Models

Different models are there Open source Artificial intelligence is CogVideo presented their code in GitHub ^[4] . Meta Platforms uses text2video with makeavideo.studio ^[5] ,^[6] . ^[7]Google used Imagen Video for converting text 2video ^[8] ^[9] ,^[10] ,^[11] ,^[12]

Antonia Antonova presented another model^[13]

References

^ CogVideo, THUDM, 2022-10-12, retrieved 2022-10-12
^ "Leading India" (PDF).
^ Narain, Rohit (2021-12-29). "Smart Video Generation from Text Using Deep Neural Networks". Retrieved 2022-10-12.
^ CogVideo, THUDM, 2022-10-12, retrieved 2022-10-12
^ Davies, Teli (2022-09-29). "Make-A-Video: Meta AI's New Model For Text-To-Video Generation". W&B. Retrieved 2022-10-12.
^ Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt". Medium. Retrieved 2022-10-12.
^ "Meta's Make-A-Video AI creates videos from text". www.fonearena.com. Retrieved 2022-10-12.
^ "google: Google takes on Meta, introduces own video-generating AI - The Economic Times". m.economictimes.com. Retrieved 2022-10-12.
^ Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt". Medium. Retrieved 2022-10-12.
^ "Nuh-uh, Meta, we can do text-to-video AI, too, says Google". www.theregister.com. Retrieved 2022-10-12.
^ "Papers with Code - See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction". paperswithcode.com. Retrieved 2022-10-12.
^ "Papers with Code - Text-driven Video Prediction". paperswithcode.com. Retrieved 2022-10-12.
^ "Text to Video Generation". Antonia Antonova. Retrieved 2022-10-12.

[1] CogVideo, THUDM, 2022-10-12, retrieved 2022-10-12

[2] "Leading India" (PDF).

[3] Narain, Rohit (2021-12-29). "Smart Video Generation from Text Using Deep Neural Networks". Retrieved 2022-10-12.

[4] CogVideo, THUDM, 2022-10-12, retrieved 2022-10-12

[5] Davies, Teli (2022-09-29). "Make-A-Video: Meta AI's New Model For Text-To-Video Generation". W&B. Retrieved 2022-10-12.

[6] Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt". Medium. Retrieved 2022-10-12.

[7] "Meta's Make-A-Video AI creates videos from text". www.fonearena.com. Retrieved 2022-10-12.

[8] "google: Google takes on Meta, introduces own video-generating AI - The Economic Times". m.economictimes.com. Retrieved 2022-10-12.

[9] Monge, Jim Clyde (2022-08-03). "This AI Can Create Video From Text Prompt". Medium. Retrieved 2022-10-12.

[10] "Nuh-uh, Meta, we can do text-to-video AI, too, says Google". www.theregister.com. Retrieved 2022-10-12.

[11] "Papers with Code - See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction". paperswithcode.com. Retrieved 2022-10-12.

[12] "Papers with Code - Text-driven Video Prediction". paperswithcode.com. Retrieved 2022-10-12.

[13] "Text to Video Generation". Antonia Antonova. Retrieved 2022-10-12.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]