For training of the model, I used Daniel Hesse’s amazing pix2pix TensorFlow (TF) implementation which is really well documented. Luckily, at the workshop Gene also pointed out some existing codebase for generative models like pix2pix. This video was especially suited as the camera position was kind of static so that I could get a lot of images with the same positions of her face and background. At the end, I decided to go with Angela Merkel’s (German chancellor) New Year’s speech in 2017. I looked up several potential videos on YouTube that I could use to create the data that ranges from interviews to speeches from prominent persons.However, this is something that I can try out later to improve performance even more. On another blog article by Satya Mallick, he also recommended to skip frames but I didn’t do this as fps was decent enough now. Reducing the size of the frame by factor four improved fps a lot. I figured out that input frame was just too big.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |