Today I experimented with Elevenlabs.IO. I pasted some text from my video script and choosed a voice and here is the result. What do you think? Is it natural enough?
She sounds pretty good! Are you able to make space, like more of a pause, between topics? It doesn't sound natural that she goes on to the next subject immediately. I think we would put more of a pause in there, if speaking to a group.