It's okay! English isn't my first language either, but I understand the idea. However, I was thinking of joining both speech bubbles just like this instead:
I personally prefer splitting the information into different bubbles because I think it makes the reading more... dynamic? I read a lot about the important of their sizes, content and information on posts by etheringtonbrothers and similar artists.
Still, what do you think? Would it be less confusing and better for the reader if all the information was inside one single bubble speech?