Is it possible to use and train dalle with an external ( frozen) text encoder ( as those available in hugging face) ?