The captioner is prompted with tags from the JTP Pilot2 tagger. You can supply hand-curated tags instead to get better results. This demo runs on CPU. For faster inference without waiting in the queue, you can duplicate the Space and upgrade it to GPU in the settings.