Image Captioning with nlbconnect/vit-gpt2-image-captioning

Carl's NetSuite Notes / 2023-05-05 / 原文

  • https://huggingface.co/nlpconnect/vit-gpt2-image-captioning
  • The Illustrated Image Captioning using transformers
    • Image captioning is the process of generating caption i.e. description from input image. It requires both Natural language processing as well as computer vision to generate the caption.
  • facebook/detr-resnet-50, Sample:
    • Object Detection App with DETR and YOLOS

实例

1.  打开:https://huggingface.co/spaces/SRDdev/Image-Caption

2. 上传图片,拖放或者点击Upload any Image

3. 点击 Submit,稍等片刻,右侧的Captions就会出现图片的介绍