Our work on multi-modal retrieval is given an Oral talk at ACL 2021.