Publications Repository

Details/Description:

Pastra, K. , & Wilks, Y. . (2004). Vision-Language Integration in AI: a reality check. In Proceedings of the 16th European Conference in Artificial Intelligence (Vol. 16, pp. 937–941).

Link:

http://www.csri.gr/files/publications/ECAI04-pastra.pdf

Details/Description:

Pastra, K. . (2004). Viewing Vision-Language Integration as a Double-Grounding Case. In Proceedings of the AAAI Fall Symposium on "Achieving Human-Level Intelligence through Integrated Systems and Research" (pp. 62–67).

Link:

http://www.csri.gr/files/publications/AAAI04-pastra.pdf

Details/Description:

Pastra, K. . (2006). Image-Language Association: are we looking at the right features?. In OntoImage Workshop on Language Resources for Content-based Image Retrieval, Language Resources and Evaluation Conference (LREC).

Link:

http://www.csri.gr/files/publications/LREC06wsh-pastra.pdf

Details/Description:

Pastra K.., E. Balta, P. Dimitrakis, G. Karakatsiotis (2011), Embodied Language Processing: A New Generation of Language Technology, in AAAI workshop on 'Language-Action Tools for Cognitive Artificial Agents'.

Link:

http://www.csri.gr/files/publications/EmboLa-pastra.pdf

Details/Description:

Pastra K. and Y. Aloimonos (2012), The minimalist grammar of action, Philosophical Transactions of the Royal Society of London B: Biological Sciences, vol. 367, pp. 103-117.

Link:

http://www.csri.gr/files/publications/philoTrans-pastra.pdf

Details/Description:

Vatakis A. and K. Pastra (2016), A multimodal dataset of spontaneous speech and movement production on object affordances, Data Science Journal, Nature Publishing.

Link:

http://www.nature.com/articles/sdata201578

Details/Description:

Desmond Elliott, Stella Frank, Khalil Sima'an, Lucia Specia. Multi30K: Multilingual English-German Image Descriptions. 5th Workshop on Vision and Language, pages 70–74, Berlin, Germany, 2016.

Link:

http://www.aclweb.org/anthology/W16-3210.pdf

Details/Description:

Specia, L., Frank, S., Sima'an, K., Elliott, D. A Shared Task on Multimodal Machine Translation and Crosslingual Image Description. In First Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 540–550, WMT, Berlin, Germany, 2016.

Link:

http://www.statmt.org/wmt16/pdf/W16-2346.pdf

Details/Description:

Elliott, D., Frank, S., Barrault, L. Bougares, F., Specia, L. Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description. In 2nd Conference on Machine Translation, WMT, pp .215-233, Copenhagen, Denmark, 2017.

Link:

http://www.statmt.org/wmt17/pdf/WMT18.pdf

Details/Description:

Madhyastha, P.S., Wang, J., Specia, L. Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation. In 2nd Conference on Machine Translation, WMT, pp. 470-476, Copenhagen, Denmark, 2017.

Link:

https://www.google.co.uk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwiilYrcwLXaAhUEJFAKHaSmAqMQFggnMAA&url=http%3A%2F%2Fwww.aclweb.org%2Fanthology%2FW%2FW17%2FW17-4752.pdf&usg=AOvVaw2xApDToVCFgOmcKhLF83Rb

Details/Description:

Lala, C., Madhyastha, P., Wang, J., Specia, L. Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation, Prague Bull. Math. Linguistics, vol 108, pp. 197-208, 2017.

Link:

https://www.google.co.uk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&uact=8&ved=0ahUKEwjnuM6iwLXaAhUJaFAKHe1CDc4QFggqMAA&url=https%3A%2F%2Fufal.mff.cuni.cz%2Fpbml%2F108%2Fart-lala-madhyastha-wang-specia.pdf&usg=AOvVaw1SXou9RKg_q2U6IXGSUXFq

Details/Description:

Madhyastha, P. Wang, J., Specia L. (2018). The role of image representations in vision to language tasks. Natural Language Engineering, Cambridge University Press, pp. 1-25.

Link:

https://doi.org/10.1017/S1351324918000116

Details/Description:

K. Laenen, S. Zoghbi, and M.-F. Moens. 2018. Web Search of Fashion Items with Multimodal Querying. In Proceedings of WSDM 2018: The Eleventh ACM International Conference on Web Search and Data Mining.

Link:

https://dl.acm.org/citation.cfm?id=3159716

Details/Description:

Laenen, K., Zoghbi, S., & Moens, M-F. (2017). Cross-modal search for fashion attributes. In Proceedings of the KDD Workshop on Machine Learning Meets Fashion.

Link:

https://kddfashion2017.mybluemix.net/final_submissions/ML4Fashion_paper_7.pdf

Details/Description:

Learning Representations Specialized in Spatial Knowledge: Leveraging Language and Vision

Link:

https://transacl.org/ojs/index.php/tacl/article/view/1214/288

Details/Description:

Adapting a decision Tree based Tagger for Arabic. Zeroual, I., & Lakhouaja, A. The 2nd International Conference on Information Technology for Organizations Development, March 30 - April 1st, 2016, Fez, Morocco.

Link:

http://ieeexplore.ieee.org/document/7479306/

Details/Description:

Application of Arabic language processing in language learning. El Kah, A., Zeroual, I., & Lakhouaja, A. The 2nd International Conference on Big Data, Cloud and Applications, March 01-03, 2017, Tetuan, Morocco.

Link:

https://dl.acm.org/citation.cfm?doid=3090354.3090390

Details/Description:

Developing and performance evaluation of a new Arabic heavy/light stemmer. Zeroual, I., Boudchiche, M., Mazroui, A., & Lakhouaja, A. The 2nd International Conference on Big Data, Cloud and Applications, March 29-30, 2017, Tetuan, Morocco.

Link:

https://dl.acm.org/citation.cfm?doid=3090354.3090371

Details/Description:

Arabic Information Retrieval: Stemming or Lemmatization?. Zeroual, I., & Lakhouaja, A. The 2nd International Conference on Intelligent Systems and Computer Vision, April 17-18-19, 2017, Fez, Morocco.

Link:

http://ieeexplore.ieee.org/document/8054932/?reload=true

Details/Description:

Towards a standard part of speech tagset for the Arabic language. Zeroual, I., Lakhouaja, A., and Belahbib R. Journal of King Saud University – Computer and Information Sciences, 2017

Link:

http://www.sciencedirect.com/science/article/pii/S1319157817300265

Details/Description:

Gamification for Arabic Natural Language Processing: Ideas into Practice. Zeroual, I, El Kah A. and Lakhouaja A. Transactions on Machine Learning and Artificial Intelligence 5.4 (2017)

Link:

http://scholarpublishing.org/index.php/TMLAI/article/view/3323/

Details/Description:

Feature-rich PoS Tagging through Taggers Combination: Experience in Arabic. Zeroual, I, and Lakhouaja A. Transactions on Machine Learning and Artificial Intelligence 5.4 (2017)

Link:

http://scholarpublishing.org/index.php/TMLAI/article/view/2981

Details/Description:

Arabic Corpus Linguistics: Major Progress, but Still a Long Way to Go. Zeroual I., Lakhouaja A. In: Shaalan K., Hassanien A., Tolba F. (eds) Intelligent Natural Language Processing: Trends and Applications. Studies in Computational Intelligence, vol 740. Springer, Cham. 2018

Link:

https://link.springer.com/chapter/10.1007/978-3-319-67056-0_29

Details/Description:

Hybrid Focused Crawling on the Surface and the Dark Web. C. Iliou, G. Kalpakis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. EURASIP Journal on Information Security, vol. 2017, no. 11, 2017.

Link:

https://link.springer.com/article/10.1186/s13635-017-0064-5

Details/Description:

Multimedia retrieval based on non-linear graph-based fusion and partial least squares regression. I. Gialampoukidis, A. Moumtzidou, D. Liparas, T. Tsikrika, S. Vrochidis, I. Kompatsiaris (2017). Multimedia Tools and Applications, 2017.

Link:

https://link.springer.com/article/10.1007/s11042-017-4797-4

Details/Description:

Gaze Movement-driven Random Forests for Query Clustering in Automatic Video Annotation. S. Vrochidis, I. Patras and I. Kompatsiaris. Multimedia Tools and Applications, 2016.

Link:

https://link.springer.com/article/10.1007/s11042-015-3221-1

Details/Description:

Interactive Video Search Tools: A Detailed Analysis of the Video Browser Showdown 2015. C. Cobarzan, K. Schoeffmann, W. Bailer, W. Hurst, A. Blazek, J. Lokoc, S. Vrochidis, K. U. Barthel, and L. Rossetto. In Multimedia Tools and Applications (MTAP), 2016, pp. 1-33.

Link:

https://link.springer.com/article/10.1007/s11042-016-3661-2

Details/Description:

Focussed Crawling of Environmental Web Resources Based on the Combination of Multimedia Evidence. T. Tsikrika, A. Moumtzidou, S. Vrochidis and I. Kompatsiaris. Multimedia Tools and Applications, May 2015, pp 1-25.

Link:

https://link.springer.com/article/10.1007/s11042-015-2624-3

Details/Description:

Environmental data extraction from heatmaps using the AirMerge system Multimedia Tools and Applications. V. Epitropou, T. Bassoukos, K. Karatzas, A. Karppinen, L. Wanner, S. Vrochidis, I. Kompatsiaris, J. Kukkonen. May 2015. pp. 1-25.

Link:

https://link.springer.com/article/10.1007/s11042-015-2604-7

Details/Description:

Ontology-centered environmental information delivery for personalized decision support. L. Wanner, M. Rospocher, S. Vrochidis, L. Johansson, N. Bouayad-Aghae, G. Casamayor, A. Karppinen, I. Kompatsiaris, S. Millee, A. Moumtzidou, L. Serafini. Expert Systems With Applications, Volume 42, Issue 12, 15 July 2015, Pages 5032–5046.

Link:

https://www.sciencedirect.com/science/article/pii/S0957417415001554

Details/Description:

Fusion of meteorological and air quality data extracted from the web for personalized environmental information services. L. Johansson, V. Epitropou, K. Karatzas, A. Karppinen, L. Wanner, S. Vrochidis, A. Bassoukos, J. Kukkonen, I. Kompatsiaris. Environmental Modeling and Software Journal, 2015, Volume 64, February 2015, pp. 143–155.

Link:

https://www.sciencedirect.com/science/article/pii/S1364815214003478

Details/Description:

A Model for Environmental Data Extraction from Multimedia and its Evaluation against various Chemical Weather Forecasting Datasets. A. Moumtzidou, V. Epitropou, S. Vrochidis, K. Karatzas, S. Voth, A. Bassoukos, J. Moßgraber, A. Karppinen, J. Kukkonen and I. Kompatsiaris. Journal of Ecological Informatics, pp. 69-82, 2014, special issue, ISSN 1574-9541.

Link:

https://www.sciencedirect.com/science/article/pii/S1574954113000745

Details/Description:

OSINT and the Dark Web. G. Kalpakis, T. Tsikrika, N. Cunningham, C. Iliou, S. Vrochidis, J. Middleton, I. Kompatsiaris. In “Open Source Intelligence Investigation – From Strategy to Implementation”, B. Akhgar, P. S. Bayerl, F. Sampson (Eds.), Springer, 2016.

Link:

https://link.springer.com/chapter/10.1007%2F978-3-319-47671-1_8

Details/Description:

Enhancing Patent Search with Content-based Image Retrieval. S. Vrochidis, A. Moumtzidou, I. Kompatsiaris. Professional Search in the Modern World, Lecture Notes in Computer Science Volume 8830, 2014, pp 250-273.

Link:

https://link.springer.com/content/pdf/10.1007%2F978-3-319-12511-4_12.pdf

Details/Description:

Description Logics and Rules for Multimodal Situational Awareness in Healthcare. G. Meditskos, S. Vrochidis, I. Kompatsiaris. Special session on Multimedia and Multimodal Interaction for Health and Basic Care Applications at MMM 2017, 2016.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-51811-4_58

Details/Description:

VERGE IN VBS 2017. A. Moumtzidou, T. Mironidis, F. Markatopoulou, S. Andreadis, I. Gialampoukidis, D. Galanopoulos, A. Ioannidou, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. Video Browser Showdown (VBS’17) at the 23rd Int. Conf. on MultiMedia Modeling (MMM’17), Reykjavik, Iceland, 4 January 2017.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-51814-5_46

Details/Description:

Ontology-Driven Context Interpretation and Conflict Resolution in Dialogue-Based Home Care Assistance. G. Meditskos, G., Kontopoulos, E., Vrochidis, S., & Kompatsiaris, I. (2016). In: Paschke, A., Burger, A., Splendiani, A., Marshall, M.S., and Romano, P. (eds.) 9th Int. Conf. on Semantic Web Applications and Tools for Life Sciences – SWAT4LS. CEUR Workshop Proceedings Vol 1795, Amsterdam, The Netherlands (2016).

Link:

http://ceur-ws.org/Vol-1795/paper1.pdf

Details/Description:

ITI-CERTH participation in TRECVID 2016. F. Markatopoulou, A. Moumtzidou, D. Galanopoulos, T. Mironidis, V. Kaltsa, A. Ioannidou, S. Symeonidis, K. Avgerinakis, S. Andreadis, I. Gialampoukidis, S. Vrochidis, A. Briassouli, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. TRECVID 2016 Workshop, Gaithersburg, MD, USA, Nov. 2016.

Link:

http://www-nlpir.nist.gov/projects/tvpubs/tv16.papers/iti-certh.pdf

Details/Description:

Incremental estimation of visual vocabulary size for image retrieval. I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. In INNS Conference on Big Data, pp. 29-38. Springer International Publishing, 2016.

Link:

https://link.springer.com/content/pdf/10.1007/978-3-319-47898-2_4.pdf

Details/Description:

Community Detection in Complex Networks Based on DBSCAN* and a Martingale Process. I. Gialampoukidis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. Semantic and Social Media Adaptation and Personalization (SMAP), 2016 11th International Workshop on, pp. 1-6. IEEE, 2016.

Link:

http://ieeexplore.ieee.org/document/7753375/

Details/Description:

Towards Air Quality Estimation Using Collected Multimodal Environmental Data. A. Moumtzidou, S. Papadopoulos, S. Vrochidis, I. Kompatsiaris, K. Kourtidis, G. Hloupis, I. Stavrakas, K. Papachristopoulou, and C. Keratidis. 1st International Workshop on Internet and Social media for Environmental Monitoring (In conjunction with the 3rd international conference on Internet Science (INSCI 2016)), Florence, Italy, 12 September 2016.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-50237-3_7

Details/Description:

Semantic integration of web data for international investment decision support. B. Simeonov, V. Alexiev, D. Liparas, M. Puigbo, S. Vrochidis, E. Jamin and I. Kompatsiaris. 3rd international conference on Internet Science, Florence, Italy, Sept. 12-14 2016.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-45982-0_18

Details/Description:

Question Answering over Pattern-Based User Models. G. Meditskos, S. Dasiopoulou, S. Vrochidis, L. Wanner, I. Kompatsiaris. In Proceedings of the 12th International Conference on Semantic Systems (SEMANTiCS 2016), pp. 153-160. ACM, New York, NY, USA, 2016.

Link:

http://delivery.acm.org/10.1145/3000000/2993331/p153-meditskos.pdf?ip=160.40.51.77&id=2993331&acc=ACTIVE%20SERVICE&key=5641A0C343C36AC1%2E80105867122BFAB8%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&__acm__=1518604495_6c65ec83b2793aee902157535b0d4993

Details/Description:

Hybrid Focused Crawling for Homemade Explosives Discovery on Surface and Dark Web. C. Iliou, G. Kalpakis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. 11th International Conference on Availability, Reliability and Security (ARES 2016), Salzburg, Austria, Aug 2016.

Link:

http://ieeexplore.ieee.org/document/7784575/

Details/Description:

Query-based Topic Detection Using Concepts and Named Entities. I. Gialampoukidis, D. Liparas, S. Vrochidis, I. Kompatsiaris. 1st International Workshop on Multimodal Media Data Analytics (MMDA 2016), The Hague, Netherlands, August 30, 2016.

Link:

https://pdfs.semanticscholar.org/b811/747fa6f82e2f878e950fd16cda15e0358af6.pdf

Details/Description:

Key player identification in terrorism-related social media networks using centrality measures. I. Gialampoukidis, G. Kalpakis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. In Intelligence and Security Informatics Conference (EISIC), 2016 European, pp. 112-115. IEEE, 2016.

Link:

http://ieeexplore.ieee.org/document/7870202/

Details/Description:

Interactive Discovery and Retrieval of Web Resources Containing Home Made Explosive Recipes. G. Kalpakis, T. Tsikrika, C. Iliou, T. Mironidis, S. Vrochidis, J. Middleton, U. Williamson, I. Kompatsiaris. 4th International Conference on Human Aspects of Information Security, Privacy and Trust, Toronto, Canada, 17 – 22 July 2016.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-39381-0_20

Details/Description:

A Hybrid framework for news clustering based on the DBSCAN-Martingale and LDA. I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. Machine Learning and Data Mining in Pattern Recognition, pp. 170-184, Springer International Publishing, 2016.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-41920-6_13

Details/Description:

A hybrid graph-based and non-linear late fusion approach for multimedia retrieval. I. Gialampoukidis, A. Moumtzidou, D. Liparas, S. Vrochidis, I. Kompatsiaris. In Content-Based Multimedia Indexing (CBMI), 2016 14th International Workshop on, pp. 1-6, IEEE, 2016.

Link:

http://ieeexplore.ieee.org/document/7500252/

Details/Description:

A Multimedia Interactive Search Engine based on Graph-based and Non-linear Multimodal Fusion. A. Moumtzidou, I. Gialampoukidis, T. Mironidis, D. Liparas, S. Vrochidis, I. Kompatsiaris. In Content-Based Multimedia Indexing (CBMI), 2016 14th International Workshop on, IEEE, 2016.

Link:

http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=7500276

Details/Description:

Retrieval of Multimedia objects by Fusing Multiple Modalities. I. Gialampoukidis, A. Moumtzidou, T. Tsikrika, S. Vrochidis and I. Kompatsiaris. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, pp. 359-362, ACM.

Link:

http://delivery.acm.org/10.1145/2920000/2912068/p359-gialampoukidis.pdf?ip=160.40.51.77&id=2912068&acc=ACTIVE%20SERVICE&key=5641A0C343C36AC1%2E80105867122BFAB8%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&__acm__=1518604970_6cb311bcb9d9a5d08ddf43ee04a2a57e

Details/Description:

Towards a Multimedia Knowledge-Based Agent with Social Competence and Human Interaction Capabilities. L. Wanner, J. Blat, S. Dasiopoulou, M. Domínguez, G. Llorach, S. Mille, F. Sukno, E. Kamateri, S. Vrochidis, I. Kompatsiaris, E. André, F. Lingenfelser, G. Mehlmann, A. Stam, L. Stellingwerff, B. Vieru, L. Lamel, W. Minker, L. Pragst, Stefan Ultes. 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction (MARMI 2016), New York, USA, June 6, 2016.

Link:

http://delivery.acm.org/10.1145/2930000/2927011/p21-wanner.pdf?ip=160.40.51.77&id=2927011&acc=ACTIVE%20SERVICE&key=5641A0C343C36AC1%2E80105867122BFAB8%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&__acm__=1518604951_820019f8cc4415f8b5dc503f209d26ff

Details/Description:

Towards an Ontology-driven Adaptive Dialogue Framework. G. Meditskos, S. Dasiopoulou, Louisa Pragst, S. Ultes, S. Vrochidis, I. Kompatsiaris, L. Wanner. 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction (MARMI 2016), New York, USA, June 6, 2016.

Link:

http://delivery.acm.org/10.1145/2930000/2927009/p15-meditskos.pdf?ip=160.40.51.77&id=2927009&acc=ACTIVE%20SERVICE&key=5641A0C343C36AC1%2E80105867122BFAB8%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&__acm__=1518604928_3f5d54ec904826f62f099b05a1e5f053

Details/Description:

A Multimodal Annotation Schema for Non-Verbal Affective Analysis in the Health-Care Domain. Sukno, M. Dominguez, A. Ruiz Ovejero, D. Schiller, F. Lingenfelser, L. Pragst, E. Kamateri, S. Vrochidis. 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction (MARMI 2016), New York, USA, June 6, 2016.

Link:

http://delivery.acm.org/10.1145/2930000/2927008/p9-sukno.pdf?ip=160.40.51.77&id=2927008&acc=ACTIVE%20SERVICE&key=5641A0C343C36AC1%2E80105867122BFAB8%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&__acm__=1518604901_b536cc4c62d7ad88518f610413da0c89

Details/Description:

VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval. A. Moumtzidou, T. Mironidis, E. Apostolidis, F. Markatopoulou, A. Ioannidou, I. Gialampoukidis, K. Avgerinakis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. Video Browser Showdown (VBS’16) at the 22nd Int. Conf. on MultiMedia Modeling (MMM’16), Miami, USA, 4 January 2016.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-27674-8_39

Details/Description:

Fast Visual Vocabulary Construction for Image Retrieval using Skewed-Split k-d trees. I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. In International Conference on Multimedia Modeling, pp. 466-477. Springer International Publishing, 2016.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-27671-7_39

Details/Description:

ITI-CERTH participation to TRECVID 2015. F. Markatopoulou, A. Ioannidou, C. Tzelepis, T. Mironidis, D. Galanopoulos, S. Arestis-Chartampilas, N. Pittaras, K. Avgerinakis, N. Gkalelis, A. Moumtzidou, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. TRECVID 2015 Workshop, Gaithersburg, MD, USA, Nov. 2015.

Link:

http://www-nlpir.nist.gov/projects/tvpubs/tv15.papers/iti-certh.pdf

Details/Description:

Exploiting visual similarities for ontology alignment. C. Doulaverakis, S. Vrochidis, I. Kompatsiaris. 7th International Conference on Knowledge Engineering and Ontology Development (KEOD 2015), Lisbon, Portugal, 12-14 November, 2015.

Link:

https://www.multisensorproject.eu/wp-content/uploads/2016/11/Doulaverakis_KEOD2015_camera_ready.pdf

Details/Description:

Classification using various ML Methods and Combinations of Key-Phrases and Visual Features. Y. Hacohen-Kerner, A. Sabag, D. Liparas, A. Moumtzidou, S. Vrochidis and I. Kompatsiaris. 1st KEYSTONE Conference (IKC2015), Coimbra, Portugal, September 8-9, 2015.

Link:

https://www.multisensorproject.eu/wp-content/uploads/2016/11/Camera_ready_IKC2015_Hacohen-Kerner_etal.pdf

Details/Description:

A Framework for the Discovery, Analysis, and Retrieval of Multimedia Homemade Explosives Information on the Web. T. Tsikrika, G. Kalpakis, S. Vrochidis, I. Kompatsiaris, I. Paraskakis, I. Kavasidis, J. Middleton, and U. Williamson. In Proceedings of the International Workshop on Multimedia Forensics and Security (MFSec 2015), held in conjunction with the 10th International Conference on Availability, Reliability and Security, Toulouse, France, 2015.

Link:

http://ieeexplore.ieee.org/document/7299970/

Details/Description:

Concept Detection on Multimedia Web Resources about Home Made Explosives. G. Kalpakis, T. Tsikrika, F. Markatopoulou, N. Pittaras, S. Vrochidis, V. Mezaris, I. Patras, and I. Kompatsiaris. In Proceedings of the International Workshop on Multimedia Forensics and Security (MFSec 2015), held in conjunction with the 10th International Conference on Availability, Reliability and Security, Toulouse, France, 2015.

Link:

http://ieeexplore.ieee.org/document/7299974/

Details/Description:

MULTISENSOR: Development of Multimedia Content Integration Technologies for Journalism, Media Monitoring and International Exporting Decision Support. S. Vrochidis, I. Kompatsiaris, G. Casamayor, I. Arapakis, R. Busch, V. Alexiev, E. Jamin, M. Jugov, N. Heise, T. Forrellat, D. Liparas, L. Wanner, I. Miliaraki, V. Aleksic, K. Simov, A. M. Soro, M. Eckhoff, T. Wagner, M. Puigbó. 2015 IEEE International Conference on Multimedia and Expo (ICME 2015), Turin, Italy, June 29 – July 3, 2015.

Link:

http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7169818

Details/Description:

Discovery of Environmental Web Resources Based on the Combination of Multimedia Evidence. T. Tsikrika, A. Latas, A. Moumtzidou, E. Chatzilari, S. Vrochidis and I. Kompatsiaris. 2nd International Workshop on Environmental Multimedia Retrieval (EMR 2015), Shanghai, China, June 23, 2015.

Link:

http://delivery.acm.org/10.1145/2770000/2764876/p27-tsikrika.pdf?ip=160.40.51.77&id=2764876&acc=ACTIVE%20SERVICE&key=5641A0C343C36AC1%2E80105867122BFAB8%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&__acm__=1518604674_60ea9b5a6976f9b4fc466fc5b8484961

Details/Description:

A Unified Model for Socially Interconnected Multimedia-Enriched Objects. T. Tsikrika, K. Andreadou, A. Moumtzidou, E. Schinas, S. Papadopoulos, S. Vrochidis, Y. Kompatsiaris. 21st MultiMedia Modelling Conference (MMM2015), Sydney, Australia, 5-7 January, 2015.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-14445-0_32

Details/Description:

VERGE: A Multimodal Interactive Video Search Engine. A. Moumtzidou, K. Avgerinakis, E. Apostolidis, F. Markatopoulou, K. Apostolidis, T. Mironidis, S. Vrochidis, V. Mezaris, Y. Kompatsiaris, I. Patras. Proc. 21st Int. Conf. on MultiMedia Modeling (MMM15), Sydney, Australia, Jan. 2015.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-14442-9_23

Details/Description:

News articles classification using Random Forests and weighted multimodal features. D. Liparas, Y. Hacohen-Kerner, A. Moumtzidou, S. Vrochidis and I. Kompatsiaris. Proceedings of the 3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference (IRFC2014), 10 – 12 November 2014, Copenhagen, Denmark, LNCS 8849, pp. 63-75, Berlin: Springer-Verlag, 2014.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-12979-2_6

Details/Description:

ITI-CERTH participation to TRECVID 2014. N. Gkalelis, F. Markatopoulou, A. Moumtzidou, D. Galanopoulos, K. Avgerinakis, N. Pittaras, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. TRECVID 2014 Workshop, Orlando, FL, USA, November 2014.

Link:

https://www.iti.gr/~bmezaris/publications/trecvid2014.pdf

Details/Description:

Concept-oriented labelling of patent images based on Random Forests and proximity-driven generation of synthetic data. D. Liparas, A. Moumtzidou, S. Vrochidis, I. Kompatsiaris. COLING’14 Workshop on Vision and Language (VL’14), Dublin, August 23, 2014.

Link:

http://www.aclweb.org/anthology/W14-5404

Details/Description:

Key-phrase Extraction using Textual and Visual Features. Y. HaCohen-Kerner, S. Vrochidis, D. Liparas, A. Moumtzidou and I. Kompatsiaris. 3rd Workshop on Vision and Language (VL), Dublin, Ireland, August 23-29, 2014.

Link:

https://www.multisensorproject.eu/wp-content/uploads/2016/11/W14-5421.pdf

Details/Description:

Detection of Terrorism-related Twitter Communities using Centrality Scores. I. Gialampoukidis, G. Kalpakis, T. Tsikrika, S. Papadopoulos, S. Vrochidis, I. Kompatsiaris. In Proceedings of International Workshop on Multimedia Forensics and Security (MFSec 2017), Bucharest, Romania, June 06, 2017 (accepted for publication).

Link:

http://delivery.acm.org/10.1145/3090000/3080534/p21-gialampoukidis.pdf?ip=160.40.51.77&id=3080534&acc=ACTIVE%20SERVICE&key=5641A0C343C36AC1%2E80105867122BFAB8%2E4D4702B0C3E38B35%2E4D4702B0C3E38B35&__acm__=1518603127_e93d391169af8c97fd70f53219db13aa

Details/Description:

Intelligent traffic city management from surveillance systems (CERTH-ITI). Avgerinakis, P. Giannakeris, A. Briassouli, A. Karakostas, S. Vrochidis, I. Kompatsiaris. NVIDIA AI city challenge, IEEE Smart World, Aug. 2017, USA.

Link:

http://smart-city-sjsu.net/AICityChallenge/papers/NVIDIA_AI_City_Challenge_2017_paper_2.pdf

Details/Description:

Visual and textual analysis of social media and satellite images for flood detection. Avgerinakis, A. Moumtzidou, S. Andreadis, E. Michail, I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. Multimedia Satellite task of MediaEval 2017.

Link:

http://ceur-ws.org/Vol-1984/Mediaeval_2017_paper_31.pdf

Details/Description:

Crater monitoring through social media observations. Gialampoukidis, S. Vrochidis and I. Kompatsiaris. In European Planetary Science Congress 2017, 17–22 September 2017.

Link:

http://meetingorganizer.copernicus.org/EPSC2017/EPSC2017-25-1.pdf

Details/Description:

LBP-flow and hybrid encoding for real-time water and fire classification. Avgerinakis, P. Giannakeris, A. Briassouli, A. Karakostas, S. Vrochidis, I. Kompatsiaris. ICCV-MSF 2017, IEEE/ISPRS 4th Joint Workshop on Multi-Sensor Fusion for Dynamic Scene Understanding.

Link:

http://openaccess.thecvf.com/content_ICCV_2017_workshops/papers/w6/Avgerinakis_LBP-Flow_and_Hybrid_ICCV_2017_paper.pdf

Details/Description:

Unsupervised Keyword Extraction Using the GoW Model and Centrality Scores. E. Batziou, I. Gialampoukidis, S. Vrochidis, I. Antoniou, I. Kompatsiaris. In International Conference on Internet Science, pp. 344-351. Springer, Cham, 2017.

Link:

https://link.springer.com/chapter/10.1007/978-3-319-70284-1_26

Details/Description:

A long short-term memory based Schaeffer gesture recognition system SO Oprea, A Garcia‐Garcia, S Orts‐Escolano, V Villena‐Martinez, JA Castro‐Vargas, Expert Systems, online. DOI: 10.1111/exsy.12247

Link:

http://onlinelibrary.wiley.com/doi/10.1111/exsy.12247/full

Details/Description:

Automatic Schaeffer's gestures recognition system Francisco Gomez‐Donoso, Miguel Cazorla, Alberto Garcia‐Garcia, Jose Garcia‐Rodriguez. Expert Systems. Volume 33, Issue 5, October 2016, Pages 480–488

Link:

http://onlinelibrary.wiley.com/doi/10.1111/exsy.12160/full

Details/Description:

A Markov Network Based Passage Retrieval Method for Multimodal Question Answering in the Cultural Heritage Domain. Shurong Sheng, Aparna Nurani Venkitasubramanian and Marie-Francine Moens. In Proceedings of the 24th International Conference on Multimedia Modeling (MMM2018), Bangkok, Thailand, 2018. Lecture Notes in Computer Science

Link:

https://link.springer.com/chapter/10.1007/978-3-319-73603-7_1

Details/Description:

Collell, G., Van Gool, L., & Moens, M. F. (2018) Acquiring common sense spatial knowledge through implicit spatial templates. AAAI Conference on Artificial Intelligence. AAAI

Link:

https://arxiv.org/abs/1711.06821

Details/Description:

Video Description using Bidirectional Recurrent Neural Networks Álvaro Peris, Marc Bolaños, Petia Radeva, Francisco Casacuberta 25th International Conference on Artificial Neural Networks (ICANN) Lecture Notes in Computer Science: 9887:3-11, 2016

Link:

https://link.springer.com/chapter/10.1007/978-3-319-44781-0_1

Details/Description:

VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering Marc Bolaños, Álvaro Peris, Petia Casacuberta, Francisco, Radeva IbPRIA: 8th Iberian Conference on Pattern Recognition and Image Analysis (LNCS) Lecture Notes in Computer Science, 10255:372-380, 2017

Link:

https://link.springer.com/chapter/10.1007/978-3-319-58838-4_41

Details/Description:

Egocentric video description based on temporally-linked sequences M Bolaños, Á Peris, F Casacuberta, S Soler, P Radeva Journal of Visual Communication and Image Representation 50:205-216, 2018

Link:

https://www.sciencedirect.com/science/article/pii/S1047320317302316

Details/Description:

Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A Farrugia, Claudia Borg, Kenneth Camilleri, Mike Rosner and Lonneke van der Plas (2018) Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions. To be published in Proceedings of LREC 2018.

Link:

http://lrec2018.lrec-conf.org/en/conference-programme/accepted-papers/

Details/Description:

Patrizia Paggio, Costanza Navarretta and Bart Jongejan (2017) Automatic identification of head movements in video-recorded conversations: can words help?. In Proceedings of the 6th Workshop on Vision and Language, pp. 40-42. The Association for Computational Linguistics.

Link:

http://aclweb.org/anthology/W17-2000

Details/Description:

Huu Ton Le, Thierry Urruty, Syntyche Gbèhounou, François Lecellier, Jean Martinet, Christine Fernandez-Maloigne: Improving retrieval framework using information gain models. Signal, Image and Video Processing 11(2): 309-316 (2017)

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2017/le17improving.pdf

Details/Description:

Amel Aissaoui, Afifa Dahmane, Jean Martinet, Ioan Marius Bilasco: Introducing FoxFaces: A 3-in-1 Head Dataset. VISIGRAPP (4: VISAPP) 2016: 533-537

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2016/aissaoui16introducing.pdf

Details/Description:

Jalila Filali, Hajer Baazaoui Zghal, Jean Martinet: Visually Supporting Image Annotation Based on Visual Features and Ontologies. IV 2017: 182-187

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2017/filali17visually.pdf

Details/Description:

Jalila Filali, Hajer Baazaoui Zghal, Jean Martinet: Towards Visual Vocabulary and Ontology-based Image Retrieval System. ICAART (2) 2016: 560-565

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2016/filali16towards.pdf

Details/Description:

Rémi Auguste, Jean Martinet, Pierre Tirilly: Space-time Histograms And Their Application To Person Re-identification In TV Shows. ICMR 2015: 91-97

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2015/auguste15space.pdf

Details/Description:

Thierry Urruty, Syntyche Gbèhounou, Huu Ton Le, Jean Martinet, Christine Fernandez-Maloigne: Iterative Random Visual Word Selection. ICMR 2014: 249

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2014/urruty14iterative.pdf

Details/Description:

Jean Martinet: From Text Vocabularies to Visual Vocabularies - What Basis?. VISAPP (2) 2014: 668-675

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2014/martinet14from.pdf

Details/Description:

Meriem Bendris, Benoît Favre, Delphine Charlet, Géraldine Damnati, Grégory Senay, Rémi Auguste, Jean Martinet: Unsupervised face identification in TV content using audio-visual sources. CBMI 2013: 243-249

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/2013/bendriss13unsupervised

Details/Description:

Ismail Elsayad, Jean Martinet, Thierry Urruty, Chabane Djeraba: Toward a higher-level visual representation for content-based image retrieval. Multimedia Tools Appl. 60(2): 455-482 (2012)

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/elsayad12toward.pdf

Details/Description:

Ismail Elsayad, Jean Martinet, Thierry Urruty, Chabane Djeraba: A new spatial weighting scheme for bag-of-visual-words. CBMI 2010: 1-6

Link:

http://www.cristal.univ-lille.fr/~martinej/papers/elsayad10new.pdf

Details/Description:

Sorodoc, I., Lazaridou, A., Boleda, G., Herbelot, A., Pezzelle, S., & Bernardi, R. (2016). “Look, some green circles!”: Learning to quantify from images. In Proceedings of the 5th Workshop on Vision and Language (VL'16), co-located with ACL 2016, Berlin, Germany, August 2016.

Link:

https://aclweb.org/anthology/W/W16/W16-3211.pdf

Details/Description:

Pezzelle, S., Marelli, M., & Bernardi, R. (2017). Be Precise or Fuzzy: Learning the Meaning of Cardinals and Quantifiers from Vision. EACL 2017, Valencia, Spain, April 2017.

Link:

http://aclweb.org/anthology/E17-2054

Details/Description:

Pezzelle, S., Shekhar, R., & Bernardi, R. (2016). Building a bagpipe with a bag and a pipe: Exploring Conceptual Combination in Vision. In Proceedings of the 5th Workshop on Vision and Language (VL'16), co-located with ACL 2016, Berlin, Germany, August 2016.

Link:

http://www.aclweb.org/anthology/W/W16/W16-3208.pdf

Details/Description:

Sina Zarrieß, Julian Hough, Casey Kennington, Ramesh Manuvinakurike, David DeVault, Raquel Fernández, and David Schlangen. PentoRef: A Corpus of Spoken References in Task-oriented Dialogues. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), pp. 125-131, 2016.

Link:

http://www.lrec-conf.org/proceedings/lrec2016/pdf/563_Paper.pdf

Details/Description:

Angeliki Lazaridou, Grzegorz Chrupała, Raquel Fernández, and Marco Baroni. Multimodal Semantic Learning from Child-Directed Input. In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp. 387-392, 2016.

Link:

http://aclweb.org/anthology/N/N16/N16-1043.pdf

Details/Description:

Development of a common validation framework for the evaluation of polyp detection methods, including definition of new databases and performance metrics Bernal, J., Tajkbaksh, N., Sánchez, F. J., Matuszewski, B. J., Chen, H., Yu, L., ... & Histace, A. (2017). comparative validation of polyp detection methods in video colonoscopy: results from the MICCAI 2015 endoscopic vision challenge. IEEE transactions on medical imaging, 36(6), 1231-1249.

Link:

https://hal.archives-ouvertes.fr/hal-01488652/document

Details/Description:

Setting up the first benchmark of polyp segmentation in colonoscopy images Vázquez, D., Bernal, J., Sánchez, F. J., Fernández-Esparrach, G., López, A. M., Romero, A., ... & Courville, A. (2017). A Benchmark for Endoluminal Scene Segmentation of Colonoscopy Images. Journal of healthcare engineering, 2017.

Link:

http://downloads.hindawi.com/journals/jhe/2017/4037190.pdf

Details/Description:

Creation of a software to detect automatically specular highlights in images Sánchez, F. J., Bernal, J., Sánchez-Montes, C., de Miguel, C. R., & Fernández-Esparrach, G. (2017). Bright spot regions segmentation and classification for specular highlights detection in colonoscopy videos. Machine Vision and Applications, 28(8), 917-936.

Link:

http://refbase.cvc.uab.es/files/SBS2017.pdf

Details/Description:

A Distributed Representation Based Query Expansion Approach for Image Captioning, S. Yagcioglu, E. Erdem, A. Erdem and R. Cakici, The 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015), Beijing, China, July 2015

Link:

http://semihyagcioglu.com/projects/image-captioning/

Details/Description:

Leveraging Captions in the Wild to Improve Object Detection, M. Kilickaya, N. Ikizler-Cinbis, E. Erdem and A. Erdem, The 5th Workshop on Vision and Language (VL'16) - in conjuction with ACL 2016, Berlin, Germany, August 2016

Link:

https://web.cs.hacettepe.edu.tr/~erkut/publications/acl-vl16.pdf

Details/Description:

Re-evaluating Automatic Metrics for Image Captioning, M. Kilickaya, A. Erdem, N. Ikizler-Cinbis and E. Erdem, The 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Valencia, Spain, April 2017

Link:

http://aclweb.org/anthology/E17-1019

Details/Description:

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures, R. Bernardi, R. Cakici, D. Elliott, A. Erdem, E. Erdem, N. Ikizler-Cinbis, F. Keller, A. Muscat, B. Plank, Journal of Artificial Intelligence Research, 55, pp. 409-442, February 2016

Link:

http://arxiv.org/pdf/1601.03896v1.pdf

Details/Description:

Kilickaya, M. , Kerim Akkus, B., Cakici, R., Erdem, A., Erdem, E., and Ikizler-Cinbis, N. (2017) Data-driven image captioning via salient region discovery. IET Computer Vision,, 11(6), pages. 398-406, September 2017

Link:

https://web.cs.hacettepe.edu.tr/~erkut/publications/iet-cv2016.pdf

Details/Description:

Jia, X., Gavves, S., Fernando, B., Tuytelaars, T., (2015) Guiding Long-Short Term Memory for Image Caption Generation, International Conference on Computer Vision (ICCV), 2015.

Link:

http://homes.esat.kuleuven.be/~xjia/xjia_publications/xjia_iccv15_glstm.pdf

Details/Description:

Calixto, I., Stein, D., Matusov, E., Lohar, P., Castilho, S., and Way, A. (2017). Using images to improve machine-translating e-commerce product listings. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 637–643, Valencia, Spain.

Link:

http://www.aclweb.org/anthology/E17-2101

Details/Description:

Calixto, I., Stein, D., Matusov, E., Castilho, S., and Way, A. (2017d). Human evaluation of multi-modal neural machine translation: A case-study on e-commerce listing titles. In Proceedings of the Sixth Workshop on Vision and Language, pages 31–37, Valencia, Spain.

Link:

http://www.aclweb.org/anthology/W17-2004

Details/Description:

Calixto, I. and Liu, Q. (2017). Sentence-Level Multilingual Multi-modal Embedding for Natural Language Processing. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 139-148, Varna, Bulgaria.

Link:

http://www.acl-bg.org/proceedings/2017/RANLP%202017/pdf/RANLP020.pdf

Details/Description:

Calixto, I. and Liu, Q. (2017) Incorporating Global Visual Features into Attention-based Neural Machine Translation. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 992-1003, Copenhagen, Denmark.

Link:

http://aclweb.org/anthology/D17-1105

Details/Description:

Calixto, I., Liu, Q., and Campbell, N. (2017a). Doubly-Attentive Decoder for Multi-modal Neural Machine Translation. In Proceedings of the 55th Conference of the Association for Computational Linguistics: Volume 1, Long Papers, Vancouver, Canada.

Link:

http://www.aclweb.org/anthology/P17-1175

Details/Description:

Calixto, I., Elliott, D., and Frank, S. (2016). DCU-UvA Multimodal MT System Report. In Proceedings of the First Conference on Machine Translation, pages 634–638, Berlin, Germany.

Link:

http://www.statmt.org/wmt16/pdf/W16-2359.pdf

Details/Description:

Calixto, I., de Campos, T., and Specia, L. (2012). Images as context in Statistical Machine Translation. In The 2nd Annual Meeting of the EPSRC Network on Vision & Language (VL’12), Sheffield, UK. EPSRC Vision and Language Network.

Link:

http://www.ee.surrey.ac.uk/CVSSP/Publications/papers/Calixto-VL-2012.pdf

Details/Description:

Emiel van Miltenburg, Roser Morante, and Desmond Elliott, “Pragmatic factors in image description: the case of negations,” in Proceedings of the 5th workshop on vision and language, 2016, pp. 54-59.

Link:

http://www.aclweb.org/anthology/W/W16/W16-3207.pdf

Details/Description:

Emiel van Miltenburg, Desmond Elliott, and Piek Vossen, “Cross-linguistic differences and similarities in image descriptions,” in Proceedings of the 10th international conference on natural language generation, Santiago de Compostela, Spain, 2017, pp. 21-30.

Link:

http://www.aclweb.org/anthology/W/W17/W17-3503.pdf

Details/Description:

Elliott, D., Frank, S., Barrault, L. Bougares, F., Specia, L. Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description. In 2nd Conference on Machine Translation, WMT, pp .215-233, Copenhagen, Denmark, 2017.

Link:

http://www.statmt.org/wmt17/pdf/WMT18.pdf

Details/Description:

Madhyastha, P.S., Wang, J., Specia, L. Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation. In 2nd Conference on Machine Translation, WMT, pp. 470-476, Copenhagen, Denmark, 2017.

Link:

http://www.aclweb.org/anthology/W/W17/W17-4752.pdf

Details/Description:

Lala, C., Madhyastha, P., Wang, J., Specia, L. Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation, Prague Bull. Math. Linguistics, vol 108, pp. 197-208, 2017.

Link:

https://ufal.mff.cuni.cz/pbml/108/art-lala-madhyastha-wang-specia.pdf

Details/Description:

Dat Tien Nguyen, Angeliki Lazaridou, Raffaella Bernardi Coloring Objects: Adjective-Noun Visual Semantic Compositionality Proceedings of the Third Workshop on Vision and Language, Dublin City University and the Association for Computational Linguistics, Pages: 112–114

Link:

http://aclweb.org/anthology/W14-5418

Details/Description:

Dieu-Thu Le, Jasper Uijlings, Raffaella Bernardi TUHOI: Trento Universal Human Object Interaction Dataset Proceedings of the Third Workshop on Vision and Language, Dublin City University and the Association for Computational Linguistics Pages: 17–24

Link:

http://aclweb.org/anthology/W14-5403

Details/Description:

Ionut Sorodoc and Angeliki Lazaridou and Gemma Boleda Aurelie Herbelot ´ and Sandro Pezzelle and Raffaella Bernardi “Look, some green circles!”: Learning to quantify from images Proceedings of the 5th Workshop on Vision and Language, pages 75–79, Berlin, Germany, August 12 2016. c 2016 Association for Computational Linguistics

Link:

http://aclweb.org/anthology/W16-3211

Details/Description:

Sandro Pezzelle, Ravi Shekhar, Raffaella Bernardi Building a Bagpipe with a Bag and a Pipe: Exploring Conceptual Combination in Vision Proceedings of the 5th Workshop on Vision and Language, pages 60–64, Berlin, Germany, August 12 2016. c 2016 Association for Computational Linguistics

Link:

http://aclweb.org/anthology/W16-3208

Details/Description:

Sandro Pezzelle, Marco Marelli, Raffaella Bernardi Be Precise or Fuzzy: Learning the Meaning of Cardinals and Quantifiers from Vision Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 337–342, Valencia, Spain, April 3-7, 2017. c 2017 Association for Computational Linguistics

Link:

http://aclanthology.info/papers/E17-2054/be-precise-or-fuzzy-learning-the-meaning-of-cardinals-and-quantifiers-from-vision

Details/Description:

Ravi Shekhar, Sandro Pezzelle, Yauhen Klimovich, Aurelie Herbelot, Moin Nabi, Enver Sangineto, Raffaella Bernardi FOIL it! Find One mismatch between Image and Language caption Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pages 255–265 Vancouver, Canada, July 30 - August 4, 2017. c 2017 Association for Computational Linguistics

Link:

http://aclweb.org/anthology/P17-1024

Details/Description:

G. Collell and S. Moens, “Is an image worth more than a thousand words? on the fine-grain semantic differences between visual and linguistic representations,” in COLING, ACL, 2016

Link:

https://www.aclweb.org/anthology/C/C16/C16-1264.pdf

Details/Description:

G. Collell, T. Zhang, and M.-F. Moens, “Imagined visual representations as multimodal embeddings,” In AAAI Conference on Artificial Intelligence, AAAI, 2017.

Link:

http://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14811

Details/Description:

L Specia, S Frank, K Sima’an and D Elliott. A Shared Task on Multimodal Machine Translation and Crosslingual Image Description. In First Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 540550, WMT, Berlin, Germany. 2016

Link:

http://www.aclweb.org/anthology/W/W16/W16-2346

Details/Description:

SHEF-Multimodal: Grounding Machine Translation on Images. K Shah, J Wang, and L Specia. First Conference on Machine Translation, Berlin, Germany, pp. 657-662. 2016

Link:

http://www.aclweb.org/anthology/W/W16/W16-2363

Details/Description:

Multi30K: Multilingual English-German Image Descriptions. D Elliott, S Frank, K Sima’an and L Specia. Workshop on Vision and Language, Berlin, Germany, pp. 70-74. 2016

Link:

http://aclweb.org/anthology/W16-3210

Details/Description:

Cinar, Y.G., Zoghbi, S. & Moens, M.-F. (2015). Inferring User Interests on Social Media from Text and Images. In Proceedings of SoMeRA 2015: 2nd International Workshop on Social Media Retrieval and Analysis at ICDM 2015.

Link:

https://lirias.kuleuven.be/bitstream/123456789/510686/1/CinaretalSoMeRA2015.pdf

Details/Description:

Zoghbi, S. & Moens, M.-F. (2016). Cross-modal Fashion Search. In Proceedings of the 22nd International Conference on MultiMedia Modelling (Lecture Notes in Computer Science 9517) (pp. 367-373).

Link:

https://lirias.kuleuven.be/handle/123456789/510704

Details/Description:

Nurani Venkitasubramanian, A., Tuytelaars, T. & Moens, M.-F. (2016). Wildlife Recognition in Nature Documentaries with Weak Supervision from Subtitles and External Data. Pattern Recognition Letters. doi:10.1016/j.patrec.2016.01.025

Link:

https://lirias.kuleuven.be/handle/123456789/532207

Details/Description:

Fashion Meets Computer Vision and NLP and E-Commerce Search.Susana Zoghbi, Geert Heyman, Juan Carlos Gomez, Marie-Francine Moens International Journal of Computer and Electrical Engineering (IJCEE), 8(1), 31-43

Link:

http://people.cs.kuleuven.be/~susana.zoghbi/myPublications/IJCEE_Final.pdf

Details/Description:

Ivan Huerta, Marco Pedersoli, Jordi Gonzàlez and Albert Sanfeliu, "Combining where and what in change detection for unsupervised foreground learning in surveillance", Pattern Recognition, Volume 48, Issue 3, Pages 709–719, 2015

Link:

http://iselab.cvc.uab.es/files/Publications/2014/PDF/HPS2014.pdf

Details/Description:

Xavier Perez-Sala, Sergio Escalera, Cecilio Angulo and Jordi Gonzàlez, "A Survey on Model Based Approaches for 2D and 3D Visual Human Pose Recovery", Sensors 14(3), pp 4189-4210, 2014

Link:

http://www.mdpi.com/1424-8220/14/3/4189/htm

Details/Description:

Sergio Escalera, Jordi Gonzàlez, Xavier Baró, Pablo Pardo, Junior Fabian, Marc Oliu, Hugo Jair Escalante, Ivan Huerta, Isabelle Guyon, "ChaLearn Looking at People 2015 new competitions: Age Estimation and Cultural Event Recognition", International Joint Conference on Neural Networks, 2015

Link:

http://www.maia.ub.es/~sergio/linked/ijcnn_age_and_cultural_2015.pdf

Details/Description:

C. Crispim-Junior and F. Bremond. Uncertainty Modeling Framework for Constraint-based Elementary Scenario Detection in Vision System. In the First International Workshop on Computer vision + ONTology Applied Cross-disciplinary Technologies in conjunction with ECCV 2014, CONTACT-2014, Zurich, Switzerland, September 7th, 2014.

Link:

http://www-sop.inria.fr/members/Francois.Bremond/Postscript/carlos_contact2014.pdf

Details/Description:

A. König, C. Crispim, A. Derreumaux, G. Bensadoum, P.D. Petit, F. Bremond, R. David, F. Verhey, P. Aalten and P.H. Robert. Validation of an Automatic Video Monitoring System for the Detection of Instrumental Activities of Daily Living in Dementia Patients, - Journal of Alzheimer Disease, - 44 (2015) pp. 675~685, IOS Press, DOI 10.3233/JAD-141767, 2015.

Link:

http://www-sop.inria.fr/members/Francois.Bremond/Postscript/JAD-Alexandra2015.pdf

Details/Description:

S. Elloumi, S. Cosar, G. Pusiol, F. Bremond and M. Thonnat. Unsupervised Discovery of Human Activities from Long-Videos, IET Computer Vision, CVI-2014-0311.R1, 2014

Link:

http://www-sop.inria.fr/members/Francois.Bremond/Postscript/actdis_cv.pdf

Details/Description:

Polina Kuznetsova, Vicente Ordonez, Alexander Berg, Tamara Berg and Yejin Choi (2013). Generalizing Image Captions for Image-Text Parallel Corpus. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL'13).

Link:

http://www.tamaraberg.com/papers/acl13_generalization.pdf

Details/Description:

Micah Hodosh, Peter Young, Cyrus Rashtchian and Julia Hockenmaier (2010). Cross-Caption Coreference Resolution for Automatic Image Understanding. Proceedings of the 14th Conference on Natural Language Learning (CoNLL'10).

Link:

http://nlp.cs.illinois.edu/HockenmaierGroup/Papers/CoNLL2010/W10-2920.pdf