Details/Description:
Pastra, K., & Wilks, Y. (2004). Vision-Language Integration in AI: a reality check. In Proceedings of the 16th European Conference on Artificial Intelligence (Vol. 16, pp. 937–941).
Link:
Details/Description:
Pastra, K. (2004). Viewing Vision-Language Integration as a Double-Grounding Case. In Proceedings of the AAAI Fall Symposium on "Achieving Human-Level Intelligence through Integrated Systems and Research" (pp. 62–67).
Link:
Details/Description:
Pastra, K. (2006). Image-Language Association: are we looking at the right features? In OntoImage Workshop on Language Resources for Content-based Image Retrieval, Language Resources and Evaluation Conference (LREC).
Link:
Details/Description:
Pastra K., E. Balta, P. Dimitrakis, G. Karakatsiotis (2011), Embodied Language Processing: A New Generation of Language Technology, in AAAI workshop on 'Language-Action Tools for Cognitive Artificial Agents'.
Link:
Details/Description:
Pastra K. and Y. Aloimonos (2012), The minimalist grammar of action, Philosophical Transactions of the Royal Society of London B: Biological Sciences, vol. 367, pp. 103-117.
Link:
Details/Description:
Vatakis A. and K. Pastra (2016), A multimodal dataset of spontaneous speech and movement production on object affordances, Data Science Journal, Nature Publishing.
Link:
Details/Description:
Desmond Elliott, Stella Frank, Khalil Sima'an, Lucia Specia. Multi30K: Multilingual English-German Image Descriptions. 5th Workshop on Vision and Language, pages 70–74, Berlin, Germany, 2016.
Link:
Details/Description:
Specia, L., Frank, S., Sima'an, K., Elliott, D. A Shared Task on Multimodal Machine Translation and Crosslingual Image Description. In First Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 540–550, WMT, Berlin, Germany, 2016.
Link:
Details/Description:
Elliott, D., Frank, S., Barrault, L., Bougares, F., Specia, L. Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description. In 2nd Conference on Machine Translation, WMT, pp. 215-233, Copenhagen, Denmark, 2017.
Link:
Details/Description:
Madhyastha, P.S., Wang, J., Specia, L. Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation. In 2nd Conference on Machine Translation, WMT, pp. 470-476, Copenhagen, Denmark, 2017.
Link:
Details/Description:
Lala, C., Madhyastha, P., Wang, J., Specia, L. Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation, Prague Bull. Math. Linguistics, vol 108, pp. 197-208, 2017.
Link:
Details/Description:
Madhyastha, P., Wang, J., Specia, L. (2018). The role of image representations in vision to language tasks. Natural Language Engineering, Cambridge University Press, pp. 1-25.
Link:
Details/Description:
K. Laenen, S. Zoghbi, and M.-F. Moens. 2018. Web Search of Fashion Items with Multimodal Querying. In Proceedings of WSDM 2018: The Eleventh ACM International Conference on Web Search and Data Mining.
Link:
Details/Description:
Laenen, K., Zoghbi, S., & Moens, M-F. (2017). Cross-modal search for fashion attributes. In Proceedings of the KDD Workshop on Machine Learning Meets Fashion.
Link:
https://kddfashion2017.mybluemix.net/final_submissions/ML4Fashion_paper_7.pdf
Details/Description:
Learning Representations Specialized in Spatial Knowledge: Leveraging Language and Vision
Link:
https://transacl.org/ojs/index.php/tacl/article/view/1214/288
Details/Description:
Adapting a decision Tree based Tagger for Arabic. Zeroual, I., & Lakhouaja, A. The 2nd International Conference on Information Technology for Organizations Development, March 30 - April 1st, 2016, Fez, Morocco.
Link:
Details/Description:
Application of Arabic language processing in language learning. El Kah, A., Zeroual, I., & Lakhouaja, A. The 2nd International Conference on Big Data, Cloud and Applications, March 01-03, 2017, Tetuan, Morocco.
Link:
Details/Description:
Developing and performance evaluation of a new Arabic heavy/light stemmer. Zeroual, I., Boudchiche, M., Mazroui, A., & Lakhouaja, A. The 2nd International Conference on Big Data, Cloud and Applications, March 29-30, 2017, Tetuan, Morocco.
Link:
Details/Description:
Arabic Information Retrieval: Stemming or Lemmatization? Zeroual, I., & Lakhouaja, A. The 2nd International Conference on Intelligent Systems and Computer Vision, April 17-19, 2017, Fez, Morocco.
Link:
Details/Description:
Towards a standard part of speech tagset for the Arabic language. Zeroual, I., Lakhouaja, A., and Belahbib R. Journal of King Saud University – Computer and Information Sciences, 2017
Link:
http://www.sciencedirect.com/science/article/pii/S1319157817300265
Details/Description:
Gamification for Arabic Natural Language Processing: Ideas into Practice. Zeroual, I, El Kah A. and Lakhouaja A. Transactions on Machine Learning and Artificial Intelligence 5.4 (2017)
Link:
http://scholarpublishing.org/index.php/TMLAI/article/view/3323/
Details/Description:
Feature-rich PoS Tagging through Taggers Combination: Experience in Arabic. Zeroual, I, and Lakhouaja A. Transactions on Machine Learning and Artificial Intelligence 5.4 (2017)
Link:
http://scholarpublishing.org/index.php/TMLAI/article/view/2981
Details/Description:
Arabic Corpus Linguistics: Major Progress, but Still a Long Way to Go. Zeroual I., Lakhouaja A. In: Shaalan K., Hassanien A., Tolba F. (eds) Intelligent Natural Language Processing: Trends and Applications. Studies in Computational Intelligence, vol 740. Springer, Cham. 2018
Link:
https://link.springer.com/chapter/10.1007/978-3-319-67056-0_29
Details/Description:
Hybrid Focused Crawling on the Surface and the Dark Web. C. Iliou, G. Kalpakis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. EURASIP Journal on Information Security, vol. 2017, no. 11, 2017.
Link:
Details/Description:
Multimedia retrieval based on non-linear graph-based fusion and partial least squares regression. I. Gialampoukidis, A. Moumtzidou, D. Liparas, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. Multimedia Tools and Applications, 2017.
Link:
Details/Description:
Gaze Movement-driven Random Forests for Query Clustering in Automatic Video Annotation. S. Vrochidis, I. Patras and I. Kompatsiaris. Multimedia Tools and Applications, 2016.
Link:
Details/Description:
Interactive Video Search Tools: A Detailed Analysis of the Video Browser Showdown 2015. C. Cobarzan, K. Schoeffmann, W. Bailer, W. Hurst, A. Blazek, J. Lokoc, S. Vrochidis, K. U. Barthel, and L. Rossetto. In Multimedia Tools and Applications (MTAP), 2016, pp. 1-33.
Link:
Details/Description:
Focussed Crawling of Environmental Web Resources Based on the Combination of Multimedia Evidence. T. Tsikrika, A. Moumtzidou, S. Vrochidis and I. Kompatsiaris. Multimedia Tools and Applications, May 2015, pp 1-25.
Link:
Details/Description:
Environmental data extraction from heatmaps using the AirMerge system. V. Epitropou, T. Bassoukos, K. Karatzas, A. Karppinen, L. Wanner, S. Vrochidis, I. Kompatsiaris, J. Kukkonen. Multimedia Tools and Applications, May 2015, pp. 1-25.
Link:
Details/Description:
Ontology-centered environmental information delivery for personalized decision support. L. Wanner, M. Rospocher, S. Vrochidis, L. Johansson, N. Bouayad-Aghae, G. Casamayor, A. Karppinen, I. Kompatsiaris, S. Millee, A. Moumtzidou, L. Serafini. Expert Systems With Applications, Volume 42, Issue 12, 15 July 2015, Pages 5032–5046.
Link:
https://www.sciencedirect.com/science/article/pii/S0957417415001554
Details/Description:
Fusion of meteorological and air quality data extracted from the web for personalized environmental information services. L. Johansson, V. Epitropou, K. Karatzas, A. Karppinen, L. Wanner, S. Vrochidis, A. Bassoukos, J. Kukkonen, I. Kompatsiaris. Environmental Modeling and Software Journal, Volume 64, February 2015, pp. 143–155.
Link:
https://www.sciencedirect.com/science/article/pii/S1364815214003478
Details/Description:
A Model for Environmental Data Extraction from Multimedia and its Evaluation against various Chemical Weather Forecasting Datasets. A. Moumtzidou, V. Epitropou, S. Vrochidis, K. Karatzas, S. Voth, A. Bassoukos, J. Moßgraber, A. Karppinen, J. Kukkonen and I. Kompatsiaris. Journal of Ecological Informatics, pp. 69-82, 2014, special issue, ISSN 1574-9541.
Link:
https://www.sciencedirect.com/science/article/pii/S1574954113000745
Details/Description:
OSINT and the Dark Web. G. Kalpakis, T. Tsikrika, N. Cunningham, C. Iliou, S. Vrochidis, J. Middleton, I. Kompatsiaris. In “Open Source Intelligence Investigation – From Strategy to Implementation”, B. Akhgar, P. S. Bayerl, F. Sampson (Eds.), Springer, 2016.
Link:
https://link.springer.com/chapter/10.1007%2F978-3-319-47671-1_8
Details/Description:
Enhancing Patent Search with Content-based Image Retrieval. S. Vrochidis, A. Moumtzidou, I. Kompatsiaris. Professional Search in the Modern World, Lecture Notes in Computer Science Volume 8830, 2014, pp 250-273.
Link:
https://link.springer.com/content/pdf/10.1007%2F978-3-319-12511-4_12.pdf
Details/Description:
Description Logics and Rules for Multimodal Situational Awareness in Healthcare. G. Meditskos, S. Vrochidis, I. Kompatsiaris. Special session on Multimedia and Multimodal Interaction for Health and Basic Care Applications at MMM 2017, 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-51811-4_58
Details/Description:
VERGE IN VBS 2017. A. Moumtzidou, T. Mironidis, F. Markatopoulou, S. Andreadis, I. Gialampoukidis, D. Galanopoulos, A. Ioannidou, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. Video Browser Showdown (VBS’17) at the 23rd Int. Conf. on MultiMedia Modeling (MMM’17), Reykjavik, Iceland, 4 January 2017.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-51814-5_46
Details/Description:
Ontology-Driven Context Interpretation and Conflict Resolution in Dialogue-Based Home Care Assistance. Meditskos, G., Kontopoulos, E., Vrochidis, S., & Kompatsiaris, I. In: Paschke, A., Burger, A., Splendiani, A., Marshall, M.S., and Romano, P. (eds.) 9th Int. Conf. on Semantic Web Applications and Tools for Life Sciences – SWAT4LS. CEUR Workshop Proceedings Vol 1795, Amsterdam, The Netherlands (2016).
Link:
Details/Description:
ITI-CERTH participation in TRECVID 2016. F. Markatopoulou, A. Moumtzidou, D. Galanopoulos, T. Mironidis, V. Kaltsa, A. Ioannidou, S. Symeonidis, K. Avgerinakis, S. Andreadis, I. Gialampoukidis, S. Vrochidis, A. Briassouli, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. TRECVID 2016 Workshop, Gaithersburg, MD, USA, Nov. 2016.
Link:
http://www-nlpir.nist.gov/projects/tvpubs/tv16.papers/iti-certh.pdf
Details/Description:
Incremental estimation of visual vocabulary size for image retrieval. I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. In INNS Conference on Big Data, pp. 29-38. Springer International Publishing, 2016.
Link:
https://link.springer.com/content/pdf/10.1007/978-3-319-47898-2_4.pdf
Details/Description:
Community Detection in Complex Networks Based on DBSCAN* and a Martingale Process. I. Gialampoukidis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. Semantic and Social Media Adaptation and Personalization (SMAP), 2016 11th International Workshop on, pp. 1-6. IEEE, 2016.
Link:
Details/Description:
Towards Air Quality Estimation Using Collected Multimodal Environmental Data. A. Moumtzidou, S. Papadopoulos, S. Vrochidis, I. Kompatsiaris, K. Kourtidis, G. Hloupis, I. Stavrakas, K. Papachristopoulou, and C. Keratidis. 1st International Workshop on Internet and Social media for Environmental Monitoring (In conjunction with the 3rd international conference on Internet Science (INSCI 2016)), Florence, Italy, 12 September 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-50237-3_7
Details/Description:
Semantic integration of web data for international investment decision support. B. Simeonov, V. Alexiev, D. Liparas, M. Puigbo, S. Vrochidis, E. Jamin and I. Kompatsiaris. 3rd international conference on Internet Science, Florence, Italy, Sept. 12-14 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-45982-0_18
Details/Description:
Question Answering over Pattern-Based User Models. G. Meditskos, S. Dasiopoulou, S. Vrochidis, L. Wanner, I. Kompatsiaris. In Proceedings of the 12th International Conference on Semantic Systems (SEMANTiCS 2016), pp. 153-160. ACM, New York, NY, USA, 2016.
Link:
Details/Description:
Hybrid Focused Crawling for Homemade Explosives Discovery on Surface and Dark Web. C. Iliou, G. Kalpakis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. 11th International Conference on Availability, Reliability and Security (ARES 2016), Salzburg, Austria, Aug 2016.
Link:
Details/Description:
Query-based Topic Detection Using Concepts and Named Entities. I. Gialampoukidis, D. Liparas, S. Vrochidis, I. Kompatsiaris. 1st International Workshop on Multimodal Media Data Analytics (MMDA 2016), The Hague, Netherlands, August 30, 2016.
Link:
https://pdfs.semanticscholar.org/b811/747fa6f82e2f878e950fd16cda15e0358af6.pdf
Details/Description:
Key player identification in terrorism-related social media networks using centrality measures. I. Gialampoukidis, G. Kalpakis, T. Tsikrika, S. Vrochidis, I. Kompatsiaris. In Intelligence and Security Informatics Conference (EISIC), 2016 European, pp. 112-115. IEEE, 2016.
Link:
Details/Description:
Interactive Discovery and Retrieval of Web Resources Containing Home Made Explosive Recipes. G. Kalpakis, T. Tsikrika, C. Iliou, T. Mironidis, S. Vrochidis, J. Middleton, U. Williamson, I. Kompatsiaris. 4th International Conference on Human Aspects of Information Security, Privacy and Trust, Toronto, Canada, 17 – 22 July 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-39381-0_20
Details/Description:
A Hybrid framework for news clustering based on the DBSCAN-Martingale and LDA. I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. Machine Learning and Data Mining in Pattern Recognition, pp. 170-184, Springer International Publishing, 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-41920-6_13
Details/Description:
A hybrid graph-based and non-linear late fusion approach for multimedia retrieval. I. Gialampoukidis, A. Moumtzidou, D. Liparas, S. Vrochidis, I. Kompatsiaris. In Content-Based Multimedia Indexing (CBMI), 2016 14th International Workshop on, pp. 1-6, IEEE, 2016.
Link:
Details/Description:
A Multimedia Interactive Search Engine based on Graph-based and Non-linear Multimodal Fusion. A. Moumtzidou, I. Gialampoukidis, T. Mironidis, D. Liparas, S. Vrochidis, I. Kompatsiaris. In Content-Based Multimedia Indexing (CBMI), 2016 14th International Workshop on, IEEE, 2016.
Link:
Details/Description:
Retrieval of Multimedia objects by Fusing Multiple Modalities. I. Gialampoukidis, A. Moumtzidou, T. Tsikrika, S. Vrochidis and I. Kompatsiaris. In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, pp. 359-362, ACM.
Link:
Details/Description:
Towards a Multimedia Knowledge-Based Agent with Social Competence and Human Interaction Capabilities. L. Wanner, J. Blat, S. Dasiopoulou, M. Domínguez, G. Llorach, S. Mille, F. Sukno, E. Kamateri, S. Vrochidis, I. Kompatsiaris, E. André, F. Lingenfelser, G. Mehlmann, A. Stam, L. Stellingwerff, B. Vieru, L. Lamel, W. Minker, L. Pragst, Stefan Ultes. 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction (MARMI 2016), New York, USA, June 6, 2016.
Link:
Details/Description:
Towards an Ontology-driven Adaptive Dialogue Framework. G. Meditskos, S. Dasiopoulou, Louisa Pragst, S. Ultes, S. Vrochidis, I. Kompatsiaris, L. Wanner. 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction (MARMI 2016), New York, USA, June 6, 2016.
Link:
Details/Description:
A Multimodal Annotation Schema for Non-Verbal Affective Analysis in the Health-Care Domain. F. Sukno, M. Dominguez, A. Ruiz Ovejero, D. Schiller, F. Lingenfelser, L. Pragst, E. Kamateri, S. Vrochidis. 1st International Workshop on Multimedia Analysis and Retrieval for Multimodal Interaction (MARMI 2016), New York, USA, June 6, 2016.
Link:
Details/Description:
VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval. A. Moumtzidou, T. Mironidis, E. Apostolidis, F. Markatopoulou, A. Ioannidou, I. Gialampoukidis, K. Avgerinakis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. Video Browser Showdown (VBS’16) at the 22nd Int. Conf. on MultiMedia Modeling (MMM’16), Miami, USA, 4 January 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-27674-8_39
Details/Description:
Fast Visual Vocabulary Construction for Image Retrieval using Skewed-Split k-d trees. I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. In International Conference on Multimedia Modeling, pp. 466-477. Springer International Publishing, 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-27671-7_39
Details/Description:
ITI-CERTH participation to TRECVID 2015. F. Markatopoulou, A. Ioannidou, C. Tzelepis, T. Mironidis, D. Galanopoulos, S. Arestis-Chartampilas, N. Pittaras, K. Avgerinakis, N. Gkalelis, A. Moumtzidou, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. TRECVID 2015 Workshop, Gaithersburg, MD, USA, Nov. 2015.
Link:
http://www-nlpir.nist.gov/projects/tvpubs/tv15.papers/iti-certh.pdf
Details/Description:
Exploiting visual similarities for ontology alignment. C. Doulaverakis, S. Vrochidis, I. Kompatsiaris. 7th International Conference on Knowledge Engineering and Ontology Development (KEOD 2015), Lisbon, Portugal, 12-14 November, 2015.
Link:
https://www.multisensorproject.eu/wp-content/uploads/2016/11/Doulaverakis_KEOD2015_camera_ready.pdf
Details/Description:
Classification using various ML Methods and Combinations of Key-Phrases and Visual Features. Y. Hacohen-Kerner, A. Sabag, D. Liparas, A. Moumtzidou, S. Vrochidis and I. Kompatsiaris. 1st KEYSTONE Conference (IKC2015), Coimbra, Portugal, September 8-9, 2015.
Link:
Details/Description:
A Framework for the Discovery, Analysis, and Retrieval of Multimedia Homemade Explosives Information on the Web. T. Tsikrika, G. Kalpakis, S. Vrochidis, I. Kompatsiaris, I. Paraskakis, I. Kavasidis, J. Middleton, and U. Williamson. In Proceedings of the International Workshop on Multimedia Forensics and Security (MFSec 2015), held in conjunction with the 10th International Conference on Availability, Reliability and Security, Toulouse, France, 2015.
Link:
Details/Description:
Concept Detection on Multimedia Web Resources about Home Made Explosives. G. Kalpakis, T. Tsikrika, F. Markatopoulou, N. Pittaras, S. Vrochidis, V. Mezaris, I. Patras, and I. Kompatsiaris. In Proceedings of the International Workshop on Multimedia Forensics and Security (MFSec 2015), held in conjunction with the 10th International Conference on Availability, Reliability and Security, Toulouse, France, 2015.
Link:
Details/Description:
MULTISENSOR: Development of Multimedia Content Integration Technologies for Journalism, Media Monitoring and International Exporting Decision Support. S. Vrochidis, I. Kompatsiaris, G. Casamayor, I. Arapakis, R. Busch, V. Alexiev, E. Jamin, M. Jugov, N. Heise, T. Forrellat, D. Liparas, L. Wanner, I. Miliaraki, V. Aleksic, K. Simov, A. M. Soro, M. Eckhoff, T. Wagner, M. Puigbó. 2015 IEEE International Conference on Multimedia and Expo (ICME 2015), Turin, Italy, June 29 – July 3, 2015.
Link:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7169818
Details/Description:
Discovery of Environmental Web Resources Based on the Combination of Multimedia Evidence. T. Tsikrika, A. Latas, A. Moumtzidou, E. Chatzilari, S. Vrochidis and I. Kompatsiaris. 2nd International Workshop on Environmental Multimedia Retrieval (EMR 2015), Shanghai, China, June 23, 2015.
Link:
Details/Description:
A Unified Model for Socially Interconnected Multimedia-Enriched Objects. T. Tsikrika, K. Andreadou, A. Moumtzidou, E. Schinas, S. Papadopoulos, S. Vrochidis, Y. Kompatsiaris. 21st MultiMedia Modelling Conference (MMM2015), Sydney, Australia, 5-7 January, 2015.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-14445-0_32
Details/Description:
VERGE: A Multimodal Interactive Video Search Engine. A. Moumtzidou, K. Avgerinakis, E. Apostolidis, F. Markatopoulou, K. Apostolidis, T. Mironidis, S. Vrochidis, V. Mezaris, Y. Kompatsiaris, I. Patras. Proc. 21st Int. Conf. on MultiMedia Modeling (MMM15), Sydney, Australia, Jan. 2015.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-14442-9_23
Details/Description:
News articles classification using Random Forests and weighted multimodal features. D. Liparas, Y. Hacohen-Kerner, A. Moumtzidou, S. Vrochidis and I. Kompatsiaris. Proceedings of the 3rd Open Interdisciplinary MUMIA Conference and 7th Information Retrieval Facility Conference (IRFC2014), 10 – 12 November 2014, Copenhagen, Denmark, LNCS 8849, pp. 63-75, Berlin: Springer-Verlag, 2014.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-12979-2_6
Details/Description:
ITI-CERTH participation to TRECVID 2014. N. Gkalelis, F. Markatopoulou, A. Moumtzidou, D. Galanopoulos, K. Avgerinakis, N. Pittaras, S. Vrochidis, V. Mezaris, I. Kompatsiaris, I. Patras. Proc. TRECVID 2014 Workshop, Orlando, FL, USA, November 2014.
Link:
Details/Description:
Concept-oriented labelling of patent images based on Random Forests and proximity-driven generation of synthetic data. D. Liparas, A. Moumtzidou, S. Vrochidis, I. Kompatsiaris. COLING’14 Workshop on Vision and Language (VL’14), Dublin, August 23, 2014.
Link:
Details/Description:
Key-phrase Extraction using Textual and Visual Features. Y. HaCohen-Kerner, S. Vrochidis, D. Liparas, A. Moumtzidou and I. Kompatsiaris. 3rd Workshop on Vision and Language (VL), Dublin, Ireland, August 23-29, 2014.
Link:
https://www.multisensorproject.eu/wp-content/uploads/2016/11/W14-5421.pdf
Details/Description:
Detection of Terrorism-related Twitter Communities using Centrality Scores. I. Gialampoukidis, G. Kalpakis, T. Tsikrika, S. Papadopoulos, S. Vrochidis, I. Kompatsiaris. In Proceedings of International Workshop on Multimedia Forensics and Security (MFSec 2017), Bucharest, Romania, June 06, 2017 (accepted for publication).
Link:
Details/Description:
Intelligent traffic city management from surveillance systems (CERTH-ITI). K. Avgerinakis, P. Giannakeris, A. Briassouli, A. Karakostas, S. Vrochidis, I. Kompatsiaris. NVIDIA AI city challenge, IEEE Smart World, Aug. 2017, USA.
Link:
http://smart-city-sjsu.net/AICityChallenge/papers/NVIDIA_AI_City_Challenge_2017_paper_2.pdf
Details/Description:
Visual and textual analysis of social media and satellite images for flood detection. K. Avgerinakis, A. Moumtzidou, S. Andreadis, E. Michail, I. Gialampoukidis, S. Vrochidis, I. Kompatsiaris. Multimedia Satellite task of MediaEval 2017.
Link:
Details/Description:
Crater monitoring through social media observations. I. Gialampoukidis, S. Vrochidis and I. Kompatsiaris. In European Planetary Science Congress 2017, 17–22 September 2017.
Link:
http://meetingorganizer.copernicus.org/EPSC2017/EPSC2017-25-1.pdf
Details/Description:
LBP-flow and hybrid encoding for real-time water and fire classification. K. Avgerinakis, P. Giannakeris, A. Briassouli, A. Karakostas, S. Vrochidis, I. Kompatsiaris. ICCV-MSF 2017, IEEE/ISPRS 4th Joint Workshop on Multi-Sensor Fusion for Dynamic Scene Understanding.
Link:
Details/Description:
Unsupervised Keyword Extraction Using the GoW Model and Centrality Scores. E. Batziou, I. Gialampoukidis, S. Vrochidis, I. Antoniou, I. Kompatsiaris. In International Conference on Internet Science, pp. 344-351. Springer, Cham, 2017.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-70284-1_26
Details/Description:
A long short-term memory based Schaeffer gesture recognition system. SO Oprea, A Garcia‐Garcia, S Orts‐Escolano, V Villena‐Martinez, JA Castro‐Vargas. Expert Systems, online. DOI: 10.1111/exsy.12247
Link:
Details/Description:
Automatic Schaeffer's gestures recognition system. Francisco Gomez‐Donoso, Miguel Cazorla, Alberto Garcia‐Garcia, Jose Garcia‐Rodriguez. Expert Systems, Volume 33, Issue 5, October 2016, Pages 480–488.
Link:
Details/Description:
A Markov Network Based Passage Retrieval Method for Multimodal Question Answering in the Cultural Heritage Domain. Shurong Sheng, Aparna Nurani Venkitasubramanian and Marie-Francine Moens. In Proceedings of the 24th International Conference on Multimedia Modeling (MMM2018), Bangkok, Thailand, 2018. Lecture Notes in Computer Science
Link:
https://link.springer.com/chapter/10.1007/978-3-319-73603-7_1
Details/Description:
Collell, G., Van Gool, L., & Moens, M. F. (2018) Acquiring common sense spatial knowledge through implicit spatial templates. AAAI Conference on Artificial Intelligence. AAAI
Link:
Details/Description:
Video Description using Bidirectional Recurrent Neural Networks. Álvaro Peris, Marc Bolaños, Petia Radeva, Francisco Casacuberta. 25th International Conference on Artificial Neural Networks (ICANN), Lecture Notes in Computer Science, 9887:3-11, 2016.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-44781-0_1
Details/Description:
VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering. Marc Bolaños, Álvaro Peris, Francisco Casacuberta, Petia Radeva. IbPRIA: 8th Iberian Conference on Pattern Recognition and Image Analysis, Lecture Notes in Computer Science (LNCS), 10255:372-380, 2017.
Link:
https://link.springer.com/chapter/10.1007/978-3-319-58838-4_41
Details/Description:
Egocentric video description based on temporally-linked sequences. M Bolaños, Á Peris, F Casacuberta, S Soler, P Radeva. Journal of Visual Communication and Image Representation, 50:205-216, 2018.
Link:
https://www.sciencedirect.com/science/article/pii/S1047320317302316
Details/Description:
Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A Farrugia, Claudia Borg, Kenneth Camilleri, Mike Rosner and Lonneke van der Plas (2018) Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions. To be published in Proceedings of LREC 2018.
Link:
http://lrec2018.lrec-conf.org/en/conference-programme/accepted-papers/
Details/Description:
Patrizia Paggio, Costanza Navarretta and Bart Jongejan (2017) Automatic identification of head movements in video-recorded conversations: can words help? In Proceedings of the 6th Workshop on Vision and Language, pp. 40-42. The Association for Computational Linguistics.
Link:
Details/Description:
Huu Ton Le, Thierry Urruty, Syntyche Gbèhounou, François Lecellier, Jean Martinet, Christine Fernandez-Maloigne: Improving retrieval framework using information gain models. Signal, Image and Video Processing 11(2): 309-316 (2017)
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2017/le17improving.pdf
Details/Description:
Amel Aissaoui, Afifa Dahmane, Jean Martinet, Ioan Marius Bilasco: Introducing FoxFaces: A 3-in-1 Head Dataset. VISIGRAPP (4: VISAPP) 2016: 533-537
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2016/aissaoui16introducing.pdf
Details/Description:
Jalila Filali, Hajer Baazaoui Zghal, Jean Martinet: Visually Supporting Image Annotation Based on Visual Features and Ontologies. IV 2017: 182-187
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2017/filali17visually.pdf
Details/Description:
Jalila Filali, Hajer Baazaoui Zghal, Jean Martinet: Towards Visual Vocabulary and Ontology-based Image Retrieval System. ICAART (2) 2016: 560-565
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2016/filali16towards.pdf
Details/Description:
Rémi Auguste, Jean Martinet, Pierre Tirilly: Space-time Histograms And Their Application To Person Re-identification In TV Shows. ICMR 2015: 91-97
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2015/auguste15space.pdf
Details/Description:
Thierry Urruty, Syntyche Gbèhounou, Huu Ton Le, Jean Martinet, Christine Fernandez-Maloigne: Iterative Random Visual Word Selection. ICMR 2014: 249
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2014/urruty14iterative.pdf
Details/Description:
Jean Martinet: From Text Vocabularies to Visual Vocabularies - What Basis? VISAPP (2) 2014: 668-675
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2014/martinet14from.pdf
Details/Description:
Meriem Bendris, Benoît Favre, Delphine Charlet, Géraldine Damnati, Grégory Senay, Rémi Auguste, Jean Martinet: Unsupervised face identification in TV content using audio-visual sources. CBMI 2013: 243-249
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/2013/bendriss13unsupervised
Details/Description:
Ismail Elsayad, Jean Martinet, Thierry Urruty, Chabane Djeraba: Toward a higher-level visual representation for content-based image retrieval. Multimedia Tools Appl. 60(2): 455-482 (2012)
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/elsayad12toward.pdf
Details/Description:
Ismail Elsayad, Jean Martinet, Thierry Urruty, Chabane Djeraba: A new spatial weighting scheme for bag-of-visual-words. CBMI 2010: 1-6
Link:
http://www.cristal.univ-lille.fr/~martinej/papers/elsayad10new.pdf
Details/Description:
Sorodoc, I., Lazaridou, A., Boleda, G., Herbelot, A., Pezzelle, S., & Bernardi, R. (2016). “Look, some green circles!”: Learning to quantify from images. In Proceedings of the 5th Workshop on Vision and Language (VL'16), co-located with ACL 2016, Berlin, Germany, August 2016.
Link:
Details/Description:
Pezzelle, S., Marelli, M., & Bernardi, R. (2017). Be Precise or Fuzzy: Learning the Meaning of Cardinals and Quantifiers from Vision. EACL 2017, Valencia, Spain, April 2017.
Link:
Details/Description:
Pezzelle, S., Shekhar, R., & Bernardi, R. (2016). Building a bagpipe with a bag and a pipe: Exploring Conceptual Combination in Vision. In Proceedings of the 5th Workshop on Vision and Language (VL'16), co-located with ACL 2016, Berlin, Germany, August 2016.
Link:
Details/Description:
Sina Zarrieß, Julian Hough, Casey Kennington, Ramesh Manuvinakurike, David DeVault, Raquel Fernández, and David Schlangen. PentoRef: A Corpus of Spoken References in Task-oriented Dialogues. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), pp. 125-131, 2016.
Link:
http://www.lrec-conf.org/proceedings/lrec2016/pdf/563_Paper.pdf
Details/Description:
Angeliki Lazaridou, Grzegorz Chrupała, Raquel Fernández, and Marco Baroni. Multimodal Semantic Learning from Child-Directed Input. In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), pp. 387-392, 2016.
Link:
Details/Description:
Development of a common validation framework for the evaluation of polyp detection methods, including definition of new databases and performance metrics. Bernal, J., Tajbakhsh, N., Sánchez, F. J., Matuszewski, B. J., Chen, H., Yu, L., ... & Histace, A. (2017). Comparative validation of polyp detection methods in video colonoscopy: results from the MICCAI 2015 endoscopic vision challenge. IEEE Transactions on Medical Imaging, 36(6), 1231-1249.
Link:
Details/Description:
Setting up the first benchmark of polyp segmentation in colonoscopy images. Vázquez, D., Bernal, J., Sánchez, F. J., Fernández-Esparrach, G., López, A. M., Romero, A., ... & Courville, A. (2017). A Benchmark for Endoluminal Scene Segmentation of Colonoscopy Images. Journal of Healthcare Engineering, 2017.
Link:
Details/Description:
Creation of software to automatically detect specular highlights in images. Sánchez, F. J., Bernal, J., Sánchez-Montes, C., de Miguel, C. R., & Fernández-Esparrach, G. (2017). Bright spot regions segmentation and classification for specular highlights detection in colonoscopy videos. Machine Vision and Applications, 28(8), 917-936.
Link:
Details/Description:
A Distributed Representation Based Query Expansion Approach for Image Captioning, S. Yagcioglu, E. Erdem, A. Erdem and R. Cakici, The 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015), Beijing, China, July 2015
Link:
Details/Description:
Leveraging Captions in the Wild to Improve Object Detection, M. Kilickaya, N. Ikizler-Cinbis, E. Erdem and A. Erdem, The 5th Workshop on Vision and Language (VL'16) - in conjunction with ACL 2016, Berlin, Germany, August 2016
Link:
https://web.cs.hacettepe.edu.tr/~erkut/publications/acl-vl16.pdf
Details/Description:
Re-evaluating Automatic Metrics for Image Captioning, M. Kilickaya, A. Erdem, N. Ikizler-Cinbis and E. Erdem, The 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Valencia, Spain, April 2017
Link:
Details/Description:
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures, R. Bernardi, R. Cakici, D. Elliott, A. Erdem, E. Erdem, N. Ikizler-Cinbis, F. Keller, A. Muscat, B. Plank, Journal of Artificial Intelligence Research, 55, pp. 409-442, February 2016
Link:
Details/Description:
Kilickaya, M., Kerim Akkus, B., Cakici, R., Erdem, A., Erdem, E., and Ikizler-Cinbis, N. (2017). Data-driven image captioning via salient region discovery. IET Computer Vision, 11(6), pp. 398-406, September 2017.
Link:
https://web.cs.hacettepe.edu.tr/~erkut/publications/iet-cv2016.pdf
Details/Description:
Jia, X., Gavves, S., Fernando, B., Tuytelaars, T., (2015) Guiding Long-Short Term Memory for Image Caption Generation, International Conference on Computer Vision (ICCV), 2015.
Link:
http://homes.esat.kuleuven.be/~xjia/xjia_publications/xjia_iccv15_glstm.pdf
Details/Description:
Calixto, I., Stein, D., Matusov, E., Lohar, P., Castilho, S., and Way, A. (2017). Using images to improve machine-translating e-commerce product listings. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 637–643, Valencia, Spain.
Link:
Details/Description:
Calixto, I., Stein, D., Matusov, E., Castilho, S., and Way, A. (2017). Human evaluation of multi-modal neural machine translation: A case-study on e-commerce listing titles. In Proceedings of the Sixth Workshop on Vision and Language, pages 31–37, Valencia, Spain.
Link:
Details/Description:
Calixto, I. and Liu, Q. (2017). Sentence-Level Multilingual Multi-modal Embedding for Natural Language Processing. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 139-148, Varna, Bulgaria.
Link:
http://www.acl-bg.org/proceedings/2017/RANLP%202017/pdf/RANLP020.pdf
Details/Description:
Calixto, I. and Liu, Q. (2017) Incorporating Global Visual Features into Attention-based Neural Machine Translation. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 992-1003, Copenhagen, Denmark.
Link:
Details/Description:
Calixto, I., Liu, Q., and Campbell, N. (2017). Doubly-Attentive Decoder for Multi-modal Neural Machine Translation. In Proceedings of the 55th Conference of the Association for Computational Linguistics: Volume 1, Long Papers, Vancouver, Canada.
Link:
Details/Description:
Calixto, I., Elliott, D., and Frank, S. (2016). DCU-UvA Multimodal MT System Report. In Proceedings of the First Conference on Machine Translation, pages 634–638, Berlin, Germany.
Link:
Details/Description:
Calixto, I., de Campos, T., and Specia, L. (2012). Images as context in Statistical Machine Translation. In The 2nd Annual Meeting of the EPSRC Network on Vision & Language (VL’12), Sheffield, UK. EPSRC Vision and Language Network.
Link:
http://www.ee.surrey.ac.uk/CVSSP/Publications/papers/Calixto-VL-2012.pdf
Details/Description:
Emiel van Miltenburg, Roser Morante, and Desmond Elliott, “Pragmatic factors in image description: the case of negations,” in Proceedings of the 5th workshop on vision and language, 2016, pp. 54-59.
Link:
Details/Description:
Emiel van Miltenburg, Desmond Elliott, and Piek Vossen, “Cross-linguistic differences and similarities in image descriptions,” in Proceedings of the 10th international conference on natural language generation, Santiago de Compostela, Spain, 2017, pp. 21-30.
Link:
Details/Description:
Elliott, D., Frank, S., Barrault, L., Bougares, F., Specia, L. Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description. In 2nd Conference on Machine Translation, WMT, pp. 215-233, Copenhagen, Denmark, 2017.
Link:
Details/Description:
Madhyastha, P.S., Wang, J., Specia, L. Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation. In 2nd Conference on Machine Translation, WMT, pp. 470-476, Copenhagen, Denmark, 2017.
Link:
Details/Description:
Lala, C., Madhyastha, P., Wang, J., Specia, L. Unraveling the Contribution of Image Captioning and Neural Machine Translation for Multimodal Machine Translation, Prague Bull. Math. Linguistics, vol 108, pp. 197-208, 2017.
Link:
https://ufal.mff.cuni.cz/pbml/108/art-lala-madhyastha-wang-specia.pdf
Details/Description:
Dat Tien Nguyen, Angeliki Lazaridou, Raffaella Bernardi. Coloring Objects: Adjective-Noun Visual Semantic Compositionality. Proceedings of the Third Workshop on Vision and Language, Dublin City University and the Association for Computational Linguistics, pages 112–114.
Link:
Details/Description:
Dieu-Thu Le, Jasper Uijlings, Raffaella Bernardi. TUHOI: Trento Universal Human Object Interaction Dataset. Proceedings of the Third Workshop on Vision and Language, Dublin City University and the Association for Computational Linguistics, pages 17–24.
Link:
Details/Description:
Ionut Sorodoc, Angeliki Lazaridou, Gemma Boleda, Aurélie Herbelot, Sandro Pezzelle, and Raffaella Bernardi. “Look, some green circles!”: Learning to quantify from images. Proceedings of the 5th Workshop on Vision and Language, pages 75–79, Berlin, Germany, August 12, 2016. Association for Computational Linguistics.
Link:
Details/Description:
Sandro Pezzelle, Ravi Shekhar, Raffaella Bernardi. Building a Bagpipe with a Bag and a Pipe: Exploring Conceptual Combination in Vision. Proceedings of the 5th Workshop on Vision and Language, pages 60–64, Berlin, Germany, August 12, 2016. Association for Computational Linguistics.
Link:
Details/Description:
Sandro Pezzelle, Marco Marelli, Raffaella Bernardi. Be Precise or Fuzzy: Learning the Meaning of Cardinals and Quantifiers from Vision. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 337–342, Valencia, Spain, April 3-7, 2017. Association for Computational Linguistics.
Link:
Details/Description:
Ravi Shekhar, Sandro Pezzelle, Yauhen Klimovich, Aurelie Herbelot, Moin Nabi, Enver Sangineto, Raffaella Bernardi. FOIL it! Find One mismatch between Image and Language caption. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, pages 255–265, Vancouver, Canada, July 30 - August 4, 2017. Association for Computational Linguistics.
Link:
Details/Description:
G. Collell and M.-F. Moens, “Is an image worth more than a thousand words? On the fine-grain semantic differences between visual and linguistic representations,” in COLING, ACL, 2016.
Link:
Details/Description:
G. Collell, T. Zhang, and M.-F. Moens, “Imagined visual representations as multimodal embeddings,” In AAAI Conference on Artificial Intelligence, AAAI, 2017.
Link:
Details/Description:
L Specia, S Frank, K Sima’an and D Elliott. A Shared Task on Multimodal Machine Translation and Crosslingual Image Description. In First Conference on Machine Translation, Volume 2: Shared Task Papers, pp. 540–550, WMT, Berlin, Germany. 2016
Link:
Details/Description:
SHEF-Multimodal: Grounding Machine Translation on Images. K Shah, J Wang, and L Specia. First Conference on Machine Translation, Berlin, Germany, pp. 657-662. 2016
Link:
Details/Description:
Multi30K: Multilingual English-German Image Descriptions. D Elliott, S Frank, K Sima’an and L Specia. Workshop on Vision and Language, Berlin, Germany, pp. 70-74. 2016
Link:
Details/Description:
Cinar, Y.G., Zoghbi, S. & Moens, M.-F. (2015). Inferring User Interests on Social Media from Text and Images. In Proceedings of SoMeRA 2015: 2nd International Workshop on Social Media Retrieval and Analysis at ICDM 2015.
Link:
https://lirias.kuleuven.be/bitstream/123456789/510686/1/CinaretalSoMeRA2015.pdf
Details/Description:
Zoghbi, S. & Moens, M.-F. (2016). Cross-modal Fashion Search. In Proceedings of the 22nd International Conference on MultiMedia Modelling (Lecture Notes in Computer Science 9517) (pp. 367-373).
Link:
Details/Description:
Nurani Venkitasubramanian, A., Tuytelaars, T. & Moens, M.-F. (2016). Wildlife Recognition in Nature Documentaries with Weak Supervision from Subtitles and External Data. Pattern Recognition Letters. doi:10.1016/j.patrec.2016.01.025
Link:
Details/Description:
Fashion Meets Computer Vision and NLP and E-Commerce Search. Susana Zoghbi, Geert Heyman, Juan Carlos Gomez, Marie-Francine Moens. International Journal of Computer and Electrical Engineering (IJCEE), 8(1), 31-43
Link:
http://people.cs.kuleuven.be/~susana.zoghbi/myPublications/IJCEE_Final.pdf
Details/Description:
Ivan Huerta, Marco Pedersoli, Jordi Gonzàlez and Albert Sanfeliu, "Combining where and what in change detection for unsupervised foreground learning in surveillance", Pattern Recognition, Volume 48, Issue 3, Pages 709–719, 2015
Link:
http://iselab.cvc.uab.es/files/Publications/2014/PDF/HPS2014.pdf
Details/Description:
Xavier Perez-Sala, Sergio Escalera, Cecilio Angulo and Jordi Gonzàlez, "A Survey on Model Based Approaches for 2D and 3D Visual Human Pose Recovery", Sensors 14(3), pp 4189-4210, 2014
Link:
Details/Description:
Sergio Escalera, Jordi Gonzàlez, Xavier Baró, Pablo Pardo, Junior Fabian, Marc Oliu, Hugo Jair Escalante, Ivan Huerta, Isabelle Guyon, "ChaLearn Looking at People 2015 new competitions: Age Estimation and Cultural Event Recognition", International Joint Conference on Neural Networks, 2015
Link:
http://www.maia.ub.es/~sergio/linked/ijcnn_age_and_cultural_2015.pdf
Details/Description:
C. Crispim-Junior and F. Bremond. Uncertainty Modeling Framework for Constraint-based Elementary Scenario Detection in Vision System. In the First International Workshop on Computer vision + ONTology Applied Cross-disciplinary Technologies in conjunction with ECCV 2014, CONTACT-2014, Zurich, Switzerland, September 7th, 2014.
Link:
http://www-sop.inria.fr/members/Francois.Bremond/Postscript/carlos_contact2014.pdf
Details/Description:
A. König, C. Crispim, A. Derreumaux, G. Bensadoum, P.D. Petit, F. Bremond, R. David, F. Verhey, P. Aalten and P.H. Robert. Validation of an Automatic Video Monitoring System for the Detection of Instrumental Activities of Daily Living in Dementia Patients, Journal of Alzheimer's Disease, 44 (2015), pp. 675–685, IOS Press, DOI 10.3233/JAD-141767.
Link:
http://www-sop.inria.fr/members/Francois.Bremond/Postscript/JAD-Alexandra2015.pdf
Details/Description:
S. Elloumi, S. Cosar, G. Pusiol, F. Bremond and M. Thonnat. Unsupervised Discovery of Human Activities from Long-Videos, IET Computer Vision, CVI-2014-0311.R1, 2014
Link:
http://www-sop.inria.fr/members/Francois.Bremond/Postscript/actdis_cv.pdf
Details/Description:
Polina Kuznetsova, Vicente Ordonez, Alexander Berg, Tamara Berg and Yejin Choi (2013). Generalizing Image Captions for Image-Text Parallel Corpus. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL'13).
Link:
Details/Description:
Micah Hodosh, Peter Young, Cyrus Rashtchian and Julia Hockenmaier (2010). Cross-Caption Coreference Resolution for Automatic Image Understanding. Proceedings of the 14th Conference on Natural Language Learning (CoNLL'10).
Link:
http://nlp.cs.illinois.edu/HockenmaierGroup/Papers/CoNLL2010/W10-2920.pdf