PhD in Cross-modal Deep Learning between Vision, Language, Audio and Speech

2018-01-04T23:48:18+00:00 January 4th, 2018|

Would you like to join us? Call open for the INPhINIT-"la Caixa" Doctoral fellowship programme. under the BSC research center, that offers 20 PhD INPhINIT positions.   INPhINIT is a doctoral fellowship programme devoted to attracting international Early-Stage Researchers to the top Spanish research centres. INPhINIT is promoted by "la Caixa" Foundation with the aim of supporting the best scientific talent and fostering innovative and high-quality research in Spain by recruiting outstanding international students and offering them an attractive and competitive environment for conducting research of excellence. Deep neural networks have boosted the convergence of multimedia data analytics in a unified framework shared by practitioners in natural language, vision, audio and speech. Image captioning, lip reading or video sonorization are some of the first applications of a [...]

BSC-CNS at NIPS2017, a top Machine Learning and Artificial Intelligence conference

2017-12-03T18:38:00+00:00 December 3rd, 2017|

BSC-CNS at NIPS2017, a top Machine Learning and Artificial Intelligence conference BSC-CNS will be present at the 31th Annual Conference on Neural Information Processing Systems (NIPS 2017), a machine learning and computational neuroscience conference that includes invited talks, demonstrations and oral and poster presentations. It covers topics ranging from deep learning and computer vision to cognitive science and reinforcement learning. NIPS is one of the top Machine Learning and Artificial Intelligence conferences in the world and has become the academic and industry AI conference. One paper will be presented at Machine Learning for Health workshop: "Detection-aided liver lesion segmentation using deep learning". In this paper we propose a method to segment the liver and its lesions from Computed Tomography (CT) scans using Convolutional Neural Networks (CNNs), that have proven good results in a variety of computer [...]

Factors that have triggered the potential of Deep Learning

2017-11-11T13:37:43+00:00 September 27th, 2017|

Artificial Intelligence and Neural Networks, are not a new concepts! Why, all of a sudden, have them become the next big thing that is changing our life again during this decade? Deep Learning is changing our life I'm sure you have doubtlessly noticed quantum leaps in the quality of a wide range of everyday technologies.   In Speech Recognition  the transcription of voice to text has experimented amazing advances, and it is already available in different devices. We are increasingly interacting with “our” computers by just talking to them.  Also there have been some spectacular advances in Natural Language Processing, for example, by simply clicking on the micro symbol of Google Translate, the system will transcribe what you are dictating to another language. Google Translate now renders spoken sentences in [...]

A stroll through Marenostrum IV computing resources

2017-11-11T13:39:20+00:00 September 18th, 2017|

Enclosed you will find the slides that I used in my last visit to the Torre Girona chapel with UPC students. The slides show the awesome computational power of the Marenostrum IV supercomputer and the presentation of the architecture required to get it.  I have posted these transparencies (and photo) for my students as I promised them. But anyone who wants to know a little more about the Marenostrum can take a look at them to realize the magnitude of this supercomputer.      SLIDESHARE slides <blockquote>In reality, this post was intended for my UPC course's students, although I think it may be of interest to other students.</blockquote>

BSC at the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

2017-11-11T13:41:17+00:00 July 26th, 2017|

The 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) is over.   CVPR'2017 is the premier annual computer vision event comprising the main conference and several co-located workshops, the best meeting point for students, academics and industry researchers in this area. This year Deep Learning and Artificial Intelligence were the focus of the conference, an this was the reason why BSC-CNS  has been present with contributions in two workshops: Disentangling Motion, Foreground and Background Features in Videos: Our model and source code are publicly available at this https URL  The paper that contains the detailed contribution can be downloaded from arxiv repository. SalGAN: Visual Saliency Prediction with Generative Adversarial Networks: Our model and source code are publicly available at this https URL .  The shorter extended abstract presented as spotlight in the CVPR 2017 Scene [...]