NLP News - Recurrent Highway Hypernetworks, New Multimodal Environments, LDA2vec, DL for Structured Data, NIPS highlights, AlphaZero, QAngaroo

Dec 11, 2017

Some highlights of this newsletter: An implementation of recurrent highway hypernetworks; new multimodal environments for visual question answering; why the intelligence explosion is impossible; a tutorial on LDA2vec; Deep Learning for structured data; lots of highlights from NIPS including tutorial slides, Ali Rahimi's presentation, debate and conversation notes, competition winners; AlphaZero; QAngaroo, a new generation of QA datasets.

When is the last time your teeth felt like they had little sweaters on them?

- A question by an Amazon Mechanical Turk worker to determine if they were talking to a human or AI (credit: Sarah Schwettmann).

Tools and implementations

mordecai - Custom-built full text and event geoparsing — github.com

Many applications require geographic information, but extracting it from text is difficult. mordecai is a Python library for full text geoparsing that extracts the place names from a piece of text, resolves them to the correct place, and returns their coordinates and structured geographic information.

The fast.ai deep learning library, lessons, and tutorials — github.com

fast.ai's deep learning library is known to many from its course. It is now available via pip.

Recurrent Highway Hypernetworks — github.com

Pytorch code for Character-level Language Modeling with Recurrent Highway Hypernetworks (NIPS 2017).

A Household multimodal environment — github.com

HoME (Household Multimodal Environment) is a platform for artificial agents to learn from vision, audio, semantics, physics, and interaction with objects and other agents, all within a realistic context.

Realistic and Rich 3D Environments for Embodied Question Answering — embodiedqa.org

Staying with multimodal environments, Embodied Question Answering (QA) combines Visual QA and robot navigation. In order to answer a question, the agent must first explore the environment.

pycolab game engine — github.com

We are not done with a environments yet. pycolab is a highly-customisable gridworld game engine with some batteries included that allows you to create your own gridworld games to test reinforcement learning agents.

Unsupervised Language Modeling at scale for robust sentiment classification — github.com

Implementation of Generating Reviews and Discovering Sentiment by OpenAI that reduces training time from 1 month to 5 days.

Reports on benchmarks

2017 AI Index Report

The 2017 Artificial Intelligence Annual Report gives a comprehensive overview of the role of AI in industry, academia, and sheds light on current technical performance.

Machine learning benchmarks: Hardware providers — rare-technologies.com

A comparison of machine learning frameworks and hardware platforms from practical aspects, such as their ease of use, cost, stability, scalability and performance.

Chatbots

Your chatbot needs a personality — blog.myralabs.com This blog post emphasizes that the current generation of chatbots require a personality to succeed. But maybe this is the wrong way to think about it, as pointed out by a HN user:

We do not yet have chatbots. What we do have are publicly-facing undocumented nondeterministic command line interfaces which expect the user to guess the right commands. This interface further insults the user by pretending to be a person.

‘The Woebot will see you now’ — the rise of chatbot therapy

A Washington post article about the dozens of online apps that offer instant therapy for the lonely, the confused and the anxious via chatbots.

Alexa Prize winners — developer.amazon.com

The winners of the Alexa Prize 2017 have been announced. The winning system by University of Washington was able to sustain conversations with an average duration of 10 minutes, 22 seconds and an average score of 3.17/5.

The promise and danger of AI

Using Artificial Intelligence to Augment Human Intelligence — distill.pub

A great article that outlines directions of how we can augment our intellect using AI, in line with early ideas of human-computer symbiosis envisioned by pioneers such as J.C.R. Licklider, Douglas Engelbart, and Alan Kay.

The impossibility of intelligence explosion — medium.com

An essay by François Chollet on why the singularity, i.e. the explosion of super-human intelligence is impossible.

Specifying AI safety problems in simple environments — deepmind.com

This blog posts discusses a recent DeepMind paper on AI safety that introduces a selection of simple reinforcement learning environments designed specifically to measure ‘safe behaviours’.

Bias

Interpreting Deep Neural Networks with SVCCA — research.googleblog.com

This blog post discusses SVCCA, a novel simple and scalable method to make models more interpretable.

Debugging data: Microsoft researchers look at ways to train AI systems to reflect the real world — blogs.microsoft.com

A blog post looking at the work of Microsoft researchers such as Hanna Wallach, Kate Crawford, and Patrice Simard to reduce bias in ML models.

Translations from Turkish (a gender-neutral language) are gendered.

Tutorials

LDA2vec: Word Embeddings in Topic Models — towardsdatascience.com

Lars Hulstaert gives a great overview of LDA2vec, a model that learns dense word vectors jointly with Dirichlet-distributed latent document-level mixtures of topic vectors.

Sequence Modeling with CTC — distill.pub A visual guide to Connectionist Temporal Classification, an algorithm used to train deep neural networks in speech recognition, handwriting recognition and other sequence problems.

Reproducible Data Analysis in Jupyter

A series of tutorials on conducting reproducible data analysis in Jupyter notebooks.

NIPS 2017

Deep Probabilistic Modelling with Gaussian Processes

Slides of the tutorial on Deep Probabilistic Modelling with Gaussian Processes by Neil D. Lawrence.

Geometric deep learning on graphs and manifolds tutorial — www.dropbox.com

Slides of the tutorial on geometric deep learning on graphs and manifolds.

Accepted papers of the Machine Learning for Creativity and Design workhshop — nips2017creativity.github.io

Accepted papers of the Machine Learning for Creativity and Design workshop. The AI-created art can be seen here.

Safe and Nested Subgame Solving for Imperfect-Information Games

One of the NIPS 2017 best papers, which introduces techniques for solving imperfect-information games that were used in Libratus, the first AI that defeated top humans in heads-up no-limit Texas hold'em poker. A teaser of the paper can be seen here.

Ali Rahimi's Test-of-time award presentation — www.youtube.com

In his presentation, Ali Rahimi likens the current generation of ML models to alchemy, which the community has failed to underpin with theory. Yann LeCun rebuts his arguments, to which Rahimi replies here.

Machine Learning for Systems and Systems for Machine Learning

Jeff Dean's presentation on using ML to improve systems, e.g. compilers, device placement, etc. and creating new systems for ML, e.g. TPUs, linear algebra libraries, etc.

Tesla's conversation on AI with Andrej Karpathy, Elon Musk, and others

Stephen Merity chronicles Tesla's conversation on AI, which touches on Tesla's hardware and software for AI, self-driving cars, AGI, and other topics.

Interpretability debate

Dave Gershgorn documents the debate on interpretability with Rich Caruana, Patrice Simard, Kilian Weinberger, and Yann LeCun.

Winning system white paper

The white paper of the winning system at the NIPS 2017 Human-Computer QA competition.

Industry insights

Facebook AI Research Residency Program — research.fb.com

Facebook AI Research has announced a one-year research training program designed to give you hands-on experience of machine learning research.

Victims of Sexual Harassment Have a New Resource: AI — www.technologyreview.com

Montreal-based Botler AI launched a system that provides free information and guidance to those who have been sexually harassed and are unsure of their legal rights.

How Reuters’s Revolutionary AI System Gathers Global News — www.technologyreview.com

Reuters ML system uses Twitter to record news events as they as they are happening.

Facebook to expand artificial intelligence to help prevent suicide — www.reuters.com

Facebook Inc will expand its ML software to detect users with suicidal intent to other countries after successful tests in the US.

Paper picks

Are GANs Created Equal? A Large-Scale Study

With the GAN zoo growing and growing, there is no clear evidence which is the best GAN. This large-scale study evaluates the most popular GAN variants and -- controversially -- finds no evidence that any of the tested algorithms consistently outperforms the original one.

Extracting Automata from Recurrent Neural Networks Using Queries and Counterexamples

In the line of research on interpreting neural networks, one direction is to model the underlying complex model with a simpler model. This paper proposes a method to extract an automaton from an LSTM. The technique can highlight cases where the LSTM fails to learn the intended generalization, as the extracted automata is overly complex compared to a simple target language.

Ethical Challenges in Data-driven Dialogue Systems

Chatbots and dialogue systems are becoming more commonplace, but can lead to biased or offensive conversations. This study highlights potential ethical issues that arise in dialogue systems research, e.g. implicit biases, adversarial examples, privacy violations, safety concerns, RL considerations, and reproducibility concerns.

Neural Text Generation: A Practical Guide

This paper gives an overview of encoder-decoder architectures for neural text generation and highlights some best practices.

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

The most recent article discussing the latest generation of AlphaGo, Alpha Zero. Similar to AlphaGo Zero, Alpha Zero has learned chess and shogi entirely from scratch using self-play. Alpha Zero's games defeating Stockfish, the strongest open-source chess engine in the world, are well worth watching.

Dataset spotlight

QAngaroo - Reading Comprehension with Multiple Hops — qangaroo.cs.ucl.ac.uk

Question answering and reading comprehension has received a large amount of interest in recent years fuelled by new and large datasets such as CNN News, SQuAD, and MSMARCO. All of these datasets, however, simplify the task in that the provided passage is known to contain the answer. Welbl et al. from UCL introduce QAngaroo, a collection of two new datasets that model the more realistic setting of first having to find the relevant passage. These datasets should drive the development of QA systems and should make them more useful for more realistic applications.

NLP News