PAC Bayes Quadratic bound open sourcing.

PiperOrigin-RevId: 302629851
Vikram Tankasali, 2020-03-24 10:10:46 +00:00
committed by Louise Deason
commit f6395d709e (parent afcdc77239)
13 changed files with 1080 additions and 0 deletions

glassy_dynamics/README.md Normal file

@@ -0,0 +1,80 @@
# Unveiling the predictive power of static structure in glassy systems
This repository contains an open-source implementation of the graph neural
network model described in our paper. The model can be trained with the
training binary included in this repository and the dataset published
alongside the paper.
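
For orientation, training can also be invoked programmatically. The sketch
below is illustrative only (the paths are placeholders); the keyword arguments
mirror the `train.train_model` function released in this repository, and the
dataset layout is described in the Dataset section below.

```python
# Illustrative sketch; point the patterns at your local copy of the dataset.
from glassy_dynamics import train

best_test_correlation = train.train_model(
    train_file_pattern='path/to/train/aggregated*',  # placeholder path
    test_file_pattern='path/to/test/aggregated*',    # placeholder path
    time_index=9,  # index of the target time point, see "Data format" below
    checkpoint_path='checkpoints/best_model.ckpt')   # optional
print('Best correlation on the test set:', best_test_correlation)
```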
## Abstract
Despite decades of theoretical studies, the nature of the glass transition
remains elusive and debated, while the existence of structural predictors of the
dynamics is a major open question. Recent approaches propose inferring
predictors from a variety of human-defined features using machine learning.
We learn the long time evolution of a glassy system solely from the initial
particle positions and without any hand-crafted features, using a powerful
model: graph neural networks. We show that this method strongly outperforms
state-of-the-art methods, generalizing over a wide range of temperatures,
pressures, and densities. In shear experiments, it predicts the location of
rearranging particles. The structural predictors learned by our network unveil a
correlation length which increases with larger timescales to reach the size of
our system. Beyond glasses, our method could apply to many other physical
systems that map to a graph of local interactions.
## Dataset
### System description
The dataset was generated with the LAMMPS molecular dynamics package.
The simulated system has periodic boundaries and is a binary mixture of 4096
large (A) and small (B) particles that interact via a 6-12 Lennard-Jones
potential.
The interaction coefficients are set to the standard Kob-Andersen values.
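
For reference, the 6-12 Lennard-Jones pair potential between species
α, β ∈ {A, B} is

    V_αβ(r) = 4 ε_αβ [ (σ_αβ / r)^12 − (σ_αβ / r)^6 ]

The standard Kob-Andersen parameter choice (quoted here for convenience;
consult the paper for the exact simulation settings) is ε_AA = 1.0,
ε_AB = 1.5, ε_BB = 0.5 and σ_AA = 1.0, σ_AB = 0.8, σ_BB = 0.88, with an
80:20 ratio of A to B particles, all in Lennard-Jones units.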
### Data format
The data is stored using Python's pickle format (protocol version 3).
Each file contains the data for one equilibrated system as a Python
dictionary with the following entries:
- `positions`: the particle positions of the equilibrated system.
- `types`: the particle types (0 == type A and 1 == type B) of the
  equilibrated system.
- `box`: the dimensions of the periodic cubic simulation box.
- `time`: the logarithmically sampled time points.
- `time_indices`: the indices of the time points for which the sampled
  trajectories on average reach a certain value of the intermediate
  scattering function.
- `is_values`: the values of the intermediate scattering function associated
  with each time index.
- `trajectory_start_velocities`: the velocities drawn from a Boltzmann
  distribution at the start of each trajectory.
- `trajectory_target_positions`: the positions of the particles for each of
  the trajectories at selected time points (as defined by the `time_indices`
  array and the corresponding values of the intermediate scattering function
  stored in `is_values`).
- `metadata`: a dictionary containing additional metadata:
  - `temperature`: the temperature at which the system was equilibrated.
  - `pressure`: the pressure at which the system was equilibrated.
  - `fluid`: the type of fluid which was simulated (Kob-Andersen).
All quantities are given in Lennard-Jones units. The positions are stored in
absolute coordinates, i.e. a particle's coordinates lie outside the simulation
box if it crossed a periodic boundary during the simulation.
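
For example, a single file can be loaded and inspected as follows (a sketch
only; the filename is a placeholder):

```python
import pickle

import numpy as np

with open('train/aggregated_example.pickle', 'rb') as f:  # placeholder name
  data = pickle.load(f)

print(data['positions'].shape)          # (n_particles, 3), absolute coords
print(data['types'].shape)              # (n_particles,), 0 == A, 1 == B
print(data['box'])                      # (3,), box dimensions
print(data['metadata']['temperature'])  # equilibration temperature

# Particle positions of every sampled trajectory at the last stored time
# point (indexed as described for `time_indices` above).
final_positions = data['trajectory_target_positions'][-1]

# If needed, wrap the absolute positions back into the periodic box.
wrapped_positions = np.mod(data['positions'], data['box'])
```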
## Reference
If this repository is helpful for your research, please cite the following
publication:

Unveiling the predictive power of static structure in glassy systems,
V. Bapst, T. Keck, A. Grabska-Barwinska, C. Donner, E. D. Cubuk,
S. S. Schoenholz, A. Obika, A. W. R. Nelson, T. Back, D. Hassabis and P. Kohli
## Disclaimer
This is not an official Google product.

glassy_dynamics/apply_binary.py Normal file

@@ -0,0 +1,62 @@
# Copyright 2019 Deepmind Technologies Limited.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Applies a graph-based network to predict particle mobilities in glasses."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import os
from absl import app
from absl import flags
from glassy_dynamics import train
FLAGS = flags.FLAGS
flags.DEFINE_string(
'data_directory',
'',
'Directory which contains the train or test datasets.')
flags.DEFINE_integer(
'time_index',
9,
'The time index of the target mobilities.')
flags.DEFINE_integer(
'max_files_to_load',
None,
'The maximum number of files to load.')
flags.DEFINE_string(
'checkpoint_path',
'checkpoints/t044_s09.ckpt',
'Path used to load the model.')
def main(argv):
if len(argv) > 1:
raise app.UsageError('Too many command-line arguments.')
file_pattern = os.path.join(FLAGS.data_directory, 'aggregated*')
train.apply_model(
checkpoint_path=FLAGS.checkpoint_path,
file_pattern=file_pattern,
max_files_to_load=FLAGS.max_files_to_load,
time_index=FLAGS.time_index)
if __name__ == '__main__':
app.run(main)

Binary file not shown.

Binary file not shown.

glassy_dynamics/graph_model.py Normal file

@@ -0,0 +1,190 @@
# Copyright 2019 Deepmind Technologies Limited.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""A graph neural network based model to predict particle mobilities.
The architecture and performance of this model is described in our publication:
"Unveiling the predictive power of static structure in glassy systems".
"""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import functools
from graph_nets import graphs
from graph_nets import modules as gn_modules
from graph_nets import utils_tf
import sonnet as snt
import tensorflow.compat.v1 as tf
from typing import Any, Dict, Text, Tuple, Optional
def make_graph_from_static_structure(
positions,
types,
box,
edge_threshold):
"""Returns graph representing the static structure of the glass.
Each particle is represented by a node in the graph. The particle type is
stored as a node feature.
Two particles at a distance less than the threshold are connected by an edge.
The relative distance vector is stored as an edge feature.
Args:
positions: particle positions with shape [n_particles, 3].
types: particle types with shape [n_particles].
box: dimensions of the cubic box that contains the particles with shape [3].
edge_threshold: particles at distance less than threshold are connected by
an edge.
"""
# Calculate pairwise relative distances between particles: shape [n, n, 3].
cross_positions = positions[tf.newaxis, :, :] - positions[:, tf.newaxis, :]
# Enforces periodic boundary conditions.
box_ = box[tf.newaxis, tf.newaxis, :]
cross_positions += tf.cast(cross_positions < -box_ / 2., tf.float32) * box_
cross_positions -= tf.cast(cross_positions > box_ / 2., tf.float32) * box_
# Calculates adjacency matrix in a sparse format (indices), based on the given
# distances and threshold.
distances = tf.norm(cross_positions, axis=-1)
indices = tf.where(distances < edge_threshold)
# Defines graph.
nodes = types[:, tf.newaxis]
senders = indices[:, 0]
receivers = indices[:, 1]
edges = tf.gather_nd(cross_positions, indices)
return graphs.GraphsTuple(
nodes=tf.cast(nodes, tf.float32),
n_node=tf.reshape(tf.shape(nodes)[0], [1]),
edges=tf.cast(edges, tf.float32),
n_edge=tf.reshape(tf.shape(edges)[0], [1]),
globals=tf.zeros((1, 1), dtype=tf.float32),
receivers=tf.cast(receivers, tf.int32),
senders=tf.cast(senders, tf.int32)
)
def apply_random_rotation(graph: graphs.GraphsTuple) -> graphs.GraphsTuple:
  """Returns randomly rotated graph representation.

  The rotation is an element of O(3) with rotation angles that are multiples
  of pi/2. This function assumes that the relative particle distances are
  stored in the edge features.

  Args:
    graph: The graphs tuple as defined in `graph_nets.graphs`.
  """
  # Transposes edge features, so that the axes are in the first dimension.
  # Outputs a tensor of shape [3, n_edges].
  xyz = tf.transpose(graph.edges)
  # Random pi/2 rotation(s): a random permutation of the three axes.
  permutation = tf.random.shuffle(tf.constant([0, 1, 2], dtype=tf.int32))
  xyz = tf.gather(xyz, permutation)
  # Random reflections: each axis is flipped independently with probability
  # one half. Together with the permutation this samples one of the 48 signed
  # permutation matrices, i.e. the symmetries of the cube within O(3).
  symmetry = tf.random_uniform([3], minval=0, maxval=2, dtype=tf.int32)
  symmetry = 1 - 2 * tf.cast(tf.reshape(symmetry, [3, 1]), tf.float32)
  xyz = xyz * symmetry
  edges = tf.transpose(xyz)
  return graph.replace(edges=edges)
class GraphBasedModel(snt.AbstractModule):
  """Graph based model which predicts particle mobilities from their positions.

  This network encodes the nodes and edges of the input graph independently,
  and then performs message-passing on this graph, updating its edges based on
  their associated nodes, then updating the nodes based on the input nodes'
  features and their associated updated edge features.
  This update is repeated several times.
  Afterwards the resulting node embeddings are decoded to predict the particle
  mobility.
  """

  def __init__(self,
               n_recurrences: int,
               mlp_sizes: Tuple[int, ...],
               mlp_kwargs: Optional[Dict[Text, Any]] = None,
               name='Graph'):
    """Creates a new GraphBasedModel object.

    Args:
      n_recurrences: the number of message passing steps in the graph network.
      mlp_sizes: the number of neurons in each layer of the MLP.
      mlp_kwargs: additional keyword arguments passed to the MLP.
      name: the name of the Sonnet module.
    """
    super(GraphBasedModel, self).__init__(name=name)
    self._n_recurrences = n_recurrences

    if mlp_kwargs is None:
      mlp_kwargs = {}

    model_fn = functools.partial(
        snt.nets.MLP,
        output_sizes=mlp_sizes,
        activate_final=True,
        **mlp_kwargs)

    final_model_fn = functools.partial(
        snt.nets.MLP,
        output_sizes=mlp_sizes + (1,),
        activate_final=False,
        **mlp_kwargs)

    with self._enter_variable_scope():
      self._encoder = gn_modules.GraphIndependent(
          node_model_fn=model_fn,
          edge_model_fn=model_fn)

      if self._n_recurrences > 0:
        self._propagation_network = gn_modules.GraphNetwork(
            node_model_fn=model_fn,
            edge_model_fn=model_fn,
            # We do not use globals, hence we just pass the identity function.
            global_model_fn=lambda: lambda x: x,
            reducer=tf.unsorted_segment_sum,
            edge_block_opt=dict(use_globals=False),
            node_block_opt=dict(use_globals=False),
            global_block_opt=dict(use_globals=False))

      self._decoder = gn_modules.GraphIndependent(
          node_model_fn=final_model_fn,
          edge_model_fn=model_fn)

  def _build(self, graphs_tuple: graphs.GraphsTuple) -> tf.Tensor:
    """Connects the model into the tensorflow graph.

    Args:
      graphs_tuple: input graph tensor as defined in `graph_nets.graphs`.

    Returns:
      tensor with shape [n_particles] containing the predicted particle
      mobilities.
    """
    encoded = self._encoder(graphs_tuple)
    outputs = encoded

    for _ in range(self._n_recurrences):
      # Adds skip connections to the encoded features.
      inputs = utils_tf.concat([outputs, encoded], axis=-1)
      outputs = self._propagation_network(inputs)

    decoded = self._decoder(outputs)
    return tf.squeeze(decoded.nodes, axis=-1)
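
A minimal usage sketch for the model above, assuming random placeholder data
and illustrative hyperparameters (TF1 graph mode, as pinned in
requirements.txt):

# Sketch: build a graph from (random) particle data and predict mobilities.
import numpy as np
import tensorflow.compat.v1 as tf

from glassy_dynamics import graph_model

positions = tf.constant(np.random.rand(64, 3) * 5.0, dtype=tf.float32)
types = tf.constant(np.random.randint(0, 2, size=[64]), dtype=tf.int32)
box = tf.constant([5.0, 5.0, 5.0], dtype=tf.float32)

graph = graph_model.make_graph_from_static_structure(
    positions, types, box, edge_threshold=2.0)
model = graph_model.GraphBasedModel(n_recurrences=7, mlp_sizes=(64, 64))
mobilities = model(graph)  # Tensor with shape [64].

with tf.Session() as session:
  session.run(tf.global_variables_initializer())
  print(session.run(mobilities).shape)  # (64,)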

glassy_dynamics/graph_model_test.py Normal file

@@ -0,0 +1,147 @@
# Copyright 2019 Deepmind Technologies Limited.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Tests for graph_model."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import itertools
from absl.testing import parameterized
from graph_nets import graphs
import numpy as np
import tensorflow.compat.v1 as tf
from glassy_dynamics import graph_model
class GraphModelTest(tf.test.TestCase, parameterized.TestCase):
def setUp(self):
"""Initializes a small tractable test (particle) system."""
super(GraphModelTest, self).setUp()
# Fixes random seed to ensure deterministic outputs.
tf.random.set_random_seed(1234)
# In this test we use a small tractable set of particles covering all corner
# cases:
# a) eight particles with different types,
# b) periodic box is not cubic,
# c) three disjoint cluster of particles separated by a threshold > 2,
# d) first two clusters overlap with the periodic boundary,
# e) first cluster is not fully connected,
# f) second cluster is fully connected,
# g) and third cluster is a single isolated particle.
#
# The formatting of the code below separates the three clusters by
# adding linebreaks after each cluster.
self._positions = np.array(
[[0.0, 0.0, 0.0], [2.5, 0.0, 0.0], [0.0, 1.5, 0.0], [0.0, 0.0, 9.0],
[0.0, 5.0, 0.0], [0.0, 5.0, 1.0], [3.0, 5.0, 0.0],
[2.0, 3.0, 3.0]])
self._types = np.array([0.0, 0.0, 1.0, 0.0,
0.0, 1.0, 0.0,
0.0])
self._box = np.array([4.0, 10.0, 10.0])
# Creates the corresponding graph elements, assuming a threshold of 2 and
# the conventions described in `graph_nets.graphs`.
self._edge_threshold = 2
self._nodes = np.array(
[[0.0], [0.0], [1.0], [0.0],
[0.0], [1.0], [0.0],
[0.0]])
self._edges = np.array(
[[0.0, 0.0, 0.0], [-1.5, 0.0, 0.0], [0.0, 1.5, 0.0], [0.0, 0.0, -1.0],
[1.5, 0.0, 0.0], [0.0, 0.0, 0.0], [1.5, 0.0, -1.0],
[0.0, -1.5, 0.0], [0.0, 0.0, 0.0], [0.0, -1.5, -1.0],
[0.0, 0.0, 1.0], [-1.5, 0.0, 1.0], [0.0, 1.5, 1.0], [0.0, 0.0, 0.0],
[0.0, 0.0, 0.0], [0.0, 0.0, 1.0], [-1.0, 0.0, 0.0],
[0.0, 0.0, -1.0], [0.0, 0.0, 0.0], [-1.0, 0.0, -1.0],
[1.0, 0.0, 0.0], [1.0, 0.0, 1.0], [0.0, 0.0, 0.0],
[0.0, 0.0, 0.0]])
self._receivers = np.array(
[0, 1, 2, 3, 0, 1, 3, 0, 2, 3, 0, 1, 2, 3,
4, 5, 6, 4, 5, 6, 4, 5, 6,
7])
self._senders = np.array(
[0, 0, 0, 0, 1, 1, 1, 2, 2, 2, 3, 3, 3, 3,
4, 4, 4, 5, 5, 5, 6, 6, 6,
7])
  def _get_graphs_tuple(self):
    """Returns a GraphsTuple containing a graph based on the test system."""
    return graphs.GraphsTuple(
        nodes=tf.constant(self._nodes, dtype=tf.float32),
        edges=tf.constant(self._edges, dtype=tf.float32),
        globals=tf.constant(np.array([[0.0]]), dtype=tf.float32),
        receivers=tf.constant(self._receivers, dtype=tf.int32),
        senders=tf.constant(self._senders, dtype=tf.int32),
        n_node=tf.constant([len(self._nodes)], dtype=tf.int32),
        n_edge=tf.constant([len(self._edges)], dtype=tf.int32))

  def test_make_graph_from_static_structure(self):
    graphs_tuple_op = graph_model.make_graph_from_static_structure(
        tf.constant(self._positions, dtype=tf.float32),
        tf.constant(self._types, dtype=tf.int32),
        tf.constant(self._box, dtype=tf.float32),
        self._edge_threshold)
    graphs_tuple = self.evaluate(graphs_tuple_op)

    self.assertLen(self._nodes, graphs_tuple.n_node)
    self.assertLen(self._edges, graphs_tuple.n_edge)
    np.testing.assert_almost_equal(graphs_tuple.nodes, self._nodes)
    np.testing.assert_equal(graphs_tuple.senders, self._senders)
    np.testing.assert_equal(graphs_tuple.receivers, self._receivers)
    np.testing.assert_almost_equal(graphs_tuple.globals, np.array([[0.0]]))
    np.testing.assert_almost_equal(graphs_tuple.edges, self._edges)

  def _is_equal_up_to_rotation(self, x, y):
    for axes in itertools.permutations([0, 1, 2]):
      for mirrors in itertools.product([1, -1], repeat=3):
        if np.allclose(x, y[:, axes] * mirrors):
          return True
    return False

  def test_apply_random_rotation(self):
    graphs_tuple = self._get_graphs_tuple()
    rotated_graphs_tuple_op = graph_model.apply_random_rotation(graphs_tuple)
    rotated_graphs_tuple = self.evaluate(rotated_graphs_tuple_op)

    # Nodes, senders, receivers and globals are unaffected by the rotation.
    np.testing.assert_almost_equal(rotated_graphs_tuple.nodes, self._nodes)
    np.testing.assert_almost_equal(rotated_graphs_tuple.senders, self._senders)
    np.testing.assert_almost_equal(
        rotated_graphs_tuple.receivers, self._receivers)
    np.testing.assert_almost_equal(
        rotated_graphs_tuple.globals, np.array([[0.0]]))
    # The edges (relative distance vectors) must match the originals up to a
    # permutation of the axes and reflections.
    self.assertTrue(self._is_equal_up_to_rotation(rotated_graphs_tuple.edges,
                                                  self._edges))

  @parameterized.named_parameters(('no_propagation', 0, (30,)),
                                  ('multi_propagation', 5, (15,)),
                                  ('multi_layer', 1, (20, 30)))
  def test_GraphModel(self, n_recurrences, mlp_sizes):
    graphs_tuple = self._get_graphs_tuple()
    output_op = graph_model.GraphBasedModel(n_recurrences=n_recurrences,
                                            mlp_sizes=mlp_sizes)(graphs_tuple)
    self.assertListEqual(output_op.shape.as_list(), [len(self._types)])

    # Tests if the model runs without crashing.
    with self.session():
      tf.global_variables_initializer().run()
      output_op.eval()


if __name__ == '__main__':
  tf.test.main()

glassy_dynamics/requirements.txt Normal file

@@ -0,0 +1,30 @@
# GlassyDynamics
absl-py==0.8.1
astor==0.8.0
cloudpickle==1.1.1
contextlib2==0.6.0.post1
decorator==4.4.0
dm-sonnet==1.35
future==0.18.0
gast==0.3.2
google-pasta==0.1.7
graph-nets==1.0.4
grpcio==1.24.1
h5py==2.10.0
Keras-Applications==1.0.8
Keras-Preprocessing==1.1.0
Markdown==3.1.1
mock==3.0.5
networkx==2.3
numpy==1.17.2
protobuf==3.10.0
semantic-version==2.8.2
six==1.12.0
tensorboard==1.13.1
tensorflow==1.13.1
tensorflow-estimator==1.13.0
tensorflow-probability==0.6.0
termcolor==1.1.0
Werkzeug==0.16.0
wrapt==1.11.2

Binary file not shown.

Binary file not shown.

glassy_dynamics/train.py Normal file

@@ -0,0 +1,381 @@
# Copyright 2019 Deepmind Technologies Limited.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Training pipeline for the prediction of particle mobilities in glasses."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import collections
import pickle
from absl import logging
import enum
import numpy as np
import tensorflow.compat.v1 as tf
import tensorflow_probability as tfp
from typing import Any, Dict, List, Optional, Text, Tuple, Sequence
from glassy_dynamics import graph_model
tf.enable_resource_variables()
LossCollection = collections.namedtuple('LossCollection',
'l1_loss, l2_loss, correlation')
GlassSimulationData = collections.namedtuple('GlassSimulationData',
'positions, targets, types, box')
class ParticleType(enum.IntEnum):
"""The simulation contains two particle types, identified as type A and B.
The dataset encodes the particle type in an integer.
- 0 corresponds to particle type A.
- 1 corresponds to particle type B.
"""
A = 0
B = 1
def get_targets(
    initial_positions: np.ndarray,
    trajectory_target_positions: Sequence[np.ndarray]) -> np.ndarray:
  """Returns the averaged particle mobilities from the sampled trajectories.

  Args:
    initial_positions: the initial positions of the particles with shape
      [n_particles, 3].
    trajectory_target_positions: the absolute positions of the particles at
      the target time for all sampled trajectories, each with shape
      [n_particles, 3].
  """
  # The mobility of a particle is the norm of its displacement from the
  # initial position, averaged over all sampled trajectories.
  targets = np.mean([np.linalg.norm(t - initial_positions, axis=-1)
                     for t in trajectory_target_positions], axis=0)
  return targets.astype(np.float32)
def load_data(
    file_pattern: Text,
    time_index: int,
    max_files_to_load: Optional[int] = None) -> List[GlassSimulationData]:
  """Returns a list containing the training or test dataset.

  Each returned `GlassSimulationData` element contains:
    `positions`: `np.ndarray` containing the particle positions with shape
      [n_particles, 3].
    `targets`: `np.ndarray` containing particle mobilities with shape
      [n_particles].
    `types`: `np.ndarray` containing the particle types with shape
      [n_particles].
    `box`: `np.ndarray` containing the dimensions of the periodic box with
      shape [3].

  Args:
    file_pattern: pattern matching the files with the simulation data.
    time_index: the time index of the targets.
    max_files_to_load: the maximum number of files to load.
  """
  filenames = tf.io.gfile.glob(file_pattern)
  if max_files_to_load:
    filenames = filenames[:max_files_to_load]

  static_structures = []
  for filename in filenames:
    with tf.io.gfile.GFile(filename, 'rb') as f:
      data = pickle.load(f)
    static_structures.append(GlassSimulationData(
        positions=data['positions'].astype(np.float32),
        targets=get_targets(
            data['positions'],
            data['trajectory_target_positions'][time_index]),
        types=data['types'].astype(np.int32),
        box=data['box'].astype(np.float32)))
  return static_structures
def get_loss_ops(
    prediction: tf.Tensor,
    target: tf.Tensor,
    types: tf.Tensor) -> LossCollection:
  """Returns L1/L2 loss and correlation for type A particles.

  Args:
    prediction: tensor with shape [n_particles] containing the predicted
      particle mobilities.
    target: tensor with shape [n_particles] containing the true particle
      mobilities.
    types: tensor with shape [n_particles] containing the particle types.
  """
  # Considers only type A particles.
  mask = tf.equal(types, ParticleType.A)
  prediction = tf.boolean_mask(prediction, mask)
  target = tf.boolean_mask(target, mask)

  return LossCollection(
      l1_loss=tf.reduce_mean(tf.abs(prediction - target)),
      l2_loss=tf.reduce_mean((prediction - target)**2),
      correlation=tf.squeeze(tfp.stats.correlation(
          prediction[:, tf.newaxis], target[:, tf.newaxis])))


def get_minimize_op(
    loss: tf.Tensor,
    learning_rate: float,
    grad_clip: Optional[float] = None) -> tf.Operation:
  """Returns minimization operation.

  Args:
    loss: the loss tensor which is minimized.
    learning_rate: the learning rate used by the optimizer.
    grad_clip: all gradients are clipped to the given value if not None or 0.
  """
  optimizer = tf.train.AdamOptimizer(learning_rate)
  grads_and_vars = optimizer.compute_gradients(loss)
  if grad_clip:
    grads, _ = tf.clip_by_global_norm(
        [g for g, _ in grads_and_vars], grad_clip)
    grads_and_vars = [(g, pair[1]) for g, pair in zip(grads, grads_and_vars)]
  minimize = optimizer.apply_gradients(grads_and_vars)
  return minimize
def _log_stats_and_return_mean_correlation(
    label: Text,
    stats: List[LossCollection]) -> float:
  """Logs performance statistics and returns the mean correlation.

  Args:
    label: label printed before the combined statistics e.g. train or test.
    stats: statistics calculated for each batch in a dataset.

  Returns:
    mean correlation
  """
  for key in LossCollection._fields:
    values = [getattr(s, key) for s in stats]
    mean = np.mean(values)
    std = np.std(values)
    logging.info('%s: %s: %.4f +/- %.4f', label, key, mean, std)
  return np.mean([s.correlation for s in stats])
def train_model(train_file_pattern: Text,
                test_file_pattern: Text,
                max_files_to_load: Optional[int] = None,
                n_epochs: int = 1000,
                time_index: int = 9,
                augment_data_using_rotations: bool = True,
                learning_rate: float = 1e-4,
                grad_clip: Optional[float] = 1.0,
                n_recurrences: int = 7,
                mlp_sizes: Tuple[int, ...] = (64, 64),
                mlp_kwargs: Optional[Dict[Text, Any]] = None,
                edge_threshold: float = 2.0,
                measurement_store_interval: int = 1000,
                checkpoint_path: Optional[Text] = None) -> float:
  """Trains GraphModel using tensorflow.

  Args:
    train_file_pattern: pattern matching the files with the training data.
    test_file_pattern: pattern matching the files with the test data.
    max_files_to_load: the maximum number of train and test files to load.
      If None, all files will be loaded.
    n_epochs: the number of passes through the training dataset (epochs).
    time_index: the time index (0-9) of the target mobilities.
    augment_data_using_rotations: data is augmented by using random rotations.
    learning_rate: the learning rate used by the optimizer.
    grad_clip: all gradients are clipped to the given value.
    n_recurrences: the number of message passing steps in the graph network.
    mlp_sizes: the number of neurons in each layer of the MLP.
    mlp_kwargs: additional keyword arguments passed to the MLP.
    edge_threshold: particles at distance less than threshold are connected by
      an edge.
    measurement_store_interval: number of steps between storing objective
      values (loss and correlation).
    checkpoint_path: path used to store the checkpoint with the highest
      correlation on the test set.

  Returns:
    Correlation on the test dataset of the best model encountered during
    training.
  """
  if mlp_kwargs is None:
    mlp_kwargs = dict(
        initializers=dict(w=tf.variance_scaling_initializer(1.0),
                          b=tf.variance_scaling_initializer(0.1)))

  # Loads train and test dataset.
  dataset_kwargs = dict(
      time_index=time_index,
      max_files_to_load=max_files_to_load)
  training_data = load_data(train_file_pattern, **dataset_kwargs)
  test_data = load_data(test_file_pattern, **dataset_kwargs)

  # Defines wrapper functions, which can directly be passed to the
  # tf.data.Dataset.map function.
  def _make_graph_from_static_structure(static_structure):
    """Converts static structure to graph, targets and types."""
    return (graph_model.make_graph_from_static_structure(
        static_structure.positions,
        static_structure.types,
        static_structure.box,
        edge_threshold),
            static_structure.targets,
            static_structure.types)

  def _apply_random_rotation(graph, targets, types):
    """Applies random rotations to the graph and forwards targets and types."""
    return graph_model.apply_random_rotation(graph), targets, types

  # Defines the data-pipeline based on tf.data.Dataset following the official
  # guideline:
  # https://www.tensorflow.org/guide/datasets#consuming_numpy_arrays.
  # We use initializable iterators to avoid embedding the training and test
  # data directly into the graph.
  # Instead we feed the data to the iterators during the initialization of the
  # iterators before the main training loop.
  placeholders = GlassSimulationData._make(
      tf.placeholder(s.dtype, (None,) + s.shape) for s in training_data[0])
  dataset = tf.data.Dataset.from_tensor_slices(placeholders)
  dataset = dataset.map(_make_graph_from_static_structure)
  dataset = dataset.cache()
  dataset = dataset.shuffle(400)
  # Augments data. This has to be done after calling dataset.cache!
  if augment_data_using_rotations:
    dataset = dataset.map(_apply_random_rotation)
  dataset = dataset.repeat()
  train_iterator = dataset.make_initializable_iterator()

  dataset = tf.data.Dataset.from_tensor_slices(placeholders)
  dataset = dataset.map(_make_graph_from_static_structure)
  dataset = dataset.cache()
  dataset = dataset.repeat()
  test_iterator = dataset.make_initializable_iterator()

  # Creates tensorflow graph.
  # Note: We decouple the training and test datasets from the input pipeline
  # by creating a new iterator from a string-handle placeholder with the same
  # output types and shapes as the training dataset.
  dataset_handle = tf.placeholder(tf.string, shape=[])
  iterator = tf.data.Iterator.from_string_handle(
      dataset_handle, train_iterator.output_types,
      train_iterator.output_shapes)
  graph, targets, types = iterator.get_next()

  model = graph_model.GraphBasedModel(
      n_recurrences, mlp_sizes, mlp_kwargs)
  prediction = model(graph)

  # Defines loss and minimization operations.
  loss_ops = get_loss_ops(prediction, targets, types)
  minimize_op = get_minimize_op(loss_ops.l2_loss, learning_rate, grad_clip)

  best_so_far = -1
  train_stats = []
  test_stats = []

  saver = tf.train.Saver()

  with tf.train.SingularMonitoredSession() as session:
    # Initializes train and test iterators with the training and test
    # datasets. The obtained training and test string-handles can be passed
    # to the dataset_handle placeholder to select the dataset.
    train_handle = session.run(train_iterator.string_handle())
    test_handle = session.run(test_iterator.string_handle())
    feed_dict = {p: [x[i] for x in training_data]
                 for i, p in enumerate(placeholders)}
    session.run(train_iterator.initializer, feed_dict=feed_dict)
    feed_dict = {p: [x[i] for x in test_data]
                 for i, p in enumerate(placeholders)}
    session.run(test_iterator.initializer, feed_dict=feed_dict)

    # Trains model using stochastic gradient descent on the training dataset.
    n_training_steps = len(training_data) * n_epochs
    for i in range(n_training_steps):
      feed_dict = {dataset_handle: train_handle}
      train_loss, _ = session.run((loss_ops, minimize_op),
                                  feed_dict=feed_dict)
      train_stats.append(train_loss)

      if (i + 1) % measurement_store_interval == 0:
        # Evaluates model on test dataset.
        for _ in range(len(test_data)):
          feed_dict = {dataset_handle: test_handle}
          test_stats.append(session.run(loss_ops, feed_dict=feed_dict))

        # Outputs performance statistics on training and test dataset.
        _log_stats_and_return_mean_correlation('Train', train_stats)
        correlation = _log_stats_and_return_mean_correlation(
            'Test', test_stats)
        train_stats = []
        test_stats = []

        # Updates best model based on the observed correlation on the test
        # dataset.
        if correlation > best_so_far:
          best_so_far = correlation
          if checkpoint_path:
            saver.save(session.raw_session(), checkpoint_path)

  return best_so_far
def apply_model(checkpoint_path: Text,
                file_pattern: Text,
                max_files_to_load: Optional[int] = None,
                time_index: int = 9) -> List[np.ndarray]:
  """Applies trained GraphModel using tensorflow.

  Args:
    checkpoint_path: path from which the model is loaded.
    file_pattern: pattern matching the files with the data.
    max_files_to_load: the maximum number of files to load.
      If None, all files will be loaded.
    time_index: the time index (0-9) of the target mobilities.

  Returns:
    Predictions of the model for all files.
  """
  dataset_kwargs = dict(
      time_index=time_index,
      max_files_to_load=max_files_to_load)
  data = load_data(file_pattern, **dataset_kwargs)

  tf.reset_default_graph()

  # Restores the graph from the meta-graph stored next to the checkpoint, and
  # retrieves the relevant tensors by the default names tensorflow assigned
  # when the training graph was built (see `train_model` above).
  saver = tf.train.import_meta_graph(checkpoint_path + '.meta')
  graph = tf.get_default_graph()
  placeholders = GlassSimulationData(
      positions=graph.get_tensor_by_name('Placeholder:0'),
      targets=graph.get_tensor_by_name('Placeholder_1:0'),
      types=graph.get_tensor_by_name('Placeholder_2:0'),
      box=graph.get_tensor_by_name('Placeholder_3:0'))
  prediction_tensor = graph.get_tensor_by_name('Graph_1/Squeeze:0')
  correlation_tensor = graph.get_tensor_by_name('Squeeze:0')
  dataset_handle = graph.get_tensor_by_name('Placeholder_4:0')
  test_initializer = graph.get_operation_by_name('MakeIterator_1')
  test_string_handle = graph.get_tensor_by_name('IteratorToStringHandle_1:0')

  with tf.Session() as session:
    saver.restore(session, checkpoint_path)
    handle = session.run(test_string_handle)
    feed_dict = {p: [x[i] for x in data] for i, p in enumerate(placeholders)}
    session.run(test_initializer, feed_dict=feed_dict)

    predictions = []
    correlations = []
    for _ in range(len(data)):
      p, c = session.run((prediction_tensor, correlation_tensor),
                         feed_dict={dataset_handle: handle})
      predictions.append(p)
      correlations.append(c)

  logging.info('Correlation: %.4f +/- %.4f',
               np.mean(correlations),
               np.std(correlations))
  return predictions

glassy_dynamics/train_binary.py Normal file

@@ -0,0 +1,64 @@
# Copyright 2019 Deepmind Technologies Limited.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Trains a graph-based network to predict particle mobilities in glasses."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import os
from absl import app
from absl import flags
from glassy_dynamics import train
FLAGS = flags.FLAGS
flags.DEFINE_string(
'data_directory',
'',
'Directory which contains the train and test datasets.')
flags.DEFINE_integer(
'time_index',
9,
'The time index of the target mobilities.')
flags.DEFINE_integer(
'max_files_to_load',
None,
'The maximum number of files to load from the train and test datasets.')
flags.DEFINE_string(
'checkpoint_path',
None,
'Path used to store a checkpoint of the best model.')
def main(argv):
if len(argv) > 1:
raise app.UsageError('Too many command-line arguments.')
train_file_pattern = os.path.join(FLAGS.data_directory, 'train/aggregated*')
test_file_pattern = os.path.join(FLAGS.data_directory, 'test/aggregated*')
train.train_model(
train_file_pattern=train_file_pattern,
test_file_pattern=test_file_pattern,
max_files_to_load=FLAGS.max_files_to_load,
time_index=FLAGS.time_index,
checkpoint_path=FLAGS.checkpoint_path)
if __name__ == '__main__':
app.run(main)

glassy_dynamics/train_test.py Normal file

@@ -0,0 +1,126 @@
# Copyright 2019 Deepmind Technologies Limited.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
"""Tests for train."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
import os
import numpy as np
import tensorflow.compat.v1 as tf
from glassy_dynamics import train
class TrainTest(tf.test.TestCase):
def test_get_targets(self):
initial_positions = np.array([[0, 0, 0], [1, 2, 3]])
trajectory_target_positions = [
np.array([[1, 0, 0], [1, 2, 4]]),
np.array([[0, 1, 0], [1, 0, 3]]),
np.array([[0, 0, 5], [1, 2, 3]]),
]
expected_targets = np.array([7.0 / 3.0, 1.0])
targets = train.get_targets(initial_positions, trajectory_target_positions)
np.testing.assert_almost_equal(expected_targets, targets)
def test_load_data(self):
file_pattern = os.path.join(os.path.dirname(__file__), 'testdata',
'test_small.pickle')
with self.subTest('ContentAndShapesAreAsExpected'):
data = train.load_data(file_pattern, 0)
self.assertEqual(len(data), 1)
element = data[0]
self.assertTupleEqual(element.positions.shape, (20, 3))
self.assertTupleEqual(element.box.shape, (3,))
self.assertTupleEqual(element.targets.shape, (20,))
self.assertTupleEqual(element.types.shape, (20,))
with self.subTest('TargetsGrowAsAFunctionOfTime'):
previous_mean_target = 0.0
# Time index 9 refers to 1/e = 0.36 in the IS, and therefore it is between
# Time index 5 (0.4) and time index 6 (0.3).
for time_index in [0, 1, 2, 3, 4, 5, 9, 6, 7, 8]:
data = train.load_data(file_pattern, time_index)[0]
current_mean_target = data.targets.mean()
self.assertGreater(current_mean_target, previous_mean_target)
previous_mean_target = current_mean_target
class TensorflowTrainTest(tf.test.TestCase):
def test_get_loss_op(self):
"""Tests the correct calculation of the loss operations."""
prediction = tf.constant([0.0, 1.0, 2.0, 1.0, 2.0], dtype=tf.float32)
target = tf.constant([1.0, 25.0, 0.0, 4.0, 2.0], dtype=tf.float32)
types = tf.constant([0, 1, 0, 0, 0], dtype=tf.int32)
loss_ops = train.get_loss_ops(prediction, target, types)
loss = self.evaluate(loss_ops)
self.assertAlmostEqual(loss.l1_loss, 1.5)
self.assertAlmostEqual(loss.l2_loss, 14.0 / 4.0)
self.assertAlmostEqual(loss.correlation, -0.15289416)
def test_get_minimize_op(self):
"""Tests the minimize operation by minimizing a single variable."""
var = tf.Variable([1.0], name='test')
loss = var**2
minimize = train.get_minimize_op(loss, 1e-1)
with self.session():
tf.global_variables_initializer().run()
for _ in range(100):
minimize.run()
value = var.eval()
self.assertLess(abs(value[0]), 0.01)
def test_train_model(self):
"""Tests if we can overfit to a small test dataset."""
file_pattern = os.path.join(os.path.dirname(__file__), 'testdata',
'test_small.pickle')
best_correlation_value = train.train_model(
train_file_pattern=file_pattern,
test_file_pattern=file_pattern,
n_epochs=1000,
augment_data_using_rotations=False,
learning_rate=1e-4,
n_recurrences=2,
edge_threshold=5,
mlp_sizes=(32, 32),
measurement_store_interval=1000)
# The test dataset contains only a single sample with 20 particles.
# Therefore we expect the model to be able to memorize the targets perfectly
# if the model works correctly.
self.assertGreater(best_correlation_value, 0.99)
def test_apply_model(self):
"""Tests if we can apply a model to a small test dataset."""
checkpoint_path = os.path.join(os.path.dirname(__file__), 'checkpoints',
't044_s09.ckpt')
file_pattern = os.path.join(os.path.dirname(__file__), 'testdata',
'test_large.pickle')
predictions = train.apply_model(checkpoint_path=checkpoint_path,
file_pattern=file_pattern,
time_index=0)
data = train.load_data(file_pattern, 0)
targets = data[0].targets
correlation_value = np.corrcoef(predictions[0], targets)[0, 1]
self.assertGreater(correlation_value, 0.5)
if __name__ == '__main__':
tf.test.main()