What does roll do in numpy and pytorch?¶

Rolling in numpy or pytorch moves the data in a circle on a particular axis. Let's look at some examples.

import numpy as np
import torch

Roll in Numpy¶

Suppose we have a 6x8 matrix in numpy. We can initialize it for clarity like so.

state = np.zeros((6, 8))
for i in range(6):
    for j in range(8):
        state[i][j] = j
state

array([[0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.]])

We can then roll it leftward (indicated by -1) along the horizontal axis (axis 1). The zeroes which would have been shifted off the beginning of each array are now at the end.

state = np.roll(state, -1, axis=1); state

array([[1., 2., 3., 4., 5., 6., 7., 0.],
       [1., 2., 3., 4., 5., 6., 7., 0.],
       [1., 2., 3., 4., 5., 6., 7., 0.],
       [1., 2., 3., 4., 5., 6., 7., 0.],
       [1., 2., 3., 4., 5., 6., 7., 0.],
       [1., 2., 3., 4., 5., 6., 7., 0.]])

Why might we want to do this? Suppose this is some sort of time series data and we are getting rid of the oldest elements, which are kept on the left. We could then replace the newest fields with the latest data like so:

for i in range(6):
    state[i][-1] = 8
state

array([[1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.]])

The zeroes were the oldest, so we got rid of them, and now we have the newest "8" data on the right.

I've seen this done sometimes in reinforcement learning settings, where someone wants to put together state data with time frames.

You can see an example of it using numpy here.

Roll in Pytorch¶

We can do something similarly in pytorch. Let's reininitialize the state as a pytorch tensor and see an example.

state = torch.zeros((6, 8))
for i in range(6):
    for j in range(8):
        state[i, j] = j
state

tensor([[0., 1., 2., 3., 4., 5., 6., 7.],
        [0., 1., 2., 3., 4., 5., 6., 7.],
        [0., 1., 2., 3., 4., 5., 6., 7.],
        [0., 1., 2., 3., 4., 5., 6., 7.],
        [0., 1., 2., 3., 4., 5., 6., 7.],
        [0., 1., 2., 3., 4., 5., 6., 7.]])

The syntax is slightly different, but the effect is the same.

state = torch.roll(state, -1, dims=-1)
state

tensor([[1., 2., 3., 4., 5., 6., 7., 0.],
        [1., 2., 3., 4., 5., 6., 7., 0.],
        [1., 2., 3., 4., 5., 6., 7., 0.],
        [1., 2., 3., 4., 5., 6., 7., 0.],
        [1., 2., 3., 4., 5., 6., 7., 0.],
        [1., 2., 3., 4., 5., 6., 7., 0.]])

Similarly, if we were working with time series state data, we could replace the last index with something new.

for i in range(6):
    for j in range(8):
        state[i][-1] = 8
state

tensor([[1., 2., 3., 4., 5., 6., 7., 8.],
        [1., 2., 3., 4., 5., 6., 7., 8.],
        [1., 2., 3., 4., 5., 6., 7., 8.],
        [1., 2., 3., 4., 5., 6., 7., 8.],
        [1., 2., 3., 4., 5., 6., 7., 8.],
        [1., 2., 3., 4., 5., 6., 7., 8.]])

Here is a pytorch implementation example of the previous code, doing the same thing to the state:

Rolling Manually in numpy¶

I've also seen folks use a similar strategy, but using manual indexing. Here is an example of that in numpy.

state = np.zeros((6, 8))
for i in range(6):
    for j in range(8):
        state[i][j] = j
state

array([[0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.],
       [0., 1., 2., 3., 4., 5., 6., 7.]])

Instead of rolling, we just replace part of the dataset with a different part of the dataset. Notice how without the roll function, the zeroes don't end up on the end.

state[:,:7] = state[:,1:]
state

array([[1., 2., 3., 4., 5., 6., 7., 7.],
       [1., 2., 3., 4., 5., 6., 7., 7.],
       [1., 2., 3., 4., 5., 6., 7., 7.],
       [1., 2., 3., 4., 5., 6., 7., 7.],
       [1., 2., 3., 4., 5., 6., 7., 7.],
       [1., 2., 3., 4., 5., 6., 7., 7.]])

If we are just going to update those fields anyway, however, perhaps we don't care about that.

for i in range(6):
    for j in range(8):
        state[i][-1] = 8
state

array([[1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.],
       [1., 2., 3., 4., 5., 6., 7., 8.]])

Here's a reinforcement example of formatting the state data using the manual strategy.

What does roll do in numpy and pytorch?¶

Roll in Numpy¶

Roll in Pytorch¶

Rolling Manually in numpy¶

Archive