Run Remo on cloud server


I have been able to run Remo on a cloud server, using ngrok to make it accessible from anywhere so that all my colleagues can annotate the images we use in our project. Now my question is: is there any way to load images that are already on the server into a dataset? I couldn’t find a way to do this, so I had to upload the whole dataset from my computer.

Also, let me give you some suggestions for future versions:

  1. More documentation about how to configure the app on a server, including a better explanation of the remo.json file.
  2. Add the possibility to apply data augmentation to the datasets.

Hi @Juani, welcome to the forum!

You can load images in Remo from the server in two ways at the moment:

1- using the Python library from within the server itself. E.g. see this:

2- adding a path to the local directories / files from the UI:

  • From within a dataset, click “Add data”
  • Then select the “Use local data” tab
  • Then enter the path(s) in the input box

With either option, Remo will link to the data on your server rather than copy it.
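As a rough sketch of the first option, the snippet below gathers the image paths under a server directory and passes them to Remo. It assumes the Remo Python library exposes `create_dataset` with a `local_files` argument; please check the library docs for your version. The helper names, the extension list, and the example path are hypothetical.

```python
from pathlib import Path

# Assumption: a typical set of image extensions; adjust to your data.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".tif", ".tiff"}


def collect_image_paths(root):
    """Return sorted absolute paths of all image files under root (recursive)."""
    root_path = Path(root).expanduser().resolve()
    return sorted(
        str(path)
        for path in root_path.rglob("*")
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS
    )


def link_server_images(dataset_name, root):
    """Create a Remo dataset linked to images that already live on the server."""
    # Imported here so collect_image_paths works even where Remo is not installed.
    import remo

    paths = collect_image_paths(root)
    # Assumption: create_dataset accepts local_files and links to the files
    # in place instead of uploading them.
    return remo.create_dataset(name=dataset_name, local_files=paths)
```

Run on the server itself, a call such as `link_server_images("my dataset", "/data/images")` (hypothetical name and path) should then avoid uploading anything from your own computer.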

Thanks a lot for the suggestions!

Re documentation - great point.

We didn’t expand on it because we are not explicitly supporting running Remo on cloud servers yet. We actually only enabled ngrok to support the Colab use case. But as you found out, it works :slight_smile:

Since you are using it, we will expand the explanation of running on a server with ngrok, and of the config file, in the next release - will keep you posted.

We will also be releasing a paid version with official support at some point. It will add user authentication (so you can control who accesses your data), faster loading (no need to tunnel with ngrok), and some other features.

Re data augmentation:

We actually thought about it, but decided not to do it for now because augmentation is usually done in memory and not saved to disk, whereas Remo is meant to deal with the core data (it links to the actual files).
Why would you want augmentation in Remo? To avoid writing the code, or to be able to visualize the augmented data?
And would you be ok if Remo saved the augmented data to storage?
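To illustrate what “in memory” means here, this is a minimal sketch using NumPy only: each image is randomly flipped at load time, and neither the original array nor any file on disk is modified. The function name and the flip probability are illustrative choices, not part of Remo.

```python
import numpy as np


def augment_batch(images, rng):
    """Return randomly flipped copies of the images; the inputs stay unchanged."""
    augmented = []
    for image in images:
        out = image.copy()
        # Flip left-right (along the width axis) with probability 0.5.
        if rng.random() < 0.5:
            out = np.fliplr(out)
        augmented.append(out)
    return augmented


rng = np.random.default_rng(seed=0)
# A tiny 2x2 RGB "image" standing in for real data.
image = np.arange(12, dtype=np.uint8).reshape(2, 2, 3)
batch = augment_batch([image], rng)
```

Because the augmented copies exist only for the duration of a training step, there is nothing on storage for Remo to link to unless they are explicitly saved.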

We were planning on enabling the creation of some datasets for self-supervised pretext tasks instead, as those are usually done on storage from what I have seen.