Difference between revisions of "Cloud Setup"
(→Rclone) |
|||
Line 207: | Line 207: | ||
None of your applications know that these files have two different real file paths. All new files would always be written to your local HD, then you can use Rclone to sync them to the cloud later. When that sync is completed, its invisible to your apps so you do not need to worry about missing episodes or movies. | None of your applications know that these files have two different real file paths. All new files would always be written to your local HD, then you can use Rclone to sync them to the cloud later. When that sync is completed, its invisible to your apps so you do not need to worry about missing episodes or movies. | ||
− | ==== Setting up | + | ==== Setting up Mergerfs ==== |
− | + | Mergefs has a dependency on Fuse, so we will start by installing that: | |
− | + | sudo apt install fuse | |
+ | Next, we must modify the <code>fuse.conf</code> file to enable the <code>allow_other</code> funtion: | ||
+ | sudo nano /etc/fuse.conf | ||
+ | Modify the last line by removing the # before <code>user_allow_other</code>. It should look like this: | ||
+ | # Allow non-root users to specify the allow_other or allow_root mount options. | ||
+ | user_allow_other | ||
+ | Now we can install Mergerfs. Go to the [https://github.com/trapexit/mergerfs/releases Mergerfs github page] and download the newest release that matches your install. | ||
+ | wget https://github.com/trapexit/mergerfs/releases/download/2.32.0/mergerfs_2.32.0.ubuntu-focal_amd64.deb | ||
+ | And then install it: | ||
+ | dpkg -i mergefs*.deb | ||
− | <code> / | + | To execute Mergerfs, run the following command. This assumes your local storage is mounted at <code>/local</code>, your cloud storage is mounted at <code>/cloud</code>, and your merged mount is <code>/merge</code>. |
+ | /usr/local/bin/mergerfs /local:/cloud /merge -o rw,use_ino,allow_other,func.getattr=newest,category.action=all,category.create=ff,cache.files=auto-full,statfs_ignore=nc,func.getattr=newest,dropcacheonclose=true,nonempty | ||
== Setup == | == Setup == |
Revision as of 16:51, 4 December 2020
Rclone, Mergerfs, and Google Drive
With many options for storing 'unlimited' data in the cloud, configuring your system to allow your Arr's and media serving apps to access the data securely is possible. For this guide we will use Google Suite Unlimited Storage or a similar Rclone compatible storage service. We will also use a utility called Mergerfs to help keep our uploads organized.
Please note that this guide is for information only and you should only store legally obtained media.
Please Note: To get unlimited storage with Google Drive, you should have 5 users (as of writing $12*5 users = $60/mo). If you have less users you technically do not have unlimited storage and Google may enforce said limits at any time.
Application Overview
Recommendations:
- Don't download into your Gdrive - Download to a local disk and move the data later w/ Rclone
- Don't import to your Gdrive - Set up a merged local cache disk and move the data later w/ Rclone
- Do all large writes locally
- Move to cloud on a schedule - Easily scriptable
- Absolutely do not write (large files) directly to the rclone mount
To connect with your Gdrive, you will need to use two applications:
- Rclone - https://rclone.org/install/
- mergerfs - https://github.com/trapexit/mergerfs
Rclone
Rclone's core purpose for the Arr's is to allow access to your data depository that is located in Gdrive. Rclone will mount and serve your file automatically, with your apps not knowing they are accessing data in the cloud. More Rclone information can be found on animosity22's Github page.
To install Rclone, run the following command:
curl https://rclone.org/install.sh | sudo bash
Setting up Rclone
To get started with an Rclone Google Suite Team Drive Mount follow the below instructions.
Making your own Google API Client ID When you use rclone with Google drive in its default configuration you are using rclone's client_id. This is shared between all the rclone users. There is a global rate limit on the number of queries per second that each client_id can do set by Google. It is strongly recommended to use your own client ID as the default rclone ID is heavily used.
- Log into the Google API Console with your Google account. It doesn't matter what Google account you use. (It need not be the same account as the Google Drive you want to access)
- Select a project or create a new project.
- Under "ENABLE APIS AND SERVICES" search for "Drive", and enable the "Google Drive API".
- Click "Credentials" in the left-side panel (not "Create credentials", which opens the wizard), then "Create credentials"
- If you already configured an "Oauth Consent Screen", then skip to the next step; if not, click on "CONFIGURE CONSENT SCREEN" button (near the top right corner of the right panel), then select "External" and click on "CREATE"; on the next screen, enter an "Application name" ("rclone" is OK) then click on "Save" (all other data is optional). Click again on "Credentials" on the left panel to go back to the "Credentials" screen.
- Click on the "+ CREATE CREDENTIALS" button at the top of the screen, then select "OAuth client ID".
- Choose an application type of "Desktop app" if you using a Google account or "Other" if you using a GSuite account and click "Create". (the default name is fine)
- It will show you a client ID and client secret. Write or copy these values down. Use these values in rclone config to add a new remote or edit an existing remote.
Setup your Google Service Account (SA) file this allows it to not be tied to a single user account.
- go to the Google Developer Console.
- go to "IAM & admin" -> "Service Accounts".
- Use the "Create Credentials" button. Fill in "Service account name" with something that identifies your client. e.g.
mount
Leave "Role" Empty
- Tick "Furnish a new private key" - select "Key type JSON".
- Tick "Enable G Suite Domain-wide Delegation". These credentials are what rclone will use for authentication. If you ever need to remove access, press the "Delete service account key" button.
Allow API access to Google Drive
- go to admin console
- Go into "Security" (or use the search bar)
- Select "Show more" and then "Advanced settings"
- Select "Manage API client access" in the "Authentication" section
- In the "Client Name" field enter the service account's "Client ID" - this can be found in the Developer Console under "IAM & Admin" -> "Service Accounts", then "View Client ID" for the newly created service account. It is a ~21 character numerical string.
- In the next field, "One or More API Scopes", enter https://www.googleapis.com/auth/drive to grant access to Google Drive specifically.
rclone config
n
- New Remote- Enter a name for your mount e.g.
cloud
13
- Google Drive- Enter your Google Application Client Id
- Enter your Google Application Client Secret
1
- Full Access- Leave Blank ID of the root folder
- enter the path and filename to your Google Drive SA Json
n
- Do Not Use Auto Configy
- use team drive- Review for Accuracy
- mount the drive using
rclone mount --daemon --daemon-timeout=5m --allow-non-empty --buffer-size=128M --use-mmap --dir-cache-time=48h --cache-info-age=48h --vfs-cache-mode=full --vfs-cache-max-age=6h --log-file=~/.config/rclone/logs/rclone.log --log-level INFO cloud: /cloud
When you executed rclone config
above, it created a file rclone.conf
and placed it at ~/.config/rclone.conf
. If you intend to have your OS auto-mount your new rclone mount using systemd, ensure this file is accessible by the user you have in your .conf file.
Encrypting your Cloud Storage
If you followed the previous steps, you created an unencrypted Google Drive mount. If you would like to encrypt your storage so that Google can not scan your files (good idea!), then follows these optional steps:
Start by running Rclone's config tool again, as you did previously:
rclone config
n
- New Remotecloudcrypt
- Name of your new encrypted remotecrypt
- Encrypt/Decrypt a remotecloud:crypt
- Remote to encrypt/decrypt, followed by a semicolon and a name indicating the function.standard
- How to encrypt the filenames.true
- Encrypt directory names
The next few questions will ask you to create two passwords, one for the encryption and one for the salt. It is recommended to choose two different passwords, and for the best protection, allow them to be generated for you and document them somewhere safe.
Once the password section is finished, it should present you with the new config, which should look similar to this:
[cloudcrypt] type = crypt remote = cloud:crypt filename_encryption = standard directory_name_encryption = true password = **Encrypted PW1** password2 = **Encrypted PW2**
Finally, make sure when you run rclone (either manually or via autostart script), you mount cloudcrypt
rather than calling cloud
.
Rclone Linux Autostart using Systemd
Most modern Linux distributions have switched to systemd, which involves a simple unit service file which gets enabled and started. It is important to remember that in Linux, capitalization matters. User
account names and Group
names are typically all lowercase, as are the directory structures mapped to them as part of the home directory.
Preparing the Unit Service File
Several items may need to be changed to match your installation:
User
should match the service account that Rclone will run as.Group
should match the service account group that Rclone will run as.
The unit service file should be named Rclone.service
and the best place for it is /etc/systemd/system/
. Alternative locations like /usr/lib/systemd/system/
and /lib/systemd/system/
may depend on the distribution used.
This example unit assumes that the User
and Group
are both plex
, Rclone's executable is placed in /usr/bin/
, your cloud storage is being mounted to /cloud
, and you have previously ran Rclone and set up a config file, which is located at ~/.config/rclone/rclone.conf
Feel free to remove the # comment
lines, they are there for those who want to know what those flags are doing and were taken from the rclone documentation.
[Unit] Description=Google Drive Encrypted (rclone) Wants=network-online.target After=network-online.target [Service] Type=notify ExecStart=/usr/bin/rclone mount cloud: /cloud \ # Allow mounting over a non-empty directory (not Windows). --allow-non-empty \ # In memory buffer size when reading files for each --transfer. (default 16M) --buffer-size=128M \ # Use mmap allocator (see docs). --use-mmap \ # Time to cache directory entries for. (default 5m0s) --dir-cache-time=48h \ # How long to cache file structure information (directory listings, file size, times etc). (default 6h0m0s) --cache-info-age=48h \ # Cache mode off|minimal|writes|full (default off) --vfs-cache-mode=full \ # Max age of objects in the cache. (default 1h0m0s) --vfs-cache-max-age=6h \ # Override the permission bits set by the filesystem. --umask 002 \ # Location where rclone logs will be stored --log-file=~/.config/rclone/logs/rclone.log \ # Logging level for rclone logs --log-level INFO \ # Location where the rclone.conf file is located --config ~/.config/rclone/rclone.conf ExecStop=/bin/fusermount -uz /cloud Restart=on-abort User=plex KillMode=none RestartSec=5 [Install] WantedBy=default.target
Verify Directory Permissions
Ensure the User
that will be running Rclone has access to both the executable directory and data directory.
Running the command ls -lad /directory/
will show the permissions for that folder. In the example below, the user and group are both plex
.
Example:
$ ls -lad /cloud drwxr-xr-x 6 plex plex 24576 Nov 28 21:30 /cloud $ ls -lad /home/plex/.config/rclone/ drwxr-xr-x 6 plex plex 24576 Nov 28 21:30 /home/plex/.config/rclone/
Enabling and starting the Unit Service File
Once you have created Rclone.service
, you will want to enable the service:
sudo systemctl enable Rclone.service
You are now ready to start Rclone. You can do so with the start
command:
sudo systemctl start Rclone.service
If you want to verify Rclone is running, you can run the status
command:
$ sudo systemctl status rclone.service ● rclone.service - Google Drive Encrypted (rclone) Loaded: loaded (/etc/systemd/system/rclone.service; enabled; vendor preset: enabled) Active: active (running) since Sun 2020-11-15 16:28:26 MST; 2 weeks 4 days ago Main PID: 2392484 (rclone) Tasks: 31 (limit: 76935) Memory: 21.9G CGroup: /system.slice/rclone.service └─2392484 /usr/bin/rclone mount cloud: /cloud --allow-non-empty --buffer-size=128M --use-mmap --dir-cache-time=48h > Nov 15 16:28:25 systemname systemd[1]: Starting Google Drive Encrypted (rclone)... Nov 15 16:28:26 systemname systemd[1]: Started Google Drive Encrypted (rclone).
Mergerfs
Mergerfs core purpose for the Arr's is to allow you to have a single merged directory structure, based on two different directories. For example, if you used rclone to mount a directory named /cloud
and you have a locally mounted directory named /local
, mergerfs
would create a single merged directory of those two, which you could call /merge
. Your media server and Arrs would access /merge
and see the combination of local + cloud storage.
Example: You have the first 4 episodes of 'Popular TV Show' located on your cloud drive, and Sonarr just downloaded and imported episode 5. If you looked in your cloud drive (/cloud), you would see:
/cloud/TV Shows/Popular TV Show/S01E01.mkv /cloud/TV Shows/Popular TV Show/S01E02.mkv /cloud/TV Shows/Popular TV Show/S01E02.mkv /cloud/TV Shows/Popular TV Show/S01E04.mkv
If you looked in your local storage (/local) you would see:
/local/TV Shows/Popular TV Show/S01E05.mkv
And if you looked in the merged directory thanks to mergerfs (/merge) you would see:
/merge/TV Shows/Popular TV Show/S01E01.mkv /merge/TV Shows/Popular TV Show/S01E02.mkv /merge/TV Shows/Popular TV Show/S01E02.mkv /merge/TV Shows/Popular TV Show/S01E04.mkv /merge/TV Shows/Popular TV Show/S01E05.mkv
None of your applications know that these files have two different real file paths. All new files would always be written to your local HD, then you can use Rclone to sync them to the cloud later. When that sync is completed, its invisible to your apps so you do not need to worry about missing episodes or movies.
Setting up Mergerfs
Mergefs has a dependency on Fuse, so we will start by installing that:
sudo apt install fuse
Next, we must modify the fuse.conf
file to enable the allow_other
funtion:
sudo nano /etc/fuse.conf
Modify the last line by removing the # before user_allow_other
. It should look like this:
# Allow non-root users to specify the allow_other or allow_root mount options. user_allow_other
Now we can install Mergerfs. Go to the Mergerfs github page and download the newest release that matches your install.
wget https://github.com/trapexit/mergerfs/releases/download/2.32.0/mergerfs_2.32.0.ubuntu-focal_amd64.deb
And then install it:
dpkg -i mergefs*.deb
To execute Mergerfs, run the following command. This assumes your local storage is mounted at /local
, your cloud storage is mounted at /cloud
, and your merged mount is /merge
.
/usr/local/bin/mergerfs /local:/cloud /merge -o rw,use_ino,allow_other,func.getattr=newest,category.action=all,category.create=ff,cache.files=auto-full,statfs_ignore=nc,func.getattr=newest,dropcacheonclose=true,nonempty
Setup
Set your download client to download to say /merge/usenet/{tv|movies}
and your library to say /merge/media/{TV|Movies}
.
Then what happens is the download is local, the import is local. And it all looks like it is in the same place.
In the background, you have a cron or systemd timer that does rsync move from your local storage to your cloud storage.
You can rotate service accounts if needed.
It is also way more efficient at uploading vs. just the rclone mount.
The cloud mount allows sonarr/radarr to delete, rename and read stuff, but big writes don't need to go right to cloud.
If you do get rate limited, it won't impact your imports because they're all local.
/local/media/{tv|movies}
//this is where ARR imports to.
/local/{usenet|torrents}/{tv/movies}
// this is where downloads should be dropped to
/cloud/
//this is your rclone crypt mount (if going crypt) or GDrive Mount /cloud/media //this is your media folder in Gdrive
/merge/
//is /cloud merged into /local/ using mergerfs
/merge/media/{tv|movies}
//point plex and ARRs here as library/root folder
and setup remote path in ARR to say remote /local/ is the same as remote /merge/
Files will be downloaded into local and then imported into /local/media/ on a scheduled basis an rclone job would be created to move from local to gdrive so they will then automatically appear in cloud and it'll be like they never left merge
Recommended Plex Server Changes
Increase the Default Cache Size of your Plex DB
With unlimited storage, some servers may run into database locking/timeout issues. Increasing the default cache size could help alleviate this.
- Stop Plex
- Locate your Plex DB. cd plex/Library/Application Support/Plex Media Server/Plug-in Support/Databases
- Run this command:
sqlite3 com.plexapp.plugins.library.db
- The SQLLite prompt will open up. Type this next:
PRAGMA default_cache_size = 6000000;
and don't forget the trailing semicolon. - There will be no confirmation that anything happened. Press CTRL + D to exit SqlLite.
- Start Plex.