Admin basics

1. Starting up

To start the SVN MultiSite Plus replicator, follow these steps:

Open a terminal window on the server and log in with suitable file permissions.

Locate the svn-multisite service script in the /etc/init.d folder:

lrwxrwxrwx  1 root root    37 May  9 10:37 svn-multisite -> /opt/svn-multisite-plus/bin/svn-multisite

Run the start script:

[root@localhost init.d]#  ./svn-multisite start


20130520-164811 (24088) [INFO]: Starting WANdisco MultiSite Plus
20130520-164811 (24088) [INFO]: Started replicator (24100)
20130520-164811 (24088) [INFO]: Started ui (24110)
20130520-164811 (24088) [INFO]: Number of errors: 0
20130520-164811 (24088) [INFO]: Number of warnings: 0

The two components of SVN MultiSite Plus, the replicator and the UI, will start up. For more on the script's other commands, see section 3, Using the init.d script.
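The summary lines at the end of the start-up output lend themselves to a scripted check. The following is a minimal sketch, not part of the product: the function name summary_counts is hypothetical, and it assumes the output keeps the "Number of errors:" and "Number of warnings:" format shown above.

```shell
#!/bin/sh
# Hypothetical helper: reads svn-multisite output on stdin and prints
# "<errors> <warnings>" taken from the two summary lines.
# Prints "unknown" for a count whose summary line is missing.
summary_counts() {
    awk 'BEGIN { e = "unknown"; w = "unknown" }
         /Number of errors:/   { e = $NF }
         /Number of warnings:/ { w = $NF }
         END { print e, w }'
}
```

For example, piping the start-up output shown above through summary_counts would print "0 0". Note that the pipe consumes the log lines, so save them first if you want to keep them.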

2. Shutting down

To shut down:

Open a terminal window on the server and log in with suitable file permissions.

Locate the svn-multisite service script in the /etc/init.d folder:

lrwxrwxrwx  1 root root    37 May  9 10:37 svn-multisite -> /opt/svn-multisite-plus/bin/svn-multisite

Run the stop script, i.e.:

[wandisco@ip-10-0-100-7 bin]$  ./svn-multisite stop

20130520-165704 (24767) [INFO]: Stopping WANdisco MultiSite Plus
20130520-165704 (24767) [INFO]: Request received to shut down replicator
20130520-165704 (24767) [INFO]: replicator processes ended
20130520-165704 (24767) [INFO]: Request received to shut down ui
20130520-165704 (24767) [INFO]: Sending signal 15 to watched ui process (attempt 1)...
20130520-165707 (24767) [INFO]: Sending signal 15 to watched ui process (attempt 2)...
20130520-165710 (24767) [INFO]: ui processes ended
20130520-165710 (24767) [INFO]: Number of errors: 0
20130520-165710 (24767) [INFO]: Number of warnings: 0

Both the replicator and the UI processes will shut down. For more on the script's other commands, see section 3, Using the init.d script.

3. Using the init.d script

The 'start-up' script for persistent running of SVN MultiSite Plus can be found in the /etc/init.d folder. Run the script with the help command to list the available commands:

[root@localhost init.d]# ./svn-multisite help
usage: ./svn-multisite-plus (start|stop|restart|force-reload|status|uistart|uistop|repstart|repstop)

start         Start SVN MultiSite Plus services
stop          Stop SVN MultiSite Plus services
restart       Restart SVN MultiSite Plus services
force-reload  Restart SVN MultiSite Plus services
status        Show the status of SVN Multisite Plus services
uistart       Start the SVN MultiSite Plus User Interface
uistop        Stop the SVN MultiSite Plus User Interface
repstart      Start the SVN Multisite Plus Replicator
repstop       Stop the SVN Multisite Plus Replicator
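If you call the script from your own automation, a thin wrapper can guard against mistyped commands. This is a sketch under assumptions: the function name svn_ms and the variable SVN_MS_SCRIPT are hypothetical, and the script path is taken from the listing in section 1.

```shell
#!/bin/sh
# Path from the listing in section 1; override SVN_MS_SCRIPT if yours differs.
SVN_MS_SCRIPT=${SVN_MS_SCRIPT:-/etc/init.d/svn-multisite}

# Hypothetical wrapper: pass only the commands listed in the help output
# through to the init.d script, and reject anything else with status 2.
svn_ms() {
    case "$1" in
        start|stop|restart|force-reload|status|uistart|uistop|repstart|repstop)
            "$SVN_MS_SCRIPT" "$1"
            ;;
        *)
            echo "unknown command: $1" >&2
            return 2
            ;;
    esac
}
```

With this in place, restarting only the UI could be written as: svn_ms uistop && svn_ms uistart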

4. Changing the admin console password

You can change SVN MultiSite Plus's login password at any time by following this procedure:

Log in to the MultiSite admin console.

Click on the Settings tab.

At the top of the settings screen is the password change form. Enter the current password, along with a new password.

Click the SAVE button to store the new password. The new password has been accepted if a notification message appears at the bottom of the screen.

** Alert! ** Changing Username
It's currently not possible to change the Administration username. In order to change the username you would need to re-install SVN MultiSite Plus.

5. Updating your license.key file

Follow this procedure if you ever need to change your product license. You would need to do this if, for example, you needed to increase the number of SVN users or the number of replication nodes.

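Navigate to the replicator's properties directory:

/opt/wandisco/svn-multisite-plus/replicator/properties

Rename the existing license.key, for example to license.20130625. The file to rename is marked in this listing: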

total 16
-rw-r--r-- 1 wandisco wandisco 1183 Dec  5 15:58 application.properties
-rw-r--r-- 1 wandisco wandisco  512 Dec  5 15:05 license.key <-
-rw-r--r-- 1 wandisco wandisco  630 Dec 17 15:43 logger.properties
-rw-r--r-- 1 wandisco wandisco  747 Dec  4 10:31 svnok.catalog

Get your new license.key and drop it into the same /opt/wandisco/svn-multisite-plus/replicator/properties directory.
Restart the replicator by running the SVN MultiSite Plus script with the following argument:
```
/etc/init.d/svn-multisite-plus restart
```
This will trigger an SVN MultiSite Plus replicator restart, which will force SVN MultiSite Plus to pick up the new license file and apply any changes to permitted usage.
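The rename-and-replace steps can be sketched as a small script. This is an illustration only: the function name swap_license is hypothetical, and it assumes you pass in the properties directory and the path of the new license file yourself.

```shell
#!/bin/sh
# Hypothetical helper: archive the current license.key under a dated name,
# then copy the new key into place.
#   $1 = properties directory, $2 = path to the new license file
swap_license() {
    props_dir=$1
    new_key=$2
    stamp=$(date +%Y%m%d)    # produces a suffix in the style of 20130625
    [ -f "$new_key" ] || { echo "no such file: $new_key" >&2; return 1; }
    if [ -f "$props_dir/license.key" ]; then
        mv "$props_dir/license.key" "$props_dir/license.$stamp" || return 1
    fi
    cp "$new_key" "$props_dir/license.key"
}
```

The function deliberately stops short of restarting the replicator, so that you can inspect the directory first and then run the restart command given above.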

** Tip ** If you don't restart
If you follow the above instructions but don't restart, SVN MultiSite Plus will continue to run with the old license until it performs its daily license validation (which runs at midnight). Provided that your new license key file is valid and has been put in the right place, SVN MultiSite Plus will then update its license properties without the need to restart.

If you run into problems, check the replicator logs (/opt/svn-multisite-plus/replicator/logs) for more information.

PANIC: License is invalid com.wandisco.fsfs.licensing.LicenseException: Failed to load filepath>

6. Update a node's properties

In the System Data section of the Settings tab there's a bank of editable properties that can be quickly updated by re-entering, saving and allowing the SVN MultiSite replicator to restart - although this may cause brief disruption to users whose in-flight commits will fail.

Node properties that you can change -- subject to a restart of the replicator.

Node Name: This is the human-readable form of the node's ID. Unlike the Node ID, the Node Name can be changed, and a name can be reused once the node that held it has been removed from the replication network. That is, you can't have two nodes with the same name, but you can reuse a previously removed node's name.
Location Longitude: The node's geographical location is no longer recorded during installation. Instead you enter the details here.
Location Latitude: Along with Longitude, this value places the node on the internal map and helps the application determine the local time for the node based on the timezone in which it falls.
Hostname / IP Address: The hostname or underlying IP address can be updated.
DConE Port: The TCP port used for DConE agreement traffic - not to be confused with the Content Distribution port which carries the payload repository data.

After entering a new value, click the Save button. A growl message will appear to confirm that the change is being replicated - this will result in a restart of the replicator which may cause brief disruption to SVN users.

Other property changes

To change the Content Delivery port

content.port.<Node id>=<new port>

Once the file is in place, run the following command (on all the nodes except the one you have changed):

 java -jar svn-ms-replicator-updateinetaddress.jar -c <path to application.properties>

Go back to the node with the updated properties and restart MultiSite.

Log in to the updated node and check its System Data (at the bottom of the Settings tab). Then do some test commits to ensure that replication continues successfully.

Node Content Distribution Timeouts

There are two configurable properties that you can modify as part of fine-tuning an SVN MultiSite Plus deployment. They let you balance best possible performance against tolerance of poor WAN connectivity. Both properties are contained within the application properties file, by default located here: /opt/wandisco/svn-multisite-plus/replicator/properties/application.properties.

socket.timeout

socket.timeout=90000

The socket.timeout is the amount of time, in milliseconds, that the local node will wait for a connection to be established before throwing an exception, signalling that it failed to connect within that timeout. The default value is 90 seconds (90,000 milliseconds).

** Alert! ** Not less than 60 seconds!
DO NOT set socket.timeout to less than 60 seconds (60,000 milliseconds) or you may encounter problems.
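Before editing the properties file you may want to check a proposed value against the 60,000 millisecond floor. A minimal sketch, assuming a hypothetical function name socket_timeout_ok:

```shell
#!/bin/sh
# Hypothetical pre-flight check: succeed only if the proposed
# socket.timeout (in milliseconds) is a whole number at or above
# the 60,000 ms floor stated above.
socket_timeout_ok() {
    case "$1" in
        ''|*[!0-9]*) return 1 ;;   # reject empty or non-numeric input
    esac
    [ "$1" -ge 60000 ]
}
```

For example, socket_timeout_ok 90000 succeeds for the default value, while socket_timeout_ok 30000 fails.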

content.pull.timeout

content.pull.timeout=300000

The content pull timeout sets how long the Content Distribution system will wait for new content to be pulled fully over from a remote node. The default value is 5 minutes (300,000 milliseconds). This default is set on the assumption that there are no problems with the deployment's WAN connectivity.

Increasing the timeout

Increasing the value may help if poor connectivity is resulting in the replicator repeatedly giving up on content distribution that would eventually have transferred had it been given enough time, i.e. where the failure is the result of a slow network rather than of something that has caused a permanent error.

Decreasing the timeout

Decreasing the value is not generally recommended. Doing so is not intended as a method for boosting performance - although this may occur in some situations. We recommend that you don't drop the timeout value below 5000 (5 seconds) without consulting with our support team.
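Because both timeouts are stored in milliseconds, it can help to read them back in seconds when reviewing a deployment. The helpers below are a sketch: get_prop and ms_to_s are hypothetical names, and the parsing assumes simple key=value lines with no spaces around the equals sign.

```shell
#!/bin/sh
# Hypothetical helper: print the value of key $2 from properties file $1.
# The last occurrence wins; an empty line is printed if the key is absent.
get_prop() {
    awk -v k="$2" 'index($0, k "=") == 1 { v = substr($0, length(k) + 2) }
                   END { print v }' "$1"
}

# Convert a millisecond value to whole seconds.
ms_to_s() {
    echo $(( $1 / 1000 ))
}
```

With the default settings shown above, ms_to_s "$(get_prop application.properties content.pull.timeout)" would print 300, that is, 5 minutes.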

7. Setting up data monitoring

The Monitoring Data tool monitors the disk usage of SVN MultiSite Plus's database directory, providing a basic level of protection against SVN MultiSite Plus consuming all disk space. The tool also lets you set up your own monitors for user-selected resources.

** Alert! ** Monitoring Data - not intended as a final word in system protection
Monitoring Data is no substitute for dedicated, system-wide monitoring tools. Instead, it is intended to be a 'last stand' against possible disk space exhaustion that could lead to data loss or corruption.

Read our Recommendations for system-wide monitoring tools.

Default settings

Click the "View" link to go to a monitor's settings.

By default MultiSite's database directory (/opt/wandisco/svn-multisite-plus/replicator/database) is monitored. This is the location of MultiSite's Prevayler database, where all data and transaction files for replication are stored.

This built-in monitor runs on all nodes. Any additional monitors that you set up will monitor on a per-node basis. Monitors are not replicated so a monitor set up on one node is not applied to any other node.

Additional monitors

As well as SVN MultiSite Plus's own database folder, there are a number of other directories that could in certain circumstances grow very large and potentially consume all available file space.

MultiSite directories that it may be worth monitoring:

/opt/wandisco/svn-multisite-plus/replicator/content

/opt/wandisco/svn-multisite-plus/logs

/opt/wandisco/svn-multisite-plus/replicator/logs

Other directories that should be monitored:

/path/to/authz

If you are using Authz to manage authorization and your Authz file is situated on a different file system from SVN MultiSite Plus, we recommend that you set up monitoring of the authz file.

For most deployments all these directories will reside on the same file system, so that our default monitor would catch if any of them were consuming the available space. However, there are two scenarios where we'd recommend that you set up your own monitor for the content directory:

1) You wish to set a higher trigger amount than the default monitor (1GiB for warning, 0.09GiB for emergency shutdown).
2) You have placed the content directory on a different filesystem with its own capacity that wouldn't be tracked by the default monitor.

In either case you should follow up the setting up of a monitor with a corresponding email notification that will be sent if some or all of your monitor's trigger conditions are met.
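To see where a directory currently stands against the default thresholds before you create a monitor, you could run a quick check from the shell. This sketch does not configure anything in SVN MultiSite Plus; the function names free_kib and classify_free are hypothetical, and the defaults of 1 and 0.09 GiB are the built-in monitor's values quoted above.

```shell
#!/bin/sh
# Hypothetical helper: print the free space, in KiB, of the file system
# that holds path $1 (column 4 of POSIX-format df output).
free_kib() {
    df -Pk "$1" | awk 'NR == 2 { print $4 }'
}

# Classify a free-space figure (KiB) against warning and severe thresholds
# given in GiB. $2 and $3 default to 1 and 0.09.
classify_free() {
    awk -v free="$1" -v warn="${2:-1}" -v severe="${3:-0.09}" 'BEGIN {
        gib = free / (1024 * 1024)
        if (gib < severe)    print "SEVERE"
        else if (gib < warn) print "WARNING"
        else                 print "OK"
    }'
}
```

For example: classify_free "$(free_kib /opt/wandisco/svn-multisite-plus/replicator/content)"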

Create additional resource monitors using the following procedure:

Click the "SETTINGS" link on the top menu bar.

Monitoring Data is situated below the Administrator Settings. Enter the full path to the resource that you wish to monitor. For example, you might wish to monitor the replicator logs: "/opt/svn-multisite-plus/replicator/logs". Enter the path and click "Add".

The new resource monitor will appear as a new box - it will display "No records found", indicating that it doesn't yet have any monitoring rules set. Click its corresponding "Configure" link.

The screen will update to show the Resource Monitoring screen for your selected resource.

Settings

File Path: The full path for your selected resource.
Monitor Identity: The unique string that will identify the monitor.
Edit Condition and Event List: Lists the current resource monitors. Initially this will state "No records found".

Add Conditional and Event to List

Storage amount entry field

Enter an amount of disk space in Gigabytes. e.g. 0.2 would be equal to 200 Megabytes of storage.

Select an Event from the dropdown:

SEVERE: will initiate a shutdown of SVN MultiSite Plus and will also write a message to the log at the "SEVERE" logging level. See "When a Shutdown is triggered" for more information.
WARNING: will write a message to the log at the "WARNING" level of severity.
DEBUG: will write a message to the log at the "DEBUG" level of severity.
INFO: will write a message to the log at the "INFO" level of severity.

When you have added all the trigger points and events that you require for the resource, click "Update". You can then navigate away - Click on "Resource Monitoring" on the breadcrumb trail to return to the settings screen.

When a Shutdown is triggered

If the disk space available to a monitored resource is less than the value you have for a "Severe" event then the event is logged and MultiSite's replicator will shut down after a set interval of 10 minutes. You can configure the interval in application.properties file:

/opt/wandisco/svn-multisite-plus/replicator/properties/application.properties

resourcemonitor.period.min=10L: value in minutes!

** Alert! ** Edits to property files require a replicator restart
Any change that you make to the application.properties file will require that you restart SVN MultiSite Plus's replicator.

Once the replicator has shut down, all SVN repositories become unavailable to users, so you should immediately take action to make more disk space available. The replicator can be restarted using SVN MultiSite Plus's service script as soon as the resource that triggered the shutdown has enough available disk space not to trigger another shutdown.

Overriding the forced shutdown
A forced shutdown can leave you unable to start a node in order to resolve its cause. For example, in an absent-minded moment you might create a data monitor that triggers a severe log message when there's less free disk space than the disk's actual capacity. You'd be stuck, because it wouldn't be possible to free up enough space short of swapping for a bigger disk.

There's a method you can use to unlock the forced shutdown.

Navigate to the properties folder, by default you'll find this here:

/opt/wandisco/svn-multisite-plus/replicator/properties/application.properties

Create a backup, then edit the file, changing the line
```
monitor.ignore.severe=false
```
to say
```
monitor.ignore.severe=true
```
Save the change to the file.
Restart the replicator (see Starting up). During the restart the replicator will now ignore the severe warnings (which are still written to the log file), allowing you to delete the offending monitor.

You can't use this procedure to override the default monitor. Its emergency shutdown limit of <100MiB will ALWAYS shut down the replicator.
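The backup-and-edit steps of the override procedure can be sketched as follows. The function name ignore_severe is hypothetical, and the sketch assumes the property appears on its own line exactly as shown above; if the line is absent, the file is left unchanged.

```shell
#!/bin/sh
# Hypothetical helper: back up the properties file $1 as $1.bak, then
# rewrite monitor.ignore.severe=false to monitor.ignore.severe=true.
ignore_severe() {
    props=$1
    cp -p "$props" "$props.bak" || return 1
    # Read from the backup and write over the original in place,
    # which keeps the original file's ownership and permissions.
    sed 's/^monitor\.ignore\.severe=false$/monitor.ignore.severe=true/' \
        "$props.bak" > "$props"
}
```

After running it against your application.properties file, restart the replicator as described in the procedure.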

8. Setting up email notifications

The email notification is a rules-based system to deliver alerts based on user-defined templates over one or more channels to destinations based on triggers that are activated by arbitrary system events. Put simply, email notification sends out emails when something happens within the SVN MultiSite Plus environment. The message content, trigger rules and destinations are all user-definable.

8.1 Set up a Gateway

The Gateway section stores your email (SMTP) server details. You can set up multiple gateways to ensure that the loss of a single server doesn't prevent alert notifications from being delivered.

Log into the admin UI, then click the Settings tab.

Click on the Gateway section of the Notifications area.

Enter your email gateway's settings:

IP/Hostname of SMTP Server: Your email server's address.
SMTP Server Port: The port assigned for SMTP traffic (Port 25 etc).
Encryption Type: Indicate your server's encryption type - None, SSL (Secure Socket Layer) or TLS (Transport Layer Security). SSL is commonly used. For tips on setting up suitable keystore and truststore files see Setting up SSL Key pair. If you're not familiar with the finer points of setting up SSL keystores and truststores, we recommend that you read the following article: Using Java Keytool to manage keystores.
Authentication Required: Indicate whether you need a username and password to connect to the server - requires either "true" or "false".
User Name: If authentication is required, enter the authentication username here.
Password: If authentication is required, enter the authentication password here.
Sender Address: Provide an email address that your notifications will appear to come from. If you want to be able to receive replies from notifications you'll need to make sure this is a valid and monitored address.
Number of Tries Before Failing: Set the number of attempts SVN MultiSite Plus makes in order to send out notifications.
Interval Between Tries (Seconds): Set the time (in seconds) between your server's attempts to send notifications.

Click on the "+Add" button. Your gateway will appear in the table.
You can add any number of gateways. SVN MultiSite Plus will exhaust the "Number of Tries Before Failing" for each registered gateway before moving down the list to the next.

You can use the Test button to verify that your entered details will connect to a mail gateway server.

8.2 Set up a Destination

The destinations section stores the email addresses of your notification recipients.

Click on the + on the Destinations line.

Enter an email address for a notification recipient. Click the + Add link.

The destination will appear in a table. Click the Edit or Remove links to change the address or remove it from the system.

8.3 Set up a Template

The template section is used to store email messages. You can create any number of templates, each with its own notification message, triggered by one of a number of trigger scenarios that are set up in the Rule section.

Click on the + on the Template line.

Enter a Template Subject line. This will be the subject of the notification email.

Enter some Body Text. This will be the message that is sent out when the notification is triggered. The message has a 1024 character limit; you can track the available number of characters at the bottom of the text box.

When the message has been entered, click the + Add link to save the message template.

Available variables

When writing email notification templates, you can insert variables into the template that will be interpolated when the notification is delivered. The following two variables are available for ALL event types:

{timestamp}: This will be substituted with the time at which the event is received (not the time at which the notification is delivered).
{event}: This will be substituted with the raw dump of the event.

For the event types "Disk Monitor Info", "Disk Monitor Severe" and "Disk Monitor Warning", the following additional variable is available:

{event.message}: This will be substituted with information about the disk monitoring threshold that was exceeded.

For the event types "Deploy Repository Succeeded" and "Deploy Repository Failed", the following additional variables are available:

{event.proposerNodeId}: This will be substituted with the ID of the node that sent the event.
{event.originalProposal.repository.name}: This will be substituted with the user-specified name of the repository to which the event pertains.
{event.originalProposal.repository.fSPath}: This will be substituted with the on-disk location of the repository to which the event pertains.

More event types and event variables will be added in the future.
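When drafting a template outside the UI, you can check it against the 1024 character limit and list the variables it uses before pasting it in. This is only a drafting aid and does not reproduce how SVN MultiSite Plus interpolates variables; the function names template_fits and template_vars are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the body text $1 is within the
# 1024-character limit. Some shells count bytes rather than characters,
# so treat the result as exact for ASCII text only.
template_fits() {
    [ "${#1}" -le 1024 ]
}

# Hypothetical helper: list each distinct {variable} used in template $1,
# one per line, in sorted order.
template_vars() {
    printf '%s\n' "$1" | grep -o '{[^}]*}' | sort -u
}
```

Comparing the output of template_vars with the variables listed above for your chosen event type is a quick way to catch a misspelled name.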

8.4 Set up a Rule

The Rule section is used to define which system event should trigger a notification, what message template should be used and which recipients should be sent the notification.

** Alert! ** Known issue
It's currently not possible to edit notification rules that you create. This issue will be addressed in a later release. For now, use the simple workaround of deleting then recreating rules that you want to change.

Click on the + on the Rule line.

Choose an Event from the Event drop-down list:

Event:

Any Repository Global Read-Only Event: In case of any repository entering a global read-only mode.
Global Read-Only Due to Admin Action: In case of any repository entering a global read-only mode as a result of administrator interaction through the admin UI.
Disk Monitor Info: Disk Storage has dropped below the Info level. This will trigger if any data monitor message is written to the logs at the "INFO" level.
Disk Monitor Warning: Disk Storage has dropped below the Warning level. This will trigger if any data monitor message is written to the logs at the "WARNING" level. For more information about disk warning messages, see the Setting up data monitoring section.
Disk Monitor Severe: Disk Storage has hit the Severe level. This will trigger if any "Severe" level data monitor message is written to the logs. At this level, SVN MultiSite Plus will have shut down to ensure that disk space exhaustion doesn't corrupt your system and potentially your SVN repositories. For more information about disk warning messages, see the Setting up data monitoring section.
Deploy Repository Failed: A repository added to SVN MultiSite Plus has failed to deploy, in which case the repository will not be replicated.
Deploy Repository Succeeded: A repository added to SVN MultiSite Plus has successfully deployed. Such an event might be sent to a mail group received by SVN users, telling them that their repository is now accessible.
Global Read-Only Due to Consistency Check Failure: In case of any repository entering a global read-only mode as a result of failing a consistency check with its replicas.
Generic file replication error occurred: An error occurred with the Generic Replication script.

9. Backing up SVN MultiSite Plus data

It's possible to back up SVN MultiSite Plus's own database in case you need to quickly restore a node.

** Alert! ** Only MultiSite Settings are backed-up
This procedure backs up SVN MultiSite Plus's internal Prevayler database. It doesn't touch your SVN repository data or any other system files (such as Apache configuration, authz files etc.), which you should also be backing up.

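To create a backup while the replicator is running, send a POST request to the node's backup endpoint:

curl --user <username>:<password> -X POST http://[node_ip_address]:8082/dcone/backup

The backup is written to:

[INSTALL-DIR]multisite-plus/replicator/db/backup/X.X.X_DConE_Backup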
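If you schedule backups, the curl request can be wrapped so that the HTTP status code is captured for logging. This is a sketch: backup_url and request_backup are hypothetical names, and the source does not state which status code indicates success, so the wrapper only reports the code.

```shell
#!/bin/sh
# Build the backup endpoint URL for node address $1 (port 8082 as above).
backup_url() {
    printf 'http://%s:8082/dcone/backup\n' "$1"
}

# Hypothetical wrapper: POST to the endpoint and print only the HTTP
# status code. $1 = node address, $2 = username, $3 = password.
request_backup() {
    curl --silent --output /dev/null --write-out '%{http_code}\n' \
         --user "$2:$3" -X POST "$(backup_url "$1")"
}
```

Bear in mind that a password given on the command line is visible in the process list. Passing only the username to --user makes curl prompt for the password instead.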

Back up while shut down

(run from within /replicator):

java -cp ./fsfsrestore.jar com.wandisco.fsfs.backup.FsfsBackup -c ./properties/application.properties

Use this to back up the current state of all prevaylers when SVN MultiSite Plus is shut down - you don't therefore need to start the replicator in order to create a backup of the database.

10. Restore SVN MultiSite Plus Data

The restore functionality is no longer supported since the product upgrade functionality is handled using the installer.

11. Manage Access to SVN MultiSite Plus

SVN MultiSite Plus supports three different mechanisms for managing access to its admin UI:

Internally Managed users - are admin accounts that are set up from within SVN MultiSite's Admin UI.
LDAP Authorities - you can have SVN MultiSite Plus query LDAP services and filter for a suitable group from which to populate admin users.
Kerberos Security - Finally, if your organization uses a Kerberos authentication system you can set up MultiSite to use it.

Internally Managed Users

It is possible to set up multiple administrator accounts for accessing the SVN MultiSite Plus admin console. Accounts can be set up from within the admin UI (via the Security tab). These users are then able to log in to any node's admin UI by providing their username and password.

The following sections explain how to set up multiple accounts, manage LDAP authorities and export/import the resulting data.

Adding additional users

Log in to the Admin UI using an existing admin account.

Click on the Security tab, then click on the Add User button.

Enter details for the new administrator, then click the Add User button situated at the end of the entry bar.

You'll see a growl message confirming that the user has been added. The new user is listed under Internally Managed Users after you click the Reload button (or refresh your browser session).

Removing or editing user details

You can modify any user's details by clicking their corresponding Edit button on the Internally Managed Users table.

LDAP Authorities

SVN MultiSite Plus supports the use of LDAP authorities for managing admin login accounts. See our brief Guide to LDAP.

When connecting SVN MultiSite Plus to available LDAP authorities it is possible to classify the authority as "Local" i.e. specific to the node in question or not - in which case the authority details will be replicated to the other nodes within the replication network.

It's possible to run multiple LDAP authorities that are of mixed type, i.e. using some local authorities along with other authorities that are shared by all nodes. When multiple authorities are used, it's possible to set what order they are checked for users.

The standard settings are supported for each configured LDAP authority: URL, search base and filter and bind user credentials. Note that the bind user's password cannot be one-way encrypted using a hash function because it must be sent to the LDAP server in plain text, so for this reason the bind user should be a low privilege user with just enough permissions to search the directory for the user being authenticated. Anonymous binding is permitted for those LDAP servers that support anonymous binding.

Add Authority

Use the Add Authority feature to add one or more LDAP authorities, either local to the node or connected via WAN. Local LDAP services are treated as having precedence. When Internally Managed Users are enabled, they are checked first when authenticating users - see Admin Account Precedence.

Procedure for adding an authority:

Click on Add Authority.

The Authority entry form will appear. Enter the following details:

URL
Enter your authority's URL. You need to include the protocol: ldap:// or ldaps://

Bind User DN
Enter the DN of the LDAP user account that will be used to query the authority. As noted above, this should be a low privilege user.

Search Base
Enter the Base DN, that is, the location of the users that you wish to retrieve.

Search Filter
Optionally, add a query filter that will select users based on relevant LDAP attributes. For more information about query filter syntax, consult the documentation for your LDAP server.

Is Local?
Tick this checkbox if you want the authority to only apply to the current node and not be replicated to other nodes (which is otherwise done by default).

Click Add Authority to save the settings that you have just entered. Before doing so, you can click the Test button to verify that the details will successfully connect to the authority without yet adding it.
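It can be useful to try the same URL, bind user, search base and filter from the command line before entering them in the form. The sketch below assumes that OpenLDAP's ldapsearch client is installed on the node, which is an assumption rather than a product requirement; the function names ldap_url_ok and ldap_dry_run are hypothetical.

```shell
#!/bin/sh
# Succeed if $1 starts with one of the two accepted protocols
# and has something after it.
ldap_url_ok() {
    case "$1" in
        ldap://?*|ldaps://?*) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical dry run using ldapsearch.
#   $1 = URL, $2 = bind user DN, $3 = search base, $4 = search filter
# -x selects simple authentication and -W prompts for the bind password.
ldap_dry_run() {
    ldap_url_ok "$1" || {
        echo "URL must start with ldap:// or ldaps://" >&2
        return 1
    }
    filter=$4
    [ -n "$filter" ] || filter='(objectClass=*)'
    ldapsearch -x -H "$1" -D "$2" -W -b "$3" "$filter"
}
```

If the dry run returns the users you expect, the same four values should be suitable for the Authority entry form.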

When running with multiple authorities, you should determine the order by which MultiSite polls the authorities. Use the +- symbols at the end of each authority entry to push it up (+) or down (-) the list.

Edit Authority

Modify an existing authority's settings:

Click the edit link on the line that corresponds with the authority that you wish to edit.

Update the settings in the popup box, then click Save.

Kerberos Security

This section covers the basic requirements for integrating SVN MultiSite Plus with your existing Kerberos systems. The procedure requires the following:

A Key Distribution Center
A Workstation setup on each node
A machine with a suitably configured browser. See Enabling Kerberos access on your browser

** Alert! ** Time, ladies and gentlemen, please.
Ensure that time synchronization and DNS are functioning correctly on all nodes before configuring Kerberos. A time difference between a client and the master Kerberos server that exceeds the Kerberos setting (5 mins default) will automatically cause auth failure.
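A quick way to compare two clocks against the 5 minute default is to subtract their epoch times. This is a minimal sketch with a hypothetical function name, skew_ok; how you obtain the remote time (for example over ssh) is left to you.

```shell
#!/bin/sh
# Hypothetical check: succeed if two clocks, given as seconds since the
# epoch in $1 and $2, differ by no more than $3 seconds.
# $3 defaults to 300, i.e. the 5 minute Kerberos default.
skew_ok() {
    max=${3:-300}
    diff=$(( $1 - $2 ))
    if [ "$diff" -lt 0 ]; then
        diff=$(( 0 - diff ))
    fi
    [ "$diff" -le "$max" ]
}
```

On systems whose date command supports the +%s format, the local value can be supplied as "$(date +%s)".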

Configuration

This procedure assumes that you have already set up your DNS service and master Key Distribution Center.

On each node, add the service principal:

# kadmin -p root/admin -q "addprinc -randkey HTTP/node1.example.com"
# kadmin -p root/admin -q "ktadd -k /opt/krb5.keytab HTTP/node1.example.com"
# chmod 777 /opt/krb5.keytab
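After creating the keytab you may want to confirm that it contains the service principal. The sketch below parses the output of klist -k, assuming that each entry is printed as a key version number followed by principal@REALM; the function name keytab_has_principal is hypothetical.

```shell
#!/bin/sh
# Hypothetical check: read `klist -k` output on stdin and succeed if it
# lists the service principal $1 (for example HTTP/node1.example.com).
keytab_has_principal() {
    grep -F -q " $1@"
}
```

For example: klist -k /opt/krb5.keytab | keytab_has_principal HTTP/node1.example.com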

Each node should have the add-on "JCE Java 6 or Java 7 Unlimited Strength Jurisdiction Policy Files" installed. These can be downloaded from Oracle, subject to your local import rules concerning encryption technology. Once downloaded, extract them to the Java security library, i.e.
```
$JAVA_HOME/lib/security/
```

At this point you can install SVN MultiSite Plus on each node. If that's already done, then configure the Kerberos settings under the Security tab.

Service Principal:

This is the unique name for an instance of a service, such as HTTP/node1.example.com

Keytab Location:

This is the location of the keytab, a file containing pairs of Kerberos principals and encrypted keys (often derived from the Kerberos password). It's used for logging into Kerberos without being prompted for a password.

Kerberos Config Location:

The krb5.conf file contains Kerberos configuration information, including the locations of KDCs and admin servers for the Kerberos realms of interest, defaults for the current realm and for Kerberos applications, and mappings of hostnames onto Kerberos realms. Normally, you should install your krb5.conf file in the directory /etc, i.e. /etc/krb5.conf

Save the settings and log out. Return to the node in your browser. This time you should be logged in automatically (in this example, as user sally@EXAMPLE.COM).

See Security Reference: Kerberos settings.
See Configure browsers for Kerberos authentication.

Nodes

12. Adding a Node

To replicate SVN repository data between sites, you first tie the nodes together in the form of a replication network. This starts with adding (connecting) nodes, in a process we call induction.

You can also remove a node.

** Alert! ** Unique Node Names
You can't reuse Node IDs. If you have removed a node, you can't create a replacement that uses the old Node ID. The replication network maintains a record of the old node and will block it from reintroduction.

Log in to the SVN MultiSite admin console of the new node that you are connecting to your existing servers.
Click on the Nodes tab.
Click on the Connect to node button.

Enter the details of an existing, connected node. You can get these details from the Settings tab of the existing node.

Node ID

This is the name that you gave the Node during installation. If you log into the node in question you can see the Node ID on the title of any screen that you view, it also appears in the logged in message: "Welcome to MultiSite, admin. You are connected to <NODE ID>"

Location ID

A unique string that is created for each node as a unique identifier. You can get this from the System Data table, found on the node's Settings tab.

Hostname / IP Address

The hostname or IP address of the node's server.

DConE Port

The TCP port that the node uses for DConE, which handles agreement traffic. The default is 6444. See Reserved Ports.
Click on the SEND CONNECTION REQUEST button. The new node will appear on the active list of Sites.
Should a problem occur, you may find that the new node gets stuck in a 'pending' state. If this happens, see If Induction fails.
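
Before sending a connection request, it can help to confirm that the existing node's DConE port is reachable from the new node's server. The following is a minimal Python sketch and is not part of SVN MultiSite Plus. The function name dcone_port_reachable is hypothetical, and 6444 is only the documented default. A successful TCP connection shows only that the port is open, not that induction will succeed.

```python
import socket


def dcone_port_reachable(host, port=6444, timeout=5.0):
    """Return True if a TCP connection to host:port can be opened.

    6444 is the documented default DConE port; pass the value shown on
    the existing node's Settings tab if it has been changed.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution errors.
        return False


# Example (the hostname is a placeholder for an existing node):
# dcone_port_reachable("node1.example.com")
```

If the check returns False, look at firewalls and the port settings on both servers before retrying the connection request.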

13. Removing a Node

The removal of a node from the SVN MultiSite Plus replication group is useful if you will no longer be replicating repository data to its location and wish to tidy up your replication group settings.

** Alert! ** No ties allowed
The option to remove a node should only appear if it is not currently a member of a replication group. You may need to remove and recreate replication groups in order to make it eligible for removal.

Known issue:
NOTE: If a node is inducted but not in a replication group then it is possible (from that node) to remove other inducted nodes that are in a replication group. There's currently an issue in that a node isn't aware of the membership of replication groups of which it is not itself a member. This means that it is possible to remove a node that is a member of a replication group, if done from another node that doesn't have knowledge of the replication group.

Until we block this capability, you should manually check any node that you plan to remove to make absolutely sure that it is not a member of a replication group.

** Alert! ** Once removed a node can't come back
Take care when removing nodes. In order to ensure that the replication network is kept in sync, removed nodes are barred from being re-inducted. The only way that you can bring back a node is to perform a reinstallation of SVN MultiSite Plus using a new Node ID.

Click on the Nodes tab.

Nodes that are eligible for removal will have the "Remove Node" option available under the Action column. In this example, NodeSanFrancisco is eligible for removal because we have removed it from any replication groups.

Nodes table under the Nodes tab

Click on the "Remove Node" link. Don't forget that this action is irreversible; you must be absolutely sure that you want to permanently remove the node.

Ready to remove NodeSanFrancisco.

After a refresh of the admin user interface, the removed node will continue to be displayed if you click the Display Removed Nodes button. Removed nodes can also be identified by their "Removed" status.

Node removed.

14. Stopping nodes

It's possible to bring all nodes to a stop through the use of a single button click (providing all associated repositories are replicating/writable).

** Alert! ** A stop can't be synchronized if associated repositories are Local Read-only
Before starting a Sync Stop All, make sure that none of your nodes have repositories in a Local Read-only state.

Here's how:

Log into the admin UI and click on the Nodes tab.

Click the Sync Stop All button.

Stop all nodes.

You'll get a 'growl' message confirming the stop has been triggered. You'll see the results on refreshing your browser session.

Stopped!

On the Node table all nodes will show as Stopped. In this state it's possible to perform maintenance or repairs without risking your replication getting out-of-sync.

Node removed.
The Sync Stop All button changes to Start All. However, it is also possible to start selected nodes individually by logging in to the admin console of each node that you want to start and using the Start Node link that appears in the Action column of the Nodes table.

Important!
Bringing your replication to a global stop is not a trivial business. We strongly recommend that you take the time to watch and confirm that all nodes report as stopped. If you suspect that one or more nodes are not going to stop you should investigate immediately:

If a repository has gone Local Read-only before or during your Sync Stop All, the stop will fail without any specific error message; you'll just observe that the nodes aren't stopped. In this case, check the Replicator Tasks widget on the dashboard.

In the tasks widget you might get:

Aborted tasksType PREPARE_COORDINATE_STOP_TASK_TYPE
Delete Task
Originating Node: Ld5UYU
tasksPropertyTASK_ABORTING_NODE: Ld5UYU
tasksPropertyTASK_ABORT_REASON: One or more replicas is already stopped.
The replica was: [[[Ld5UYU][bf0c6395-77b6-11e3-9990-0a1eeced110e]]]

The thing you would look for is the message:

Aborted tasksType PREPARE_COORDINATE_STOP_TASK_TYPE

In the replicator.log file you might also see the following error type:

"DiscardTaskProposal <task id etc> message: One or more replicas is already stopped."

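When you are watching for a failed stop, a small script can pick the relevant lines out of a log for you. The following Python sketch searches any iterable of lines for the two strings quoted above. The function name find_stop_aborts is hypothetical, and whether these strings appear verbatim in your version's logs is an assumption to verify.

```python
# Strings taken from the examples above; treat them as assumptions and
# adjust them if your logs word the messages differently.
ABORT_MARKERS = (
    "PREPARE_COORDINATE_STOP_TASK_TYPE",
    "One or more replicas is already stopped",
)


def find_stop_aborts(lines):
    """Return (line_number, text) pairs for lines that mention an aborted stop."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if any(marker in line for marker in ABORT_MARKERS):
            hits.append((number, line.rstrip("\n")))
    return hits


# Example (the path is a placeholder for your own log file):
# with open("/path/to/replicator.log") as handle:
#     for number, text in find_stop_aborts(handle):
#         print(number, text)
```

An empty result does not prove that the stop succeeded; confirm on the Nodes tab that every node reports as Stopped.
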
15. Starting nodes

If all nodes have been brought to a stop, click the Start All button to start them replicating again.

Stopped!

After a browser refresh, all nodes will now show as running.

Replication Groups

16. Adding a Replication Group

Use this procedure to add a new Replication Group. You need to add a new replication group when you need to replicate between a new combination of sites, i.e. sites that are not currently replicating in an existing group. If you are instead looking to replicate a new repository between existing sites, you can add a new repository to those sites. In this case see Add a new repository.

Log in to the SVN MultiSite browser-based user interface. Click on the REPLICATION GROUPS tab, then click on the CREATE REPLICATION GROUP button.

Creating a replication group.
Enter a name for the group in the Replication Group Name field, then click on the drop-down selector on the Add Sites field. Select the sites that you want to replicate between.

replication group details.

Replication Ground Rules
- A node can belong to any number of replication groups.
- A repository can only be part of a single active replication group at any particular time.
- It's possible to change membership on the fly, moving a repository between replication groups with minimal fuss.

Click on each node label to set its node type.

Click on node labels to change their type.

Advice on creating effective replication groups
For an understanding of some of the ground rules for replication you should read the section Creating resilient Replication Groups.

Nodes are automatically added to a group as "Active Voters". To understand the differences between the different types of nodes, read Guide to node types.

Once all sites are in place and their settings adjusted to your needs, click CREATE REPLICATION GROUP.

Create Replication Group.
Newly created replication groups will appear on the Replication Group tab, but only on the admin UI of nodes that are themselves members of the new group.

The new replication group now appears - if you are logged into one of its constituent nodes.

17. Deleting replication groups

It's possible to remove replication groups from SVN MultiSite Plus, although only if they have been emptied of repositories. Run through the following procedure as an example.

We have identified that replication group "VinyardRepos" is to be removed from SVN MultiSite Plus. We can see that it has a single repository associated with it. Click on View to see which one.

View
On the Replication Group configuration screen we can see that Repo5 is associated with the group. We can see that the Delete Replication Group (VinyardRepos) link is currently disabled. You can follow the link to the repositories page to remove the association.

Repositories
On the repositories screen, click on the associated repository, in this example it's Repo5, then click on the EDIT button.

Select and Edit
On the Edit Repository box, use the Replication Group drop-down to move the repository to a different Replication Group. Then click SAVE.

Edit
Repeat this process until there are no more repositories associated with the Replication Group that you wish to delete. In this example VinyardRepos only had a single repository, so it is now empty and can be deleted. Click on View, then on Configure.

Move it
Now that Replication Group VinyardRepos is empty of replication payload, the Delete link is enabled. Click on the link Delete Replication Group (VinyardRepos) to remove the replication group. Note that there's no undo. However, no data is removed when a replication group is deleted, so it should be easy enough to recreate a group if necessary.

Click the Delete link button

A growl will appear confirming that the replication group has been deleted.

Deleting the replication group

18. Adding a node to a replication group

** Alert! ** Don't add a node during a period of high replication load
When adding nodes to a replication group that already contains three or more nodes, ensure that there isn't currently a large number of commits being replicated.

Adding a node during a period of high traffic (heavy level of commits) going to the repositories may cause the process to stall.

It's possible to add additional nodes to an existing replication group, so that there's minimal disruption to users. Here's the procedure:

Log in to a node and click on the REPLICATION GROUPS tab. Go to the replication group to which you will add a new node and click on its VIEW link.

Replication Groups
The replication group screen will appear. Click Add Nodes.

View the group settings

tip Why is the Add Nodes button disabled?
The Add Nodes button may be greyed out if the current replication group configuration won't support the addition of a new voter node.

It is also possible that a configuration that is scheduled in the future may block the addition of a new node. Check the schedule if you think that you should otherwise be able to add a new node to the replication group.

Select the node that you wish to add to the replication group.

Select
When there are no further nodes to add to the group, click on the Add Nodes button.
At this stage in the process you select a Helper Node, from which repository data will be synced to the new node. Select a Helper Node.

Helper node
Heed the warning about not closing the browser or logging out during this process otherwise you'll need to perform a more lengthy repair procedure. Click the Start Sync button.

start sync
You've reached the stage in the process where you need to manually synchronize the repositories from the helper node (which is placed temporarily offline for users until this process is finished). See our guide: 30 Synchronizing repositories using rsync.

The process lets you do a complete sync or select specific repositories that you wish to sync. Assuming that you have synced all repositories you would click Complete All. The helper node is then released from the process, allowing it to catch up with any transactions it missed while taking part in the procedure.

complete all
A growl message will appear confirming that the new node has been added to the replication group.

new node!
Returning to the Replication Group screen, you can see the new node count.

Adding new node complete!

19. Removing a Node from a Replication Group

It's possible to remove a node from a replication group. This functionality is required if the developers at one of your nodes are no longer going to contribute to the repositories handled by a replication group. Removing a node from a replication group will halt further updates to its repository replicas.

tip Remove stray repositories
In the event that you remove a node from a replication group, you should delete its copy of the repositories managed by the replication group. Having an out-of-date stray copy could result in confusion/users working from old data.

You will not be allowed to remove a node that is currently assigned as the "Managing Node". In order to remove the managing node, go to the Configure Schedule page and assign a different node as a Managing Node.

Log in to the admin console of one of your nodes. The node will need to be a member of the relevant Replication Group, otherwise the group won't appear on the tab. Click on the Replication Groups tab.
On the Replication Groups tab, click on the View button that corresponds with the Replication Group from which you plan to remove a node.

Login and go to REPLICATION GROUPS
Click on the node that you plan to remove from the group. Providing that the removal of the node doesn't invalidate the remaining configuration you will see a Remove node from replication group link. Click the link.

Remove!
A dialog will open which asks you to confirm the removal of the selected node from the Replication Group. Click Remove.

Remove. Really!
A growl message will confirm that the removal is in progress. You may need to click the Reload button to ensure that the action has been completed on all nodes.

Reload to confirm the updated state.
The node will now be removed from the Replication Group. On the Replication Groups panel you should now see that the constituent number of nodes has reduced by one.

Less one member node

20. Scheduling node changes - follow the sun

You can schedule the member nodes of a replication group to change type according to when and where it is most beneficial to have active voters. To understand why you may want to change your nodes, read about Node Types.

Schedule node type changes via the public API

Instead of manually setting up schedules through a node's UI you can do it programmatically through calls to the public API.
See Public API ScheduledNodeAPIDTOList element and scheduledNodeAPIDTOList Datatype.

Use the following API call:

http://<ip>:8082/public-api/replicationgroup/{repgroupID}/schedule

e.g.

http://10.0.100.135:8082/public-api/replicationgroup/97913c04-bbad-11e2-877a-028e03094f8d/schedule

PUT with ReplicationGroupAPIDTO XML as body:

To make Node N3 a tie-breaker 'T' from 10:00 to 16:00 (GMT) every day of the week, with Node N1 as tie-breaker 'T' afterwards:

tip Times are always in UTC (GMT)
When viewed on a node, times are shifted to the local timezone, although internally they are always recorded in UTC.
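
Because schedule times are recorded in UTC, it is easy to be an hour or a day out when planning a follow-the-sun schedule around local office hours. The following Python sketch converts a local hour to UTC using a fixed offset. The function name local_hour_to_utc is hypothetical, and a fixed offset ignores daylight saving time, so check the result against the schedule shown in the UI.

```python
from datetime import datetime, timedelta, timezone


def local_hour_to_utc(hour, utc_offset_hours):
    """Convert a local hour-of-day (0-23) to (day_shift, utc_hour).

    day_shift is -1, 0 or +1: the number of days by which the UTC day
    differs from the local day.
    """
    local_tz = timezone(timedelta(hours=utc_offset_hours))
    # The calendar date is arbitrary; only the hour and day shift matter.
    local = datetime(2014, 1, 15, hour, tzinfo=local_tz)
    utc = local.astimezone(timezone.utc)
    day_shift = (utc.date() - local.date()).days
    return day_shift, utc.hour


# 09:00 at UTC-8 is 17:00 UTC on the same day.
print(local_hour_to_utc(9, -8))   # (0, 17)
# 09:00 at UTC+12 is 21:00 UTC on the previous day.
print(local_hour_to_utc(9, 12))   # (-1, 21)
```
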

Example curl command:

Make a text file called schedule.xml containing the ReplicationGroupAPIDTO XML (see the sample below), then run:

curl -u username:password -X PUT -d @schedule.xml http://[IP]:[PORT]/public-api/replicationgroup/97913c04-bbad-11e2-877a-028e03094f8d/schedule
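
If you prefer to script the call rather than use curl, the same request can be built with Python's standard library. This is a sketch under assumptions: the function name build_schedule_request is hypothetical, HTTP Basic authentication is assumed because the curl command uses -u, and the request is only built here, not sent.

```python
import base64
import urllib.request


def build_schedule_request(host, port, group_id, xml_body, username, password):
    """Build (but do not send) the PUT request for a replication group schedule.

    xml_body must be bytes, for example the contents of schedule.xml.
    """
    url = "http://{}:{}/public-api/replicationgroup/{}/schedule".format(
        host, port, group_id
    )
    credentials = "{}:{}".format(username, password).encode("utf-8")
    token = base64.b64encode(credentials).decode("ascii")
    return urllib.request.Request(
        url,
        data=xml_body,
        method="PUT",
        headers={"Authorization": "Basic " + token},
    )


# Example (host, port and credentials are placeholders):
# with open("schedule.xml", "rb") as handle:
#     request = build_schedule_request(
#         "10.0.100.135", 8082,
#         "97913c04-bbad-11e2-877a-028e03094f8d",
#         handle.read(), "username", "password")
# urllib.request.urlopen(request)  # sends the request
```

Building the request separately from sending it lets you inspect the URL and headers before anything reaches the node.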

Sample 'schedule.xml' file

<ReplicationGroupAPIDTO>
       <replicationGroupName>global</replicationGroupName>
     <replicationGroupIdentity>97913c04-bbad-11e2-877a-028e03094f8d</replicationGroupIdentity>
       <scheduledNodes>
           <dayOfWeek>1</dayOfWeek>
           <hourOfDay>14</hourOfDay>
           <schedulednode>
               <nodeIdentity>N1</nodeIdentity>
               <locationIdentity>c0e486a0-bbab-11e2-863b-028e03094f8e</locationIdentity>
               <isLocal>true</isLocal>
               <isUp>true</isUp>
               <lastStatusChange>0</lastStatusChange>
               <role>AV</role>
           </schedulednode>
           <schedulednode>
               <nodeIdentity>N3</nodeIdentity>
               <locationIdentity>5480f515-bbad-11e2-8301-028e03094f8c</locationIdentity>
               <isLocal>false</isLocal>
               <isUp>true</isUp>
               <lastStatusChange>0</lastStatusChange>
               <role>T</role>
           </schedulednode>
           <schedulednode>
               <nodeIdentity>N2</nodeIdentity>
               <locationIdentity>478c766f-bbad-11e2-877a-028e03094f8d</locationIdentity>
               <isLocal>false</isLocal>
               <isUp>true</isUp>
               <lastStatusChange>0</lastStatusChange>
               <role>AV</role>
           </schedulednode>

Download the full sample schedule.xml file.
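
Writing a schedule by hand is repetitive, so you may want to generate the XML. The following Python sketch builds the elements that appear in the sample above using xml.etree.ElementTree. It assumes that each scheduledNodes block describes one day and hour slot and closes after its schedulednode entries; the sample shown here is truncated, so check the result against the full sample file and the Public API reference. The function names are hypothetical.

```python
import xml.etree.ElementTree as ET


def scheduled_node(node_id, location_id, role, is_local=False):
    """Build one <schedulednode> element using the fields from the sample."""
    node = ET.Element("schedulednode")
    ET.SubElement(node, "nodeIdentity").text = node_id
    ET.SubElement(node, "locationIdentity").text = location_id
    ET.SubElement(node, "isLocal").text = "true" if is_local else "false"
    # isUp and lastStatusChange simply repeat the values used in the sample.
    ET.SubElement(node, "isUp").text = "true"
    ET.SubElement(node, "lastStatusChange").text = "0"
    ET.SubElement(node, "role").text = role
    return node


def scheduled_hour(day_of_week, hour_of_day, nodes):
    """Build one <scheduledNodes> block (assumed: one block per hour slot)."""
    block = ET.Element("scheduledNodes")
    ET.SubElement(block, "dayOfWeek").text = str(day_of_week)
    ET.SubElement(block, "hourOfDay").text = str(hour_of_day)
    block.extend(nodes)
    return block


n3 = scheduled_node("N3", "5480f515-bbad-11e2-8301-028e03094f8c", "T")
print(ET.tostring(scheduled_hour(1, 14, [n3]), encoding="unicode"))
```
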

Log in to a node and click on the REPLICATION GROUPS tab. Click on the VIEW link for the replication group for which you wish to create a schedule.

Scheduling is done through replication group settings.
The replication group's pop-up window will open, showing the member nodes together, along with their current (scheduled) roles. Click the CONFIGURE button.

Configure.

Membership views are what is scheduled not necessarily what is currently active
The roles and membership displayed in the popup are based upon the agreed schedule; this is the setup that should be in place if everything is running smoothly. It may not accurately represent the current state of the replication group, for example because of a delay in processing on a node or because something has caused a process to hang. This should not be a cause for concern, but be aware that the displayed membership is an approximation based on the information currently available to the local node.
The replication groups configuration screen will appear. You may notice that to the left a Role Schedule is noted. By default this will show as DISABLED. Click on the Configure Schedule button, in the right-hand corner.

Role Schedule: Disabled (for now).
The Schedule screen will appear. The main feature of the screen is a table that lists all the nodes in the replication group, set against a generic day (midnight to midnight) that is divided into hourly blocks. Each hourly block is color-coded to indicate the specific node's type.
In the image below NodeSanFrancisco is coded as blue which indicates that it is set as a Passive Voter. The hourly blocks associated with NodeChengdu are Magenta, indicating that it is set as a pure voter. The blocks for NodeParis are colored yellow, indicating that this node is set as an Active Voter.

Vanilla Scheduling - no changes to type over time.
To make a change to the schedule, click on a block. It doesn't matter which block you select as the New Scheduled Configuration form will let you modify any hours for any available node.

New Schedule Form.
Click on the node icon to change its type.
In this example NodeSanFrancisco is changed to a Tie-breaking Passive Voter, then NodeAuckland is changed into a Tie-breaker.

Swapping roles.
When all node changes have been made, click on the SAVE button to continue, or the CANCEL button if you change your mind.

The schedule view will now change to show the changes that you have made. Review the changes in the schedule table, then click the SAVE SCHEDULE button; the changes are not applied until you do.

** Alert! ** Changing role of the managing node
It's currently not possible to change the role of the node that is assigned as the managing node.
If you need to change the managing node's role, first make a different node the manager. This restriction was intended to stop the managing node from being given a non-active role. Not only would this stop the node from managing schedule changes, it would make it impossible to move the managing node status to another node.

In a future release we may be able to make it possible to change the managing node's role to another compatible role, e.g. from Active Voter to Active.

Repositories

21. Adding Repositories

You can add additional repositories for replication through the admin UI. The repository first needs to be present on all the nodes that will be part of the corresponding replication group, and the copies need to be introduced to the replication system in an identical state.

Ensure that the repository that you are going to replicate is copied to each node that is a member of the replication group that will be responsible for the repository. The repository copies must be in an identical state before you add them into SVN MultiSite Plus.

Click on the Add button and enter the repository information:

Enter repository details.

Repo name
A name that you want SVN MultiSite Plus to use when referring to the new repository.
Known issue: duplicate repository names allowed
It's currently possible to add multiple repositories with the same name (they'll need different paths though). Ensure that you don't use the same name for multiple repositories. Doing so is bad practice and will be prevented in future releases.

FS Path
The absolute file system path for the repository. This should be identical on each node.

Replication Group
The replication group under which this repository will be managed. Select from available groups or go and create a new replication group.

Global Read-Only
This check-box lets you add the repository in a locked-down state that won't allow SVN users to commit changes to the repository. This feature is useful for putting a repository into maintenance mode where the copies might otherwise get out-of-sync.

When all entry fields have been filled, click on the Add Repo button. You'll then see a 'growl' message confirming that the repository has been deployed and that you should reload your browser session to confirm that the repository has been added.

Recheck the repositories table to see that the new repository has been added and has a "Replicating" status.

Replicating.

tip Repository stuck in Pending state
If a repository that you added gets stuck in the deploying state - you'll see this on the Dashboard, in the Replicator Tasks window - you can cancel the deployment and try adding the repository again. To cancel a deployment, go to the Replicator Tasks window and click on the Cancel Task link.

** Alert! ** svnadmin pack support
It's not currently possible to run the svnadmin pack command when running SVN MultiSite Plus. Support for this command is currently being added to FSFSWD and should be available in the near future.

22. Removing Repositories

It's possible to remove repositories from SVN MultiSite Plus. Follow this quick procedure.

Log in to the admin console of one of your nodes. The node will need to be a member of a replication group in which the repository is replicated, otherwise the repository won't appear on the tab. Click on the Repositories tab to see it.

Login.
On the Repositories tab, click on the line that corresponds with the repository that you want to remove.

Repositories.
Once a repository has been highlighted (in yellow), the REMOVE button will become available. Click it.

Remove.

A dialog box will appear entitled "Remove repository from replication group". It will confirm that removing a repository from a replication group will stop any changes that are made to it from being replicated. However, no repository data is removed.

23. Editing a repository

It's possible to edit a repository's properties after it has been set up in SVN MultiSite Plus. Follow this quick procedure.

Log in to the admin console of one of your nodes. The node will need to be a member of a replication group in which the repository is replicated, otherwise the repository won't appear on the tab. Click on the Repositories tab to see it.

Login.

On the Repositories tab, click on the line that corresponds with the repository that you want to edit. Then click the Edit button.

Repositories.

You will now see the edit window appear.

Edit Repository.

Local Read-only

Enables or disables the repository's Local Read-only setting. When enabled, the repository will not be writable, either by local users or by the replication system (which would otherwise push in changes made to the repository on other nodes). However, changes that come from the other nodes are stored away to be played out as soon as the read-only state is removed.

Global Read-only

Enables or disables the repository's Global Read-only setting. When enabled, the repository will not be writable either locally or globally. This is used to lock a repository against any changes.

Replication Group

Use the drop-down selector to change the replication group to which the repository belongs.

24. Repository synchronized stop

The Repository Synchronized Stop is used to stop replication between repository replicas. It can be performed on a per-repository basis or on a replication group basis (where replication will be stopped for all associated repositories). To bring some or all nodes to a stop, use the Sync Stop All command found on the Nodes tab.
Repository stops are synchronized between nodes using a 'stop' proposal to which all nodes need to agree. As a result, while not all nodes will come to a stop at the same time, they all stop at the same point.
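
The difference between stopping at the same time and stopping at the same point can be shown with a toy model. The following Python sketch is purely conceptual and is not how DConE is implemented: it assumes that every node works through the same agreed order of proposals and halts when it reaches the stop proposal. The proposal names are invented for illustration.

```python
def apply_until_stop(proposals):
    """Apply an agreed, ordered list of proposals until a 'stop' is reached.

    Returns the proposals applied before the stop. Anything ordered after
    the stop proposal is left unapplied.
    """
    applied = []
    for proposal in proposals:
        if proposal == "stop":
            break
        applied.append(proposal)
    return applied


# Every node sees the same agreed order, even if it works through it at a
# different speed, so every node halts with the same revisions applied.
agreed_order = ["r101", "r102", "stop", "r103"]
states = {node: apply_until_stop(agreed_order) for node in ("N1", "N2", "N3")}
print(states)
```

In this model a slow node simply reaches the stop later than a fast one, but both end up holding exactly the same set of changes.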

Login to a node's browser-based UI and click on the Repositories tab. Click on the repository that you wish to stop replicating.
With the repository selected, click the Sync Stop button. A growl message will appear to confirm that a synchronized stop has been requested. Note that the process may not be completed immediately, especially if there are large proposals transferring over a WAN link.
On refreshing the screen you will see that a successfully sync stopped repository will have a status of Stopped and will be Local RO (Locally Read-only) at all nodes.

25. Repository synchronized start

Restarting replication after performing a Synchronized Stop requires that the stopped replication be started in a synchronized manner.

Click on a stopped repository and click on the Sync Start button.
The repository will stop being Local Read-only on all nodes and will resume replicating.

26. Logs

SVN MultiSite Plus has a number of points where SVN and replication events are logged:

Admin UI: Growl messages: The growl messages provide immediate feedback in response to a user's interactions with the Admin UI. Growls are triggered only by local events and will only display on the node (and in the individual browser session) in which the event was triggered.

Growl messages appear in the top right-hand corner of the screen and will persist for a brief period (15 seconds in most cases) or until the screen is refreshed or changed.
Always check the dashboard
If you are troubleshooting a problem we strongly recommend that you check the Dashboard's Replicator Tasks panel as well as the log files. While we added the growl messaging as a way of giving administrators an immediate alert for events as they happen, growls are not intended to be used as the main method of tracking failures or important system events.
Dashboard: Replicator Tasks: Events that are more complex and are not bound by user interactions may appear on the Dashboard's Replicator Tasks. Tasks may consist of a simple statement or (with a click on the Task name) a multi-line report.
Application Logs: Read more about Application logs
Replication Logs: Read more about Replication logs

SVN MultiSite Plus has two sets of logs: one set is used for the application, the other logs replication activity:

Application Logs

/opt/wandisco/svn-multisite-plus/

The general logs are chiefly produced by the watchdog process and contain messaging that is mostly related to getting SVN MultiSite Plus started up and running.
replicator -- logging the startup etc. of the replicator.
ui -- startup and everything to do with the UI, including in-use logging. Lightweight.

-rw-r--r-- 1 wandisco wandisco   88 Jan 15 16:53 multisite.log
-rw-r--r-- 1 wandisco wandisco  220 Jan 15 16:53 replicator.20140115-165324.log
-rw-r--r-- 1 wandisco wandisco 4082 Jan 15 16:53 ui.20140115-164517.log 
-rw-r--r-- 1 wandisco wandisco 1902 Jan 15 16:53 watchdog.log

multisite.log

Basic events that relate to the starting up/shutting down of SVN MultiSite Plus.
e.g.

        2014-01-15 16:45:17: [3442] Starting ui
        2014-01-15 16:53:24: [3571] Starting replicator

replicator.yyyymmdd-hhmmss.log

Events relating to the startup and shutdown of the replicator, and also to logging. This log never includes information about the actual operation of the replicator; for that, see the fsfswd.x.log files located in the replicator's own logs directory (see below).

watchdog.log

Logs the running of the watchdog process which monitors and maintains the running of the SVN MultiSite processes.

Replicator Logs

The logging system has been implemented using Simple Logging Facade for Java (SLF4J) over the log4j Java-based logging library. This change from java.util.logging has brought some benefits:

This change lets us collate data into specific package-based logs, such as a security log, application log, DConE messages etc.

Logging behavior is mostly set from the log4j properties file: /svn-multisite-plus/replicator/properties/log4j.properties

# Direct log messages to a file
log4j.appender.file=com.wandisco.vcs.logging.VCSRollingFileAppender
log4j.appender.file.File=fsfswd.log
log4j.appender.file.MaxFileSize=100MB
log4j.appender.file.MaxBackupIndex=10
log4j.appender.file.layout=org.apache.log4j.PatternLayout
log4j.appender.file.layout.ConversionPattern=%d{yyyy-MM-dd HH:mm:ss} %-5p %c{1}:%L - %m%n
log4j.appender.file.append=true

# Root logger option
log4j.rootLogger=INFO, file

This configuration controls how log files are created and managed.

The log file name is set to "fsfswd.log".
The maximum size of a log file is set at 100MB.
The maximum number of logs is limited to 10.
The VCSRollingFileAppender offers some benefits over log4j's default RollingFileAppender. It has a modified rollover behavior so that the log file "fsfswd.log" is saved out with a permanent file name (rather than being rotated). So when "fsfswd.log" reaches its maximum size it is saved away with the name "fsfswd.<number>.log.<RolloverTimeDateStamp>".
When the maximum number of log files is reached, the oldest log file is deleted.
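
To estimate how much disk space these settings can consume, multiply the maximum file size by the number of files kept. The following Python sketch reads the two relevant properties and does that calculation. The function names are hypothetical. It treats KB, MB and GB as powers of 1024, as standard log4j 1.2 does, and it assumes VCSRollingFileAppender interprets both properties in the same way; whether the active fsfswd.log counts towards the limit of 10 is not stated here, so allow for one extra file.

```python
def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def size_in_bytes(value):
    """Convert a log4j-style size such as '100MB' to bytes."""
    units = {"KB": 1024, "MB": 1024 ** 2, "GB": 1024 ** 3}
    upper = value.strip().upper()
    for suffix, factor in units.items():
        if upper.endswith(suffix):
            return int(upper[: -len(suffix)]) * factor
    return int(upper)


def rolled_log_budget(props, appender="log4j.appender.file"):
    """Approximate space used by saved logs: MaxFileSize x MaxBackupIndex."""
    size = size_in_bytes(props[appender + ".MaxFileSize"])
    count = int(props[appender + ".MaxBackupIndex"])
    return size * count
```

With the values shown above (100MB and 10), the result is 1,048,576,000 bytes, so plan for roughly 1 GB of log storage, and more if you raise the limits for debugging.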

Additional Log Destinations (Appenders)

Apache log4j provides Appender objects, which are primarily responsible for printing logging messages to different destinations such as consoles, files, sockets, NT event logs, etc.

Appenders always have a name so that they can be referenced from Loggers.

You can learn more about setting up appenders by reading through the Apache documentation - http://logging.apache.org/log4j/1.2/manual.html

We strongly recommend that you work with our support team before making any significant changes to your logging.

** Alert! ** Debug is chatty
If you enable the debug mode you should consider adjusting your log file limits (increasing the maximum file size and possibly the maximum number of files).

** Alert! ** Send logging
If it is possible, consider placing the log files on a separate file system.

Logging Levels

SEVERE: Message level indicating a serious failure.
WARNING: A message level indicating a potential problem.
INFO: Interesting runtime events (startup/shutdown). Expect these to be immediately visible on a console, so be conservative and keep to a minimum.
CONFIG: Details of configuration messages.
FINE: Provides a standard level of trace information.
FINER: Provides a more detailed level of trace information.
FINEST: Provides a boggling level of trace information for troubleshooting hard to identify problems.

Changing the Logging Level

It's possible to change the logging levels, either temporarily to help in a current investigation, or permanently if you want to change your ongoing logging. For making changes to logging, see 35. Logging Settings Tool.

It's still possible to modify log settings directly by editing the logger properties file:

/opt/wandisco/svn-multisite-plus/replicator/properties/logger.properties

Then restart the replicator.

tip Logs are managed per node
Log changes are not replicated between nodes, so each node has its own logging setup.

27. Consistency Check

The Consistency Check gives you a quick and easy way to check whether a selected repository remains in the same state across the nodes of a replication group. Follow these steps to check on consistency:

tip Limits of the Consistency Checker
The Consistency Check will tell you the last common revision shared between repository replicas. Given the dynamic nature of a replication group it's possible that there will be in-flight proposals in the system that have not yet been agreed upon at all nodes. For this reason it isn't possible for a consistency check to be completely authoritative.

Specifically, consistency checks should be made on replication groups that contain only Active (inc Active Voter) nodes. The presence of passive nodes will cause consistency checks to fail.

Log in to a node and click on the REPOSITORIES tab.

Go to the repository
Click on one of the listed repositories. This will activate the below line of buttons.

Consistency Check is done on a per node basis
Click on the Consistency Check button. A growl message "Invoking consistency check on repository <Repository Name>" will appear.

Consistency check in action

Known Issue: Don't run a consistency check if the repository has been removed from one of the nodes.
There's currently a problem with running a consistency check on a repository if the replica on one or more nodes has been deleted. In this situation a "Highest Common Revision" task will appear on the dashboard and will remain permanently in a 'pending' state. Until we resolve this problem you shouldn't run the consistency checker on a repository if it has been removed from the file system of any of your nodes.
Click on the DASHBOARD tab. The results of the consistency check will appear on the Replicator Tasks widget - you'll need to select All Tasks, instead of the default Pending Tasks.

Repository replicas need to be identical - are they?

Originating Node: The node from which the check was requested
Lowest Revision Checked: The oldest revision compared across all nodes
Number of Revisions Checked: Total number of revisions checked
Repository is Consistent: The result of the check as a 'true' or 'false' statement
Highest Revision Checked: The youngest revision compared across all nodes
Repository Being Checked: The name of the repository that has had its consistency checked

Log results

It's also possible to check the results of a consistency check by viewing the replicator's log file (fsfswd.##). See Logs

28. Copying repositories

This section provides advice on getting your repository data distributed prior to starting replication.

SVN installations must have:

These items are a recap of the installation checklist. Ensure you meet these requirements in order for replication to run effectively:

the same version of SVN server
matching file and directory level permissions on repositories
exactly matched contents of the svnroot directories between servers (including the repository UUID):
Specifically following this guide:

/conf

We strongly recommend that the contents match between replicas

/db

As this is where repository data is stored it is crucial that this is a perfect match between servers

hooks

Pre-commit Hooks
WANdisco's modified version of the FSFS libraries intercepts commits after any pre-commit hooks have run. This means that the pre-commit hook runs on the initiating node (in the server: Apache, svnserve, etc.) rather than in the replicator. Should a pre-commit hook fail, the server returns an error to the client before the FSFSWD intercept call. As a result, the replicator is never involved with failed pre-commit hooks, with the possible exception of protorev/abort notifications.

So if a commit (on the originating node) is delegated for replication a corresponding pre-commit hook will already have succeeded.

Post-commit Hooks
The replicator completes the commit on the originating node by invoking a JNI function, a low-level function that doesn't run any hooks. When the replicator returns the commit status to FSFSWD on the originating repository, a successful commit causes the post-commit hook to run on the server.

The net effect is that pre- and post-commit hooks run in the server on the originating repository and do not run at all for the replicated repositories, although a replicator could explicitly invoke the hooks for the replicated repositories if required.

locks

Locks must be sync'ed between nodes. You can't afford for a commit to be rejected on one site that was allowed on all the others.

Copying Existing Repositories

It's simple enough to make a copy of a small repository and transfer it to each of your nodes. However, remember that any changes made to the original repository will invalidate your copies unless you perform a synchronization prior to starting replication.

If a repository needs to remain available to users during the process, briefly halt access in order to make a copy. The copy can then be transferred to each node. Then, when you are ready to begin replication, use rsync to update each of your replicas. For more information about rsync, see Synchronizing repositories using rsync.

New Repositories

If you are creating brand new repositories, don't create them separately at each node. Instead, create the repository once, then rsync it to the other nodes. This ensures that each replica has the same UUID.

If you do create repositories at each node instead of using rsync, you can use svnadmin's setuuid command to get them all matching:

You can confirm the UUID of a repository using the svnlook uuid command:

[root@ip-10-0-100-6 SVN]# svnlook uuid Repo0
67d41b33-3c7c-4ba0-8af1-119dbb0d42ba

You can use the svnadmin setuuid command to ensure that a new repository that you've created has a UUID that matches the other replicas:

$ svnadmin setuuid /opt/SVN/Repo0 67d41b33-3c7c-4ba0-8af1-119dbb0d42ba
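
To check several replicas in one go, the comparison can be wrapped in a small function. This is a minimal sketch: `uuids_match` is a hypothetical helper, and the host name `node2` in the usage comment is a placeholder. It only compares strings, so the UUIDs themselves still come from `svnlook uuid` as shown above.

```shell
# Hypothetical helper: succeed only if every UUID given matches the first.
uuids_match() {
    reference=$1
    shift
    for uuid in "$@"; do
        if [ "$uuid" != "$reference" ]; then
            echo "UUID mismatch: $uuid != $reference" >&2
            return 1
        fi
    done
    return 0
}

# Example (not run here): compare the local replica with one on another node.
# "node2" is a placeholder host name.
# uuids_match "$(svnlook uuid /opt/SVN/Repo0)" "$(ssh node2 svnlook uuid /opt/SVN/Repo0)"
```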

29. Repair an out-of-sync repository

There are a number of situations where a repository may be corrupted or lose sync with its other copies -- this could be the result of file/permission changes on the server. In such an event the node on which this copy is situated will stop replicating data for that repository (other repositories will be unaffected and should continue to replicate.) SVN MultiSite Plus has a repair tool that can be used to quickly get the repository repaired and replicating again.

No option to repair?
If an existing repository is added to a Replication Group that contains Passive nodes, or a repository on a Passive node enters a Local Read-only state, the UI will not offer a repair option because it is unable to coordinate with the repository copy on the Passive node. The answer is to temporarily change the passive node into an active node:

Login to the Passive node, click on the Replication Group tab.
Click on the Configure button, then change the role of the passive node so that it becomes active.
Once the repair has completed successfully you can reverse this change in order to return to your established replication model.

Read more about the Replication Group settings.

Login to a node, click on the REPOSITORIES tab. A repository that is out-of-sync will be flagged as Local RO (Read-only), which signifies that the other replicas may continue to update. Note that the Status for Repo2 is marked as "Stopped" instead of "Replicating". Click on the Repair button.

Out of sync

The Repair Repository window will open. This runs through a three step procedure. First, select a 'helper' from the nodes that remain in replication. It may be worthwhile testing the helper before you choose it, to ensure that its copy of the repository is in fact the latest version. Once selected, click the Start Repair Process button. This will briefly take the selected node offline, to ensure that changes don't occur to the repository while you conduct the repair. At this point you need to log in and handle the repair manually.

Start the repair!

Use the good copy of the repository on the helper node, overwriting the out-of-date/corrupted copy. We recommend using rsync for this task. There's more about using rsync in the next chapter.

** Alert! ** Hooks will be overwritten
Take note that when restoring a repository using rsync, you will also copy across the "helper" repository's hooks, overwriting those on the destination node.

Need to maintain existing hooks?
Before doing the rsync, copy the hooks folder to somewhere safe. Then when you've completed the rsync, restore the backed-up hooks.
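
The backup-and-restore of the hooks folder could be scripted along these lines. This is only a sketch: `backup_hooks` and `restore_hooks` are hypothetical helper names, and the paths in the usage comment are examples rather than required locations.

```shell
# Hypothetical helpers for preserving a replica's hooks across an rsync.
backup_hooks() {
    repo=$1      # path to the repository on the node being repaired
    backup=$2    # directory in which to keep the saved copy
    # Refuse to nest a second copy inside an earlier backup.
    [ ! -e "$backup/hooks" ] || { echo "$backup/hooks already exists" >&2; return 1; }
    mkdir -p "$backup" || return 1
    # -p preserves modes and timestamps so hook scripts stay executable.
    cp -Rp "$repo/hooks" "$backup/"
}

restore_hooks() {
    repo=$1
    backup=$2
    # Only remove the copied-in hooks once the saved copy is confirmed present.
    [ -d "$backup/hooks" ] || { echo "no saved hooks in $backup" >&2; return 1; }
    rm -rf "$repo/hooks"
    cp -Rp "$backup/hooks" "$repo/"
}

# Example (not run here), on the node being repaired:
# backup_hooks /opt/repos/repo2 /root/repo2-hooks-backup
#   ... rsync from the helper node ...
# restore_hooks /opt/repos/repo2 /root/repo2-hooks-backup
```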

[root@localhost repos]#  rsync -rvlHtogpc /opt/repos/repo2/ root@172.16.2.41:/opt/repos/repo2/

The authenticity of host '172.16.2.41 (172.16.2.41)' can't be established.
RSA key fingerprint is 9a:07:b2:bb:b6:85:fa:93:41:f0:01:d0:de:8f:e1:5d.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added '172.16.2.41' (RSA) to the list of known hosts.
root@172.16.2.41's password:
sending incremental file list
./
README.txt
format
conf/
conf/authz
conf/passwd
conf/svnserve.conf
db/
db/current
db/format
db/fs-type
db/fsfs.conf
db/min-unpacked-rev
db/rep-cache.db
db/txn-current
db/txn-current-lock
db/uuid
db/write-lock
db/revprops/
db/revprops/0/
db/revprops/0/0
db/revprops/0/1
db/revprops/0/2
db/revprops/0/3
db/revs/
db/revs/0/
db/revs/0/0
db/revs/0/1
db/revs/0/2
db/revs/0/3
db/transactions/
db/txn-protorevs/
hooks/
hooks/post-commit.tmpl
hooks/post-lock.tmpl
hooks/post-revprop-change.tmpl
hooks/post-unlock.tmpl
hooks/pre-commit.tmpl
hooks/pre-lock.tmpl
hooks/pre-revprop-change.tmpl
hooks/pre-unlock.tmpl
hooks/start-commit.tmpl
locks/
locks/db-logs.lock
locks/db.lock

sent 1589074 bytes  received 701 bytes  167344.74 bytes/sec
total size is 1585973  speedup is 1.00
[root@localhost repos]#

Once the repository is updated you should check that the fixed repository now matches the version on your helper node.
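
One way to make that comparison is to compute a single digest over the repository's files on each node and compare the two values. The function below is a sketch under several assumptions: `tree_digest` is a hypothetical helper, it relies on GNU `find`, `sort`, `xargs` and `sha256sum` being installed, and it compares file contents only, not ownership or permissions.

```shell
# Hypothetical helper: print one digest summarising every file under a directory.
# rep-cache.db is skipped because it is a cache that may legitimately differ.
tree_digest() {
    (
        cd "$1" || exit 1
        # Sort in the C locale so both nodes hash files in the same order.
        find . -type f ! -name 'rep-cache.db' -print0 \
            | LC_ALL=C sort -z \
            | xargs -0 -r sha256sum \
            | sha256sum \
            | cut -d' ' -f1
    )
}

# Example (not run here): run on both nodes and compare the printed values.
# tree_digest /opt/repos/repo2
```

If the two digests differ, repeating the rsync before completing the repair is the safer course.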

From a terminal window, delete the repository's cache file.
```
<repository_name>/db/rep-cache.db
```
This step is not essential and could result in the repository becoming slightly larger, however it removes the risk that the repaired repository will not match with the cache file.

Restart Apache. This will free up file handles that are holding the rep-cache.db file open, as well as clearing any in-memory cache data that could point to references that don't exist in the repaired repository.
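
The cache removal can be wrapped in a small guard so that it only ever touches the cache file of a real repository. `clear_rep_cache` is a hypothetical helper, and the Apache restart command in the usage comment is an assumption for a Red Hat style system; use whatever your platform provides.

```shell
# Hypothetical helper: remove a repository's rep-cache.db if it exists.
clear_rep_cache() {
    repo=$1
    # Refuse to act on something that does not look like a repository.
    [ -d "$repo/db" ] || { echo "no db directory under $repo" >&2; return 1; }
    rm -f "$repo/db/rep-cache.db"
}

# Example (not run here). "service httpd restart" is an assumed restart command.
# clear_rep_cache /opt/repos/repo2 && service httpd restart
```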

At this point, complete the repair process. Go back and click the "Complete Repair Process" button.

Complete!

Looking back at the REPOSITORIES tab you'll now see that the problem repository is once again replicating.

Back in sync

30. Synchronizing repositories using rsync

If for any reason repositories are corrupted or unable to automatically catch up it's usually possible to use rsync to get them back into sync.

Use svnadmin verify to check a repository's integrity:

svnadmin verify <Repository-path>

From the node with the up-to-date repository, type the following commands:

rsync -rvlHtogpc /opt/SVN/repo remoteHost:/opt/SVN/

For example:
rsync -rvlHtogpc /SVN/Repo root@172.7.2.33:/SVN/

Then follow up with an additional rsync that ensures the contents of the locks directory are identical (by deleting locks that are not present on the originating server):

rsync -rvlHtogpc --delete /path/to/repo/db/locks remoteHost:/path/to/repo/db

For example:
rsync -rvlHtogpc --delete /SVN/Repo/db/locks root@172.7.2.33:/SVN/Repo/db
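
The two rsync steps above can be generated for any repository with a small helper that prints the commands for review rather than running them. This is a sketch: `print_sync_commands` is a hypothetical name, and it assumes the repository lives at the same path on the remote node.

```shell
# Hypothetical helper: print the two rsync commands for a repository.
# Assumes the remote node uses the same repository path as the local one.
print_sync_commands() {
    repo=${1%/}                  # strip a trailing slash so the directory itself is copied
    remote=$2                    # e.g. user@host
    parent=$(dirname "$repo")
    # Step 1: copy the repository directory into its parent on the remote node.
    printf 'rsync -rvlHtogpc %s %s:%s/\n' "$repo" "$remote" "$parent"
    # Step 2: make the locks directory identical, deleting stale remote locks.
    printf 'rsync -rvlHtogpc --delete %s/db/locks %s:%s/db\n' "$repo" "$remote" "$repo"
}

# Example:
# print_sync_commands /SVN/Repo root@172.7.2.33
```

Printing the commands first gives you a chance to check the source and destination paths, since a misplaced trailing slash changes where rsync writes the files.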

tip Knowledgebase
You can read a more detailed step-by-step guide to using rsync in the Knowledge Base article Reset and rsync SVN repositories.

31. Recover from the loss of a node

It's possible for SVN MultiSite Plus to recover from the brief outage of a member node, which should be able to resync once it is reconnected. The crucial requirement for MultiSite's continued operation is that agreement over transaction ordering must be able to continue. Votes must be cast and those votes must always result in an agreement - no situation must arise where the votes are evenly split between voters.

If after the loss of a node, a replication group can no longer form agreements then replication is halted. If the lost node was a voter, and there aren't enough remaining voters to form an agreement, then either the lost node must be repaired/reconnected, or the replication group must undergo emergency reconfiguration.
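
To get a feel for when the loss of a voter stops agreement, the arithmetic can be sketched as a strict-majority test. Note that this guide does not spell out the exact agreement rule used by the replication engine, so simple-majority voting is an assumption made purely for illustration, and `has_majority` is a hypothetical helper.

```shell
# Hypothetical helper: can the reachable voters still form a strict majority?
# Simple-majority voting is an assumption made for illustration.
has_majority() {
    total_voters=$1
    reachable_voters=$2
    needed=$(( total_voters / 2 + 1 ))
    [ "$reachable_voters" -ge "$needed" ]
}

# Example: three voters, one lost.
# has_majority 3 2 && echo "agreement still possible"
```

Under this assumption a group of three voters tolerates the loss of one, while a group of two voters cannot reach agreement once either is lost.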

Emergency Reconfiguration (EMR) is a final option for recovery

The emergency reconfiguration process can't be undone, and it represents a big shakeup of your replication system. Only complete an emergency reconfiguration if the lost node can not be repaired or reconnected in an acceptable amount of time.

** Alert! ** Gone but not forgotten
After a lost node has been removed and a replication group reconfigured, the lost node should not be allowed to come back online. Whilst the DConE replication engine will be unfazed by the presence of a rogue node, it could result in confusion or be mistaken for an active repository, when in fact it will receive no further updates from the other replicas. You should ensure that you perform a cleanup after completing an emergency reconfiguration.

** Alert! ** Last node standing
Any replication group which has its membership reduced to one node will continue to exist after the emergency reconfiguration as a non-replicating group. Once you have set up a replacement node you should be able to add it back to the group to restart replication.

** Alert! ** Only one at a time
The EMR procedure needs to be co-ordinated between sites/nodes. You must not start an EMR if an EMR procedure has already started from another node. Running multiple EMR procedures at the same time can lead to unpredictable results or cause the processes to get stuck.

Emergency Reconfiguration

So, having confirmed that an emergency reconfiguration is required, follow this procedure:

Verify the details of the node that is now declared 'lost'. Login to the administrator user interface of one of the remaining nodes and view the Nodes tab. The missing node will show a status of Disconnected.

Select the lost node by ticking its corresponding checkbox and then click the Emergency Reconfiguration button.

The Emergency Reconfiguration screen will appear. Check and confirm that you have selected the correct node, then click on the Start Reconfiguration button.

A warning will appear, asking you to confirm that you are ready to start the process, and that once started the process can not be cancelled. Click CONFIRM if you are ready to proceed.

The Reconfiguration process will now run, creating new, replacement replication groups, activating them, then removing the old groups. The process is finished when all items are listed as Complete. You can then navigate back to the Nodes tab.

How Reconfiguration Works
The emergency reconfiguration process seeks to recreate functional replication groups using the remaining member nodes. Where a replication group contained only two nodes, including the lost node, reconfiguration is not possible. In this scenario a new replication group will need to be created once a replacement node has been inducted.

You'll still see the removed node by clicking on the Display Removed Nodes button.

Finally, you should check the state of the reformed replication groups to ensure that they'll still perform according to your organization's requirements.

** Alert! ** Voter-only nodes and Emergency Reconfiguration
If you run an emergency reconfiguration on a replication group that contains a surviving node that is Voter-only, this node won't be able to detect a change to the schedule brought on by the removal of the problem node.

This problem will be fixed in a future release. For now there is a simple work-around. Login to the managing node and force a change in role for one of the remaining nodes. This change will trigger an update of the Voter-only node's schedule.

** Alert! ** When an EMR creates an unrecoverable configuration
Should performing an EMR result in the loss of all learners (nodes that are maintaining repository replicas) then the replication group is said to be 'beached'. Without any remaining learner nodes it's no longer possible to add the new learners that are required to restore replication. In this unlikely scenario you should delete the replication group after redeploying its repositories to a new group.
EMR ate my replication group
A replication group left with no learner nodes after an EMR can't be reconfigured, only deleted and then recreated.

Membership rules for Replication Group reconfiguration (EMR)

In the event of an emergency reconfiguration (EMR), it is probable that some nodes will undergo a change in role in order to maintain replication. The rules concerning role changes are as follows:

Acceptor-only nodes will remain acceptor-only nodes. Their role is never changed by an EMR.

EMR will not succeed if the process attempts to remove all the learners (active, active voter, passive, passive voter).

If EMR leaves the membership without a proposer, a learner-only node will be promoted to the role of proposer.

If EMR leaves the membership without an acceptor, a proposer-learner node will be promoted to the role of acceptor.

Restoring a lost node

At the end of an emergency reconfiguration you'll be replicating again on your remaining nodes. However, you'll want to get back to your original configuration, with the lost node restored. The following steps show you how to get this done - and why you can't readily reinstantiate a node purely from a backup image or by using a 'backup and restore settings' function.

Repair or replace your lost node server.

Ensure that the repaired or replaced server meets the prerequisites and, equally important, that it is running the same OS and core software (Apache, SVN, SSL, etc.) as your other nodes.

** Alert! ** Node identicality
To be clear, it is possible to run SVN MultiSite Plus on systems that have different setups. However, in doing so you introduce the risk of non-deterministic behavior, where an SVN transaction is played out differently on two or more nodes. This would quickly break replication, placing one or more nodes in a read-only state. We therefore make consistency between nodes a prerequisite.

Install SVN MultiSite Plus, using the users.properties file from an existing node. (See the installation guide for the different steps taken for second or subsequent nodes.)

** Alert! ** Do not reuse the old Node ID
The node's previous Node ID will persist in the replication system (flagged as a removed node). You can't therefore reuse it.

Once installed, run the new node through the induction process.

Add the new node to the same replication groups as its predecessor. Initially, make it a Passive type node. It's going to be catching up before you'll be able to commit directly to its replicas.

Use the repository repair process to get the node's repository replicas back to an up-to-date state. The Repair process requires that an existing node (a helper) stops replicating so that you can copy its replicas. Once those copies are in place on the restored node, both the helper and the restored node can then be allowed to catch up with any changes that have been made since the start of the repair process.

** Alert! ** Why we reinstall and restore instead of bringing up a backup
It is common for computer systems to be restored quickly by reinstantiating software from an image created as part of a periodic backup. This approach is not well suited to an environment that is both highly dynamic and distributed. When the DConE replication engine removes a problem node, it must ensure that the removal is permanent. Attempting to return a previously exiled node would almost certainly cause confusion and a loss of co-ordination.

32. Restore replication on a problem node.

It's possible that a problem on a single node could result in its copy of a repository being placed in a read-only mode. This would stop the repository from accepting changes, either from local users or via replication traffic from other nodes. If this happens, you can use the following procedure to get the repository to restart replication in which case it would automatically catch up with changes that have been made on the other nodes in the replication group.

The first sign that a transaction has not been able to complete on a node is when the repository is placed in a protective read-only state. This is done to ensure that it will remain in a condition in which it can be recovered and catch up. On the Repositories tab you'll see the repository is now flagged as locally read-only.

Repository Repo01 is flagged as local read-only
Providing there are still enough nodes to reach agreement, repository changes at the other nodes can continue to be made.
At the problem site, you now need to identify the cause of the problem. Check SVN MultiSite Plus's logs as well as the logs generated for SVN users who are trying to commit changes to the problem repository. It may be possible to quickly fix the cause, such as a permission problem that has prevented a file from being written on the node.
When the problem has been fixed you can go to the Repositories tab and edit the read-only repository. Remove the Local RO (Read-only) tick. The node will then attempt to catch up and get the repository back into sync with its other replicas.

33. Running Talkback

Talkback is a bash script that is provided in your SVN MultiSite Plus installation for use in the event that you need to talk to the WANdisco support team.

Manually run Talkback using the following procedure. You can run Talkback without the need for user interaction if you set up the variables noted in step 3, below:

Login to the server with admin privileges. Navigate to the SVN MultiSite Plus's binary directory:
```
/opt/wandisco/svn-multisite-plus/bin/
```
Run talkback.
```
[root@localhost bin]# ./talkback
```
You'll need to provide some information during the run - also note the environmental variables noted below which can be used to further modify how the talkback script runs:
```
#######################################################################
# WANdisco talkback - Script for picking up system & replicator       #
# information for support                                             #
#######################################################################

    To run this script non-interactively please set following environment vars:

    ENV-VAR:
    MSP_REP_UN                  Set username to login to MultiSite-Plus
    MSP_REP_PS                  Set password to login to MultiSite-Plus
    MSP_SUPPORT_TICKET          Set ticket number to give to WANdisco support team
    MSP_RUN_SVNADMIN            Run svnadmin verify, lstxns and lslocks commands - turned off by default

    By default, your talkback is not uploaded. If you wish to upload it, you may also specify
    the following variables:

    MSP_FTP_UN                  Set ftp username to upload to WANdisco support FTP server. Note that
                                specifying this may cause SSH to prompt for a password, so don't set
                                this variable if you wish to run this script non-interactively.


      ===================== INFO ========================
      The talkback agent will capture relevant configuration
      and log files to help WANdisco diagnose the problem
      you may be encountering.

Please enter replicator admin username: adminUIusername  
Please enter replicator admin password: thepasswordhere

retrieving details for repository "Repo1"
retrieving details for repository "Repo3"
retrieving details for repository "Repo4"
retrieving details for repository "repo2"
retrieving details for node "NodeSanFrancisco"
retrieving details for node "NodeAuckland"
retrieving details for node "NodeParis"

Please enter your WANdisco support FTP username (leave empty to skip auto-upload process):
Skipping auto-FTP upload

TALKBACK COMPLETE

---------------------------------------------------------------
 Please upload the file:

     /opt/wandisco/svn-multisite-plus/talkback-201312191119-redhat6.3-64bit.tar.gz

 to WANdisco support with a description of the issue.

 Note: do not email the talkback files, only upload them
 via ftp or attach them via the web ticket user interface.
--------------------------------------------------------------
```
Note that we have disabled the svnadmin check because in some situations it can impede the rapid collection of system data. If you want to turn it back on, set the corresponding environment variable as follows.

Enter the following string to switch the SVNAdmin checks back on:
```
    export MSP_RUN_SVNADMIN=true
    
```
and then run the talkback. You can check the status of the variable by entering:
```
        
    echo "$MSP_RUN_SVNADMIN"
    
```
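
For an unattended run, it can help to check the environment before invoking the script. The sketch below uses the variable names listed in the talkback banner above. Treating the first three as required, and refusing to continue when MSP_FTP_UN is set, are assumptions based on that banner text. `talkback_env_ready` is a hypothetical helper and the values in the usage comment are placeholders.

```shell
# Hypothetical pre-flight check before a non-interactive talkback run.
talkback_env_ready() {
    missing=0
    for name in MSP_REP_UN MSP_REP_PS MSP_SUPPORT_TICKET; do
        # Indirect lookup of the variable called $name.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "$name is not set" >&2
            missing=1
        fi
    done
    # MSP_FTP_UN should stay unset: the banner warns it may cause a password prompt.
    if [ -n "${MSP_FTP_UN:-}" ]; then
        echo "MSP_FTP_UN is set; the run may prompt for a password" >&2
        missing=1
    fi
    [ "$missing" -eq 0 ]
}

# Example (not run here); the values are placeholders.
# export MSP_REP_UN=admin MSP_REP_PS=secret MSP_SUPPORT_TICKET=12345
# talkback_env_ready && /opt/wandisco/svn-multisite-plus/bin/talkback
```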
Also, you'll need to talk to Support about setting up access to WANdisco's Support FTP space.

Don't send talkback files via email
If you're not using our secure FTP you can upload your talkback output files to our support website. Just attach them to your case. Read our Knowledgebase article about How to raise a support case.

Talkback Output Example
```
replicator
        config
            application
            license
            logger.properties
            ms-resource-monitoring-elements.xml
            ms-resource-monitoring-elements.xml.old
            replicator-api-authorization.properties
            svnok.catalog
            ui.properties
        nodes
            NodeAuckland
                connection-test
                location.xml
                node.xml
            NodeParis
                connection-test
                location.xml
                node.xml
            NodeSanFrancisco
                connection-test
                location.xml
                node.xml
        recent-logs
            fsfswd.0.log
            replicator.log.20130716-105414.211
            svn-multisite
            thread-dump-2013-07-16
            ui.log.20130716-105414
            
        repositories
            Repo1
                info
                membership.xml
                replicationGroup.xml
                repository.xml
                statemachine.xml
                stats.xml              
        application
        license.xml
        locations.xml
        md5s
        memberships.xml
        nodes.xml
        replicationGroups.xml
        replicator-file-list
        repositories.xml
        statemachine.xml
        tasks.xml
        VERSION
                
system
    logs
    file-max
    file-nr
    limits.conf
    netstat
    processes
    services
    sysctl.conf
    sys-status
    top
```
34. Replication over a bad WAN link

Nodes that fall behind will eventually recover

SVN MultiSite Plus runs with a smart commit strategy and ignores all read operations, so activities such as checkouts never impact upon WAN traffic. This, along with network optimization, can allow deployments to provide developers with near-LAN-speed performance over a WAN for write operations at every location, while keeping all of the repositories in sync. If the connection to a particular node is temporarily lost or experiences extreme latency or low speeds, it's possible that the node could fall behind and become temporarily out of sync while transactions are queued up.

In this situation the node should eventually catch up in a self-healing manner without administrator intervention. It is worth monitoring the state of your WAN connectivity to gain assurance that replication is going to be able to catch up. Clearly, if connectivity drops to almost zero for a prolonged period then this will inevitably result in the node becoming isolated and increasingly out-of-sync. If this happens you should monitor traffic for a period of time, contact WANdisco's support team and start considering contingencies such as making network changes or removing the isolated node from replication, potentially using the Emergency Reconfiguration procedure.

35. Logging Settings Tool

Loggers are usually attached to packages. Here, the level for each package is specified. The global level is used by default, so levels specified here simply act as an override that takes effect in memory only - unless saved to the logger properties file.

Edit Global Logger Settings
1. Login to the admin console, click on the Settings tab.
2. Scroll down the settings till you reach the Logging Settings block.
3. Click on the Configure button.
4. The Logging Settings Config page will open. Click on the drop-down menu to change the current global logger setting. This change will be applied to all loggers that have not been specified in the edited Logger settings. Loggers that you Add or Edit (specify) will always override this global setting.
  
  Add or Edit Logger Settings
  1. Login to the admin console, click on the Settings tab.
  2. Scroll down the settings till you reach the Logging Settings block.
  3. Click on the Configure button.
  4. The Logging Settings Config page will open, it has the following sections:
    Add New Logger Settings
    
    Enter the name of the logger, assign its level then click the Add button.
    
    Edit Existing Logger Settings
    
    Use the corresponding drop-down list to change the level of any of the existing loggers or click the Delete button to remove the logger.
    All changes made so far take effect immediately but are held in memory only. They do not persist after a replicator restart unless you save them. Use the following buttons to reload or save the settings:
    Reload Logging Settings
    
    Click the button to discard all changes by reloading the logger settings from the <install-dir>/replicator/properties/logger.properties file.
    
    Save Logging Settings
    
    Click Save Logging Settings to apply your changes to the above logger.properties file.
    
    Edit Global Logging Level
    
    Allows for a change to the global logging level, although not the deletion of logger settings.
    
    36. Disable external authentication
    
    In the event that you need to disable LDAP or Kerberos authentication and return your deployment to the default internally managed users, use the following procedure.
    1. Open a terminal on your node. Navigate to the replicator directory.
      $ cd /opt/wandisco/svn-multisite-plus/replicator/
    2. Run the following command-line utility.
      $ java -jar resetSecurity.jar
    3. You'll be asked for new administrator credentials then prompted to restart the replicator in order for the change to be applied.
    4. Now log in using the original authentication form.
    Copyright © 2010-2014 WANdisco plc.
    All Rights Reserved
    This product is protected by copyright and distributed under licenses restricting copying, distribution and decompilation.
    
    SVN MultiSite Plus
    Last doc build: 13:06 - 16th May 2014