3. Administration Guide

This Admin Guide describes how to set up and use WANdisco's WD Fusion.

3.1 Housekeeping

This section covers basic operations for running a WD Fusion deployment, including commands and tools that help you set up and maintain replicated directories.

Starting up

To start WD Fusion UI:

  1. Open a terminal window on the server and log in with suitable file permissions.
  2. Locate the fusion-ui-server script in the /etc/init.d folder:
    rwxrwxrwx  1 root root    47 Apr 10 16:05 fusion-ui-server -> /opt/wandisco/fusion-ui-server/bin/fusion-ui-server 
  3. Run the script with the start command:
    [root@localhost init.d]#  ./fusion-ui-server start
    Starting fusion-ui-server:. [ OK ]
    WD Fusion starts. Read more about the fusion-ui-server init.d script.
  4. Alternatively, you can invoke the service directly, e.g.
    service fusion-ui-server stop/start

Shutting down

To shut down:

  1. Open a terminal window on the server and log in with suitable file permissions.
  2. Locate the fusion-ui-server script in the init.d folder:
    rwxrwxrwx  1 root root    47 Dec 10 16:05 fusion-ui-server -> /opt/wandisco/fusion-ui-server/bin/fusion-ui-server
  3. Run the stop script:
    [root@redhat6 init.d]#  ./fusion-ui-server stop
    stopping fusion-ui-server:                                   [  OK  ]
    [root@redhat6 init.d]#
    The process shuts down.
Shutdowns take some time

The shutdown script attempts to stop processes in order before completing; as a result you may find that (from WD Fusion 2.1.3) shutdowns take up to a minute to complete.

init.d management script

The start-up script for persistent running of WD Fusion is in the /etc/init.d folder. Run the script with the help command to list the available commands:

[root@redhat6 init.d]# service fusion-ui-server help
  usage: ./fusion-ui-server (start|stop|restart|force-reload|status|version)

start Start Fusion services
stop Stop Fusion services
restart Restart Fusion services
force-reload Restart Fusion services
status Show the status of Fusion services
version Show the version of Fusion

Check the running status (with current process ID):

[root@redhat6 init.d]# service fusion-ui-server status
Checking delegate:not running                              [  OK  ]
Checking ui:running with PID 17579                         [  OK  ]

Check the version:

[root@redhat6 init.d]# service fusion-ui-server  version
1.0.0-83

Managing Services through the WD Fusion UI

Providing that the UI service is running, you can stop and start WD Fusion through the Fusion Nodes tab.

WD Fusion UI Login

The UI for managing WD Fusion can be accessed through a browser, providing you have network access and the port that the UI is listening on is not blocked.

http://<url-for-the-server>:<UI port>

e.g.
http://wdfusion-static-0.dev.organisation.com:8083/ui/

You should not need to add the /ui/ at the end; you should be redirected there automatically.

dashboard

Login using your Hadoop platform's manager credentials.

Login credentials

Currently you need to use the same username and password that are required for your platform manager, e.g. Cloudera Manager or Ambari. In a future release we will separate WD Fusion UI from the manager and use a new set of credentials.

LDAP/Active Directory and WD Fusion login

If your Cloudera-based cluster uses LDAP/Active Directory to handle authentication then please note that a user that is added to an LDAP group will not automatically be assigned the corresponding Administrator role in the internal Cloudera Manager database. A new user in LDAP that is assigned an Admin role will, by default, not be able to log in to WD Fusion. To be allowed to log in, they must first be changed to an administrator role type from within Cloudera Manager.

No sync between CM and LDAP
There is no sync between Cloudera Manager and LDAP in either direction, so a user who loses their Admin privileges in LDAP will still be able to login to WD Fusion until their role is updated in Cloudera Manager. You must audit WD Fusion users in Cloudera Manager.

Administrators will need to change any user in the Cloudera Manager internal database (from the Cloudera Manager UI) to the required access level for WD Fusion. Please note the warning given above, that changing access levels in LDAP will not be enough to change the admin level in WD Fusion.

Authentication misalignment

There are four possible scenarios concerning how LDAP authentication can align and potentially misalign with the internal CM database:

User has full access in CM, denied access in WD Fusion UI
  • User is in the Full Administrator group in LDAP
  • User is left as the default read-only in the internal Cloudera Manager database
User has full access in CM, full access in WD Fusion UI
  • User is in the Full Administrator group in LDAP
  • User is changed to Full Administrator in the internal Cloudera Manager database
User has read-only access in CM, denied access to WD Fusion UI
  • User is removed from the Full Administrator group in LDAP and added to the read-only group
  • User is left as the default read-only in the internal Cloudera Manager database
User has read-only access to CM, Full access to WD Fusion UI
  • User is removed from the Full Administrator group in LDAP and added to the read-only group
  • User is set as Full Administrator in the internal Cloudera Manager database
Clearly this scenario represents a serious access control violation; administrators must audit WD Fusion users in Cloudera Manager.

Checking cluster status on the dashboard

The WD Fusion UI dashboard provides a view of WD Fusion's status. From the world map you can identify which data centers are experiencing problems, track replication between data centers or monitor the usage of system resources.

For more details on what each section of the Dashboard shows, see the Reference section for the Dashboard.

dashboard

The UI Dashboard will indicate if there are problems with WD Fusion on your cluster.

Server Logs Settings

The WD Fusion logs that we display in the WD Fusion UI are configured by properties in the ui.properties file.

membership

Logging

Default paths:

logs.directory.fusion /var/log/fusion/server/
logs.directory.ihc /var/log/fusion/ihc
logs.directory.uiserver /var/log/fusion/ui

Configure log directory

By default the log location properties are not exposed in the ui.properties file. If you need the UI server to look in different locations for the log files, you can add the following properties to ui.properties. To be clear, these entries do not set alternate locations for WD Fusion to write its logs; they only ensure that the UI server can still read the logs in the event that they are moved:

logs.directory.fusion
sets the path to the WD Fusion server logs.
logs.directory.uiserver
sets the path to the UI server logs.
logs.directory.ihc
sets the path to the ihc server logs.

The file is read by the UI server on start up, so you will need to restart the server for changes to take effect. The ui.properties file is not replicated between nodes so you must currently set it manually on each node.
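
A minimal sketch of the ui.properties entries, assuming the logs had been relocated under /data/logs (these paths are purely illustrative):

logs.directory.fusion=/data/logs/fusion/server
logs.directory.uiserver=/data/logs/fusion/ui
logs.directory.ihc=/data/logs/fusion/ihc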

Logging at startup

At startup the default log location is /dev/null. If there's a problem before log4j has initialised, this will result in important logs getting lost. You can set the log location to a filespace that preserves early logging.

Edit fusion_env.sh adding paths to the following properties:

SERVER_LOG_OUT_FILE
Path for WD Fusion server log output
IHC_LOG_OUT_FILE
Path for IHC server log output
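
For example, a sketch of the fusion_env.sh entries, using illustrative filenames under the default log directories:

SERVER_LOG_OUT_FILE=/var/log/fusion/server/server_startup.out
IHC_LOG_OUT_FILE=/var/log/fusion/ihc/ihc_startup.out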

Induction

Induction is the process used to incorporate new nodes into WANdisco's replication system. The process is run at the end of a node installation, although it is also possible to delay the process, then use the + Induct link on the Fusion Nodes tab.

Use this procedure if you have installed a new node but did not complete its induction into your replication system at the end of the installation process.

  1. Login to one of the active nodes, clicking on the Fusion Nodes tab. Click the + Induction button.
    WD Fusion Deployment
  2. Enter the fully qualified domain name of the new node that you wish to induct into your replication system.
    WD Fusion Deployment
    Fully Qualified Domain Name
    The full domain name for the new node that you will induct into your replication system.
    Fusion Server Port
    The TCP port used by the WD Fusion application for configuration and reporting, both internally and via REST API. The port needs to be open between all WD Fusion nodes and any systems or scripts that interface with WD Fusion through the REST API.
    Click Start Induction.
  3. When the induction process completes, the Fusion Node tab will refresh with the new node added to the list.

Induction Failure

The induction process performs some validation before running. If this validation fails you will quickly see warning messages appear.
WD Fusion Deployment

Automatic Induction Failure
If the induction process can't connect to the new node using the details provided, a failure will happen instantly. This could happen because of an error in the new node's installation, however it could also be caused by the node being kerberized.
We also could not reach any of our standard ports
If connections can't be made on specific Fusion ports, they will be listed here. If none of the standard ports are reachable then you will be warned that this is the case.
Induction problems
For help troubleshooting problems, see Handling Induction Failure.

Additional entry fields are shown, so that you can retry the induction using a wider selection of node properties:

Fully Qualified Domain Name
the full hostname for the server.
Node ID
A unique identifier that will be used by WD Fusion UI to identify the server.
Location ID
This is the unique string (e.g. "db92a062-10ea-11e6-9df2-4ad1c6ce8e05") that appears on the Node screen (see below).
DConE Port
The TCP port used by the replication system. It needs to be open between all WD Fusion nodes. Nodes that are situated in zones that are external to the data center's network will require unidirectional access through the firewall.

Node properties

WD Fusion Deployment
If you click on an individual node on the Fusion Nodes tab, you will drill down to a Node screen that displays all the node's settings.

3.2 Troubleshooting

This section details how to diagnose and fix problems that may occur in deployment. It's important that you check the Release Notes for any known issues in the release that you are using. See Release Notes.

Troubleshooting Overview

  1. Read the logs
  2. Run Talkback then send the results to WANdisco's support team
  3. Common Problems
  4. Kerberos Troubleshooting

Read the logs

There are a number of log files that provide information that will be necessary in finding the cause of many problems.

The log files for WD Fusion are spread over three locations. Some processes contain more than one log file for the service. All pertinent log files are captured by running the WANdisco talkback shell script that is covered in the next section.

WD Fusion Server Logs

The logs on the WD Fusion server record events that relate to the data replication system.

Log locations:
/var/log/fusion/server
Primary log(s)
fusion-dcone.log.0
- this is the live log file for the running WD Fusion server process.
Historical logs:
The following logs are listed for completeness but are not generally useful for monitoring purposes.
fusion-dcone.log.x
- the log file is rotated once its file size reaches 200MB. The "x" in the filename is an incrementing number, starting at 1. Rotation is presently defaulted at 200MB with a retention of 100 files, although this can be customised.

fusion-server.log
- a log of application-level events, such as Kerberos authentication and license validation.
fusion-server.log.yyyy-mm-dd
log_out.log
- this is the output redirected from the STDOUT and STDERR of the java invocation. This is used to capture exceptions that occur before logging could start.

WD Fusion UI Server Logs

The WD Fusion user interface layer, responsible for handling interactions between the administrator, WD Fusion and the Hadoop Management layer.

Log locations:
/var/log/fusion/ui/
Primary log(s):
fusion-ui.log
Historical logs:
fusion-ui.log.x

The UI logs will contain errors such as failed access to the user interface, connectivity errors between the user interface and the WD Fusion Server's REST API, and other errors encountered while performing administrative actions across the UI.

Inter-Hadoop Connect (IHC) Server Logs

Responsible for streaming files from the location of the client write to the WD Fusion server process in any remote cluster to which Hadoop data is replicated.

Log location
/var/log/fusion/ihc
/var/log/fusion/ihc/server
Primary log(s):
server/fusion-ihc-ZZZ-X.X.X.log
- The live IHC process log files. The components of the filename are as follows:
ZZZ - Hadoop distribution marker (hdp, cdh, phd, etc). This will be "hdp" for a Hortonworks integrated cluster.
X.X.X - A matching cluster version number. This will be "2.2.0" for a Hortonworks 2.2 cluster.
Historical logs
server/fusion-ihc-ZZZ-X.X.X.log.yyyy-mm-dd
log_out.log
This log file contains details of any errors encountered by the process when reading from HDFS in the local cluster, such as access control violations, or network write errors when streaming to the WD Fusion server in any remote cluster.

Log analysis

This is the standard format of the WANdisco log messages within Fusion. It includes an ISO8601-formatted timestamp of the entry and the log level / priority, followed by the log entry itself. The log levels, in order of severity (highest to lowest), that you may observe are:
  • PANIC
  • SEVERE
  • ERROR
  • WARNING
  • INFO

For log analysis and reporting, logs at the PANIC, SEVERE and ERROR levels should be investigated. Warning-level messages indicate an unexpected result has been observed but one that hasn't impacted the system's continued operation. Additional levels may exist, but are used in cases when the logging level has been increased for specific debug purposes. Otherwise, other levels should be treated as informational (INFO).

Quickly picking out problems

One simple thing that can be done is to grep the log file for any instance of "exception" and/or "PANIC" - this will tell the administrator a great deal without much effort. Using something like:

cat /var/log/fusion/server/fusion-dcone.log.0 | egrep -i "exception|panic"

Talkback

Talkback is a bash script that is provided in your WD Fusion installation for gathering all the logs and replication system configuration that may be needed for troubleshooting problems. Should you need assistance from WANdisco's support team, they will ask for an output from Talkback to begin their investigation.

Talkback location

You can find the talkback script in the WD Fusion server's installation directory:

$ cd /opt/wandisco/fusion/server/
You can run talkback as follows:
$ sudo talkback.sh

If a cluster has Kerberos security enabled (Talkback will detect this from WD Fusion's configuration), you may be asked for Kerberos details needed to authenticate with the cluster.

You will be asked to complete the following details:

  • Location to store the talkback to. Suggest /tmp if acceptable disk space is available.
    Reserve plenty of storage
    Note, WD Fusion talkbacks can exceed 300MB compressed, but well over 10GB uncompressed (due to logs). /tmp may or may not be suitable.
  • Kerberos keytab location.
  • User to perform kinit with when obtaining kerberos ticket.
  • Whether you wish to perform a HDFS fsck, or not. Option 1 for yes, option 2 for no.

Running talkback

To run the talkback script, follow this procedure:

  1. Log into the Fusion server. If you're not logged in as root, use sudo to run the talkback script, e.g.
    [vagrant@supp26-vm1 ~]$ sudo /opt/wandisco/fusion/server/talkback.sh 
        #######################################################################
        # WANdisco talkback - Script for picking up system & replicator       #
        # information for support                                             #
        #######################################################################
     
        To run this script non-interactively please set following environment vars:
     
        ENV-VAR:
        FUSION_SUPPORT_TICKET          Set ticket number to give to WANdisco support team
        FUSION_TALKBACK_DIRECTORY      Set the absolute path directory where the tarball will be saved
        FUSION_KERBEROS_ENABLED        Set to "true" or "false"
        FUSION_PERFORM_FSCK            Set to "true" or "false" to perform a file system
                                       consistency check
     
    Which directory would you like the talkback tarball saved to? /tmp
     
          ===================== INFO ========================
          The talkback agent will capture relevant configuration
          and log files to help WANdisco diagnose the problem
          you may be encountering.
      
    Retrieving current system state information
    Kerberos is enabled
    Kerberos is enabled. Please provide the absolute path to the keytab you wish to use to obtain a ticket:
    /etc/security/keytabs/hdfs.headless.keytab
    Please provide the corresponding username for the keytab located /etc/security/keytabs/hdfs.headless.keytab:
    hdfs
    Performing kinit as user:  hdfs
    Gathering information from Fusion endpoints
    Protocol is:  http
    Hostname is:  supp26-vm1dddd
    Port is:  8082
    retrieving details for node "supp26-vm0_2"
    retrieving details for node "supp25-vm1_59"
    retrieving details for node "supp25-vm0_61"
    retrieving details for node "supp26-vm1_20"
    Copying Fusion server log files, this can take several minutes.
    Copying Fusion IHC log files, this can take several minutes.
    Would you like to include hadoop fsck? This can take some time to complete and may drastically increase the size of the tarball.
    1) Yes
    2) No
    #? 2
    Running sysinfo script to capture maximum hardware and software information...
    Gathering Summary info....
    Gathering Kernel info....
    Gathering Hardware info....
    Gathering File-Systems info....
    Gathering Network info....
    Gathering Services info....
    Gathering Software info....
    Gathering Stats info....
    Gathering Misc-Files info....
    THE FILE sysinfo/sysinfo_supp26-vm1-20160428-132245.tar.gz HAS BEEN CREATED BY sysinfo
    tar: Removing leading `/' from member names
     
    TALKBACK COMPLETE
     
    ---------------------------------------------------------------
     Please upload the file:
     
         /tmp/talkback-201604281321-supp26-vm1.lcx.tar.gz
     
     to WANdisco support with a description of the issue.
     
     Note: do not email the talkback files, only upload them
     via ftp or attach them via the web ticket user interface.
    --------------------------------------------------------------
      
  2. Follow the instructions for uploading the output on WANdisco's support website.
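
Based on the environment variables listed in the script's banner, a non-interactive run might look like the following sketch (the ticket number is hypothetical and the values will depend on your cluster):

sudo env FUSION_SUPPORT_TICKET=12345 \
         FUSION_TALKBACK_DIRECTORY=/tmp \
         FUSION_KERBEROS_ENABLED=false \
         FUSION_PERFORM_FSCK=false \
         /opt/wandisco/fusion/server/talkback.sh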

Common Problems

We list current known issues here, along with advice on fixes or workarounds:

Moving objects between mismatched filesystems

If you move objects onto the distributed file system you must make sure that you use the same URI on both the originating and destination paths. Otherwise you'd see an error like this:

[admin@vmhost01-vm1 ~]$ hadoop fs -mv /repl2/rankoutput1 fusion:///repl2/rankoutput2/
15/05/13 21:22:40 INFO client.FusionFs: Initialized FusionFs with URI: fusion:///, and Fs: hdfs://vmhost01-vm1.cluster.domain.com:8020. FileSystem: DFS[DFSClient[clientName=DFSClient_NONMAPREDUCE_-721726966_1, ugi=admin@DOMAIN.EXAMPLE (auth:KERBEROS)]]
mv: `/repl2/rankoutput1': Does not match target filesystem
If you use the fusion:/// URI on both paths it will work, e.g.
[admin@vmhost01-vm1 ~]$ hadoop fs -mv fusion:///repl2/rankoutput1 fusion:///repl2/rankoutput1
15/05/13 21:23:27 INFO client.FusionFs: Initialized FusionFs with URI: fusion:///, and Fs: hdfs://vmhost01-vm1.cluster.domain.com:8020. FileSystem: DFS[DFSClient[clientName=DFSClient_NONMAPREDUCE_-1848371313_1, ugi=admin@DOMAIN.EXAMPLE (auth:KERBEROS)]]
Note that since the non-replicated directory doesn't yet exist in ZONE2, it will be created without the files it contained on the originating zone. When running WD Fusion, moving a non-replicated directory into a replicated directory will not work unless you use the fusion:/// URI on both paths.

You can't move files between replicated directories
Currently you can't perform a straight move operation between two separate replicated directories.

Handling file inconsistencies

WD Fusion's replication technology ensures that changes to data are efficiently propagated to each zone. There are, however, a few cases where objects in the distributed file system can lose consistency. WD Fusion can be set to schedule periodic consistency checks, or an administrator can trigger a check from the Admin UI or via the REST API.

If an inconsistency is found then the administrator needs to use the repair functions available through the WD Fusion UI, or manually repair the issue using whatever system tools correspond with the Hadoop application. This may require that up-to-date files are manually copied over from one zone to overwrite the corrupted version of the files. In some cases files will need to be deleted/removed in order to restore consistency. You will need to follow the guidelines and documentation that correspond with your underlying applications, e.g. MapR, Hive etc.

Consistency Checks look at file size, not content
The current implementation of the Consistency Check tool compares the size of files between zones. We're looking carefully at how we can implement a qualitative check that can specifically identify file corruption while not greatly impacting performance.
Repairs on large files
Please note that when very large files are repaired, it may appear that the process has stalled with different numbers of appends getting reported, post-completion. We recommend that you allow repair operations plenty of time to complete.

Username Translation
If any nodes that take part in a consistency check have the Username Translation feature enabled, then inconsistencies in the "user" field will be ignored.

Transfer reporting

When looking at transfer reporting, note that there are situations involving HFlush/early file transfer where transfer logs will appear incorrect. For example, the push threshold may appear to be ignored. This could happen if an originating file is closed and renamed before pulls are triggered by the HFlush lookup. Note that although this results in confusing logs, those logs are in fact correct; you would see only two appends, rather than the number determined by your push threshold - one at the very beginning, and one from the rename, which pulls the remainder of the file. What is happening is optimal; all the data is available to be pulled at that instant, so we might as well pull all of it at once instead of in chunks.

Fine-tuning Replication

WANdisco's patented replication engine, DConE, can be configured for different use cases, balancing between performance and resource costs. The following section looks at a number of tunable properties that can be used to optimize WD Fusion for your individual deployment.

Increasing thread limit

WD Fusion processes agreements using a set number of threads, 20 by default, which offers a good balance between performance and system demands.

It is possible, in cases where there are many Copy agreements arriving at the same time, that all available threads become occupied by the Copy commands. This will block the processing of any further agreements.

You can set WD Fusion to reserve more threads, to protect against this type of bottleneck situation:

Increase executor.threads property

  1. Make a backup copy of WD Fusion's applications config file /opt/wandisco/fusion-server/applications.properties, then open the original in your preferred text editor.
  2. Modify the property executor.threads.
    Property: executor.threads
    Description: The number of threads executing agreements in parallel.
    Permitted values: 1 to Integer.MAX_VALUE
    Default: 20
    Checked at: Startup
    An example of this edit is sketched after this procedure.

    WD Fusion Server snippet

    Don't go alone
    Any upward adjustment will clearly increase the resourcing costs. Before you make any changes to DConE properties, you should open up discussions with WANdisco's support team. Applying incorrect or inappropriate settings to the replication system may result in hard to diagnose problems.
  3. Save your edited applications.properties file, then restart WD Fusion.
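
As referenced in step 2, here is a minimal sketch of the edit, assuming applications.properties uses standard key=value syntax (confirm against your own file, and agree any change with WANdisco support first):

# /opt/wandisco/fusion-server/applications.properties
# Raise the number of threads executing agreements in parallel (default: 20)
executor.threads=40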

Tuning Writer Re-election

Only one WD Fusion node per zone is allowed to write into a particular replicated directory. The node that is assigned to do the writing is called the writer. See more about the role of the writer.

Should the current writer suddenly become unavailable, a re-election process begins to assign the role to one of the remaining nodes. Although the re-election process is designed to balance speed against system resource usage, there may be deployments where the processing speed is critical. For this reason, the re-election timing can be tuned with the following system properties:

Tunable properties

writerCheckPeriod
The period of time (in seconds) between writer check events. Default: 60.
writerCheckMultiple
The number of check events that will fail before initiating an election. Default: 3.

Setting the writer re-election period

The period of time between a writer going off-line and another writer being elected and starting to pick up is writerCheckPeriod * writerCheckMultiple, i.e. the default is 3 minutes (writerCheckPeriod 60s x writerCheckMultiple 3).

If you feel these default settings cause the system to wait too long before kicking off a re-election then you can update them using an API call:

curl -X POST "http://.../fusion/fs/properties/global?path=<mapped path>&writerCheckPeriod=<new period>&writerCheckMultiple=<new multiple>"

You can adjust these properties to be optimal for your deployment. However, consider the following pointers:

  • Setting the properties so that the period is very short will ensure that if a writer is lost, a new writer will be brought into action so quickly that there should be no impact on replication. However, very short periods are likely to result in a larger number of false alarms, where writer re-elections are triggered unnecessarily.
  • Setting the properties so that the period is very long will ensure that a re-election only takes place if the current writer is really "out for the count", however, a long delay between the loss of the writer and a new writer picking up could be very detrimental in some situations, such as where very large numbers of small files are being replicated between zones.
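
As a concrete sketch of the API call described above (the hostname, path and values are hypothetical), the following would shorten the re-election window to 1 minute:

curl -X POST "http://fusion01.example.com:8082/fusion/fs/properties/global?path=/repl1&writerCheckPeriod=30&writerCheckMultiple=2"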

Handling Induction Failure

In the event that the induction of a new node fails, here is a possible approach for manually fixing the problem using the API.

Requirements: A minimum of two nodes with a fusion server installed and running, without having any prior knowledge about the other. This can be verified by querying <hostname>:8082/fusion/nodes

Steps:

Generate an xml file (we'll call it induction.xml) containing an induction ticket with the inductor's details. (Generally the inductor port should not change, but this is the port that all DConE traffic uses. You can find this in your application.properties file as application_port.)

<inductionTicket>
  <inductorNodeId>${NODE1_NODEID}</inductorNodeId>
  <inductorLocationId>${NODE1_LOCATIONID}</inductorLocationId>
  <inductorHostName>${NODE1_HOSTNAME}</inductorHostName>
  <inductorPort>6789</inductorPort>
</inductionTicket>
Send the xml file to your inductee:
curl -v -s -X PUT -d@${INDUCTION.XML} -H "Content-Type: application/xml" http://${NODE2_HOSTNAME}:8082/fusion/node/${NODE2_IDENTITY}
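
For example, with hypothetical hostnames and identifiers substituted for the variables, you might first confirm the inductor's node and location IDs, then send the ticket to the inductee:

curl http://node1.example.com:8082/fusion/nodes

curl -v -s -X PUT -d@induction.xml -H "Content-Type: application/xml" http://node2.example.com:8082/fusion/node/node2-identity-example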

MEMBERSHIP

Requirements: A minimum of two nodes that have been inducted.

Steps:

Generate an xml file (we'll call it membership.xml) containing a membership object. DConE supports various configurations of node roles, but for the time being the Fusion UI only supports <Acceptor, Proposer, Learner> and <Proposer, Learner>. If you choose to have an even number of <Acceptor, Proposer, Learner> nodes you must specify a tiebreaker.

<membership>
  <membershipIdentity>${MEANINGFUL_MEMBERSHIP_NAME}</membershipIdentity>
  <distinguishedNodeIdentity>${NODE1_NODEID}</distinguishedNodeIdentity>
  <acceptors>
    <node>
      <nodeIdentity>${NODE1_NODEID}</nodeIdentity>
      <nodeLocation>${NODE1_LOCATIONID}</nodeLocation>
    </node>
    <node>
      <nodeIdentity>${NODE2_NODEID}</nodeIdentity>
      <nodeLocation>${NODE2_LOCATIONID}</nodeLocation>
    </node>
  </acceptors>
  <proposers>
    <node>
      <nodeIdentity>${NODE1_NODEID}</nodeIdentity>
      <nodeLocation>${NODE1_LOCATIONID}</nodeLocation>
    </node>
    <node>
      <nodeIdentity>${NODE2_NODEID}</nodeIdentity>
      <nodeLocation>${NODE2_LOCATIONID}</nodeLocation>
    </node>
  </proposers>
  <learners>
    <node>
      <nodeIdentity>${NODE1_NODEID}</nodeIdentity>
      <nodeLocation>${NODE1_LOCATIONID}</nodeLocation>
    </node>
    <node>
      <nodeIdentity>${NODE2_NODEID}</nodeIdentity>
      <nodeLocation>${NODE2_LOCATIONID}</nodeLocation>
    </node>
  </learners>
</membership>
Send the xml file to one of your nodes:
curl -v -s -X POST -d@${MEMBERSHIP.XML} -H "Content-Type: application/xml" http://${NODE_HOSTNAME}:8082/fusion/node/${NODE_IDENTITY}/membership

STATEMACHINE

Requirements: A minimum of two nodes inducted together and a membership created that contains them (you'll want to make a note of the membership id of your chosen membership).

Steps:
Generate an xml file (we'll call it statemachine.xml) containing a fsMapping object.
<replicatedDirectory>
  <uri>${URI_TO_BE_REPLICATED}</uri>
  <membershipId>${MEMBERSHIP_ID}</membershipId>
  <familyRepresentativeId>
    <nodeId>$NODE1_ID</nodeId>
  </familyRepresentativeId>
</replicatedDirectory>

Send the xml file to one of your nodes:

curl -v -s -X POST -d@${STATEMACHINE.XML} -H "Content-Type: application/xml" http://${NODE1_HOSTNAME}:8082/fusion/fs

Emergency bypass to allow writes to proceed

If WD Fusion is down and clients use the HDFS URI, then further writes will be blocked. The emergency bypass feature gives the administrator an option to bypass WD Fusion and write to the underlying file system, which will introduce inconsistencies between zones. This is suitable for when short-term inconsistency is seen as a lesser evil compared to blocked progress.

The inconsistencies can then be fixed later using the Consistency and Repair process(es). A client that is allowed to bypass to the underlying filesystem will continue to bypass for the duration of the retry interval. Long-running clients will automatically reload configurations at a hardcoded 60 second interval. Thus it is possible to disable and enable the bypass on-the-fly.

Enable/disable emergency bypass via the UI

  1. Log in to the Fusion UI and go to the Settings tab. Click Client Bypass Settings.
    WD Fusion Deployment

    Emergency bypass via the UI.

  2. Tick the Enable fusion bypass checkbox. This will enable two entry fields for configuration: WD Fusion Deployment

    Emergency bypass via the UI.

    Bypass response time
    The time (in seconds) that will pass before the client will bypass WD Fusion. Default: 14.
    Bypass retry interval
    The time (in seconds) before the client attempts to use WD Fusion, again. Default: 60.
  3. Click Update to save your changes.

Enable/disable emergency bypass via manual configuration change

In core-site.xml add the following properties:

<property>
<name>fusion.client.can.bypass</name>
<value>true or false; default is false</value>
</property>
<property>
<name>fusion.client.bypass.response.secs</name>
<value>integer number representing seconds; default is 14</value>
</property>
<property>
<name>fusion.client.bypass.retry.interval.secs</name>
<value>integer number representing seconds; default is 60</value>
</property>
The properties are also listed in the Reference Section.
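
For example, a sketch that enables the bypass while keeping the default timings (the values shown are illustrative):

<property>
  <name>fusion.client.can.bypass</name>
  <value>true</value>
</property>
<property>
  <name>fusion.client.bypass.response.secs</name>
  <value>14</value>
</property>
<property>
  <name>fusion.client.bypass.retry.interval.secs</name>
  <value>60</value>
</property>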

Kerberos Troubleshooting

Kerberos Error with MIT Kerberos 1.8.1 and JDK6 prior to update 27

Prior to JDK6 Update 27, Java fails to load the Kerberos ticket cache correctly when using MIT Kerberos 1.8.1 or later, even after a kinit.

The following exception will occur when attempting to access the Hadoop cluster.

WARN ipc.Client: Exception encountered while connecting to the server : javax.security.sasl.SaslException:
GSS initiate failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)]

The workaround is to upgrade to JDK6 Update 27 or later or, after running kinit, to renew the ticket with kinit -R so that the credentials cache is rewritten in a format that Java can read.

Uninstall WD Fusion

In cases where you need to remove WD Fusion from a system, use the following script:

/opt/wandisco/fusion-ui-server/scripts/uninstall.sh

  • The script is placed on the node during the installation process.
  • You must run the script as root or invoke sudo.
  • Running the script without using an additional option performs the following actions:
    Default uninstall
    1. Stops all WD Fusion related services
    2. Uninstalls the WD Fusion, IHC and UI servers
    3. Uninstalls any Fusion-related plugins (See Plugins)
    4. Uninstalls itself. You'll need to handle backups manually from this point

Usage

Example

sudo CONFIG_BACKUP_DIR=/data/my_config_backup LOG_BACKUP_DIR=/data/my_log_backup /opt/wandisco/fusion-ui-server/scripts/uninstall.sh -c -l -p

See below for a full explanation of each option:

Uninstall with config purge

Running the script with -p will also include the removal of any configuration changes that were made during the WD Fusion installation.

Reinstallation
Use the purge (-p) option in the event that you need to complete a fresh installation.

As the purge option will completely wipe your installation, there's a backup option that can be run to back up your config files, which gives you an easier method for recovering your installation:

Backup config/log files

Run the script with the -c option to back up your config and -l to back up WD Fusion logs. The files will be backed up to the following location:

/tmp/fusion_config_backup/fusion_configs-YYYYMMDD-HHmmss.tar.gz
Change the default save directory
You can change the locations that the script uses for these backups by adding the following environmental variables:
CONFIG_BACKUP_DIR=/path/to/config/backup/dir
LOG_BACKUP_DIR=/path/to/log/backup/dir

Dry run

Use the -d option to test an uninstallation. This option lets you test the effects of an uninstallation without any actual file changes being made. Use this option to be sure that your uninstallation will do what you expect.

Help

Running the script with -h outputs a list of options for the script.

[sysadmin@localhost ~]$ sudo /opt/wandisco/fusion-ui-server/scripts/uninstall.sh -h
Usage: /opt/wandisco/fusion-ui-server/scripts/uninstall.sh [-c] [-l] [-p] [-d]
 -c: Backup config to '$CONFIG_BACKUP_DIR' (default: /tmp/fusion_config_backup).
 -d: Dry run mode. Demonstrates the effect of the uninstall without performing the requested actions.
 -h: This help message.
 -l: Backup logs to '$LOG_BACKUP_DIR' (default: /tmp/fusion_log_backup).
 -p: Purge config, log, data files, etc to leave a cleaned up system.

4. Managing Replication

WD Fusion is built on WANdisco's patented DConE active-active replication technology. DConE sets a requirement that all replicating nodes that synchronize data with each other are joined in a "membership". Memberships are co-ordinated groups of nodes where each node takes on a particular role in the replication system.

For more information about DConE and its different roles see the reference section's chapter called A Paxos Primer.

4.1 Create a membership

  1. Log in to the WD Fusion UI. Click on the Membership tab. Click on the Create New tab. The "New Membership" window will open that will display the WD Fusion nodes organized by zone.
    membership

    Create Membership1

  2. Configure the membership by selecting which nodes should be acceptors. Acceptors vote on the ordering of changes.
    membership

    Note how a two-node membership requires that one of the nodes be upgraded to a Distinguished Node.

    For some guidance on the best way to configure a membership read Create Resilient Memberships in the reference section.

    membership
  3. Click Create to complete the operation. Click Cancel to discard the changes.
  4. Identical memberships are not allowed
    You will be prevented from creating more than 1 membership with a particular configuration.
    membership

    Guide to node types

    APL
    Acceptor - the node will vote on the order in which replicated changes will play out.
    Proposer - the node will create proposals for changes that can be applied to the other nodes.
    Learner - the node will receive replication traffic that will synchronize its data with other nodes.
    PL
    Proposer - the node will create proposals for changes that can be applied to the other nodes.
    Learner - the node will receive replication traffic that will synchronize its data with other nodes.
    Distinguished Node
    Acceptor + - the distinguished node is used in situations where there is an even number of nodes, a configuration that introduces the risk of a tied vote. The Distinguished Node's bigger vote ensures that it is not possible for a vote to become tied.

    4.2 Replicated Folders

    WD Fusion allows selected folders within your hdfs file system to be replicated to other data centers in your cluster. This section covers the set up and management of replicated folders.

    Create a replicated folder

    The first step in setting up a replicated folder is the creation of a target folder:

    1. In each zone, create a directory in the hdfs file space. To avoid permission problems, ensure that the owning user/group are identical across the zones. Use Hadoop's filesystem command to complete the tasks:
      hadoop fs -mkdir /user/hiver
      hadoop fs -chown -R hiver:groupname /user/hiver
      
    2. As user hdfs, run the following commands on each data center:
      hadoop fs -mkdir /user/hiver/warehouse-replicated
      hadoop fs -chown hiver:hiver /user/hiver/warehouse-replicated
      
      This ensures that a universal system user has read/write access to the hdfs directory warehouse-replicated that will be replicated through WD Fusion.

    Create Rule

    1. Once the folder is in place on all nodes, login to WD Fusion's UI on one of the WD Fusion nodes and click on the Replicated Folders tab.
    2. Click on the + Create button. membership

      Create Rule

    3. The replicated folder entry form screen will appear. membership

      Create Rule

      Navigate the HDFS File Tree (1), on the right-hand side of the New Rule panel, to select your target folder, created in the previous section. The selected folder will appear in the Path entry field. You can, instead, enter the full path to the folder in the Path field.

      Next, select two or more zones from the Zones list (2). You then select a Membership from the dropdown selector. If there's no existing membership with the combination of Zones that you selected, then you will see the message:
      There are no memberships available matching your criteria.
      In this case you can create a new membership, see 4.1 Create a membership and restart the Create Replicated Folder process.

    4. You can now complete the creation of the Replicated folder by clicking on the Create button. However, there are some additional options available on the Advanced Options panel. Consider if you need to apply any Advanced Options for the folder.

      Note that the allocated writer for this zone is listed under the Advanced Options panel. This can be useful information in case you need to troubleshoot replication problems.
      membership
      These include Preserve Origin Block Size, which is used for columnar storage formats such as Parquet, and Preserve Replication Factor, which is used when you want replica data to continue to use the replication factor that is set on its originating cluster, rather than the factor that applies on the new cluster. Exclude from replication ? lets you set an "exclude pattern" to indicate files and folders in your replicated folder that you don't want to be replicated. If you apply any Advanced Options you need to click the Update button to make sure that they are applied.
      Known Issue: Add exclusions after you have created a folder
      If you need to set up exclusions, set them up after you have created the replicated folder. If you enable them during the folder's creation they will be ignored. This issue will be fixed in a coming release.
      FUI-2414
      The option Override Consistency Check Interval allows administrators to set a consistency check interval that is specific to the replicated folder space and different from the default value that is set in the Consistency Check section of the Settings tab.
      Known Issue: Now fixed
      The minor fault with the Replicated folder Advanced option for Overriding the Consistency Check interval where enabling the option fixes the interval to 6 hours, regardless of what value you enter, has been fixed in version 2.6.6
      FUI-1984


      Path interpretation

      If the path contains a leading slash "/", we assume it is an absolute path; if it contains no leading slash then we assume it is a relative path and the root directory will be added to the beginning of the exclusion.

    5. If you didn't complete a consistency check on the selected folder, you may do so now. membership

      Replicate to Zones

    6. After the completion of a consistency check, the Consistency column will report the consistency status.
      membership

      Replicated folder status

    Edit/ View Replicated Folder

    If you click on the View link for a Replicated Folder, then you enter a tabbed UI:

    View/Edit

    membership

    The View/Edit tab lets you make changes to selected properties of the Replicated Folder:

    Writer for this zone
    Indicates which node is set to handle writes for this zone.
    Path
    The file path for the replicated folder in question.
    Zones
    The zones that are replicated between, for the corresponding folder.
    Membership
    The membership used to define the replication.
    Advanced Options
    Various advanced options that can be set for a replicated folder. See Advanced Options.

    Consistency Check

    The Consistency Check tab offers access to the consistency repair tool. membership

    Source of truth
    From the available zones, you must choose the one that represents the most up-to-date state.
    Resolve
    Once you have selected from the available zones, click the Resolve button.
    membership

    You will see a confirmation message concerning your choice of repair. There is a checkbox that lets you choose to Preserve extraneous files. Click Confirm to complete the repair.

    membership

    After clicking Confirm, you will get a rundown of the state of each zone, after the repair has been completed.

    Custom Consistency Check

    Use the Custom Consistency Check to select a sub-directory of the Replicated Directory and check that it is in a consistent state across the membership.

    Path
    Shows the path to be checked
    HDFS File Tree
    Use the HDFS File Tree to select the directory to be checked.
    Outcome
    Note: When running a custom consistency check, there may be a delay before results are shown. Stay on this page to see the results.

    Please select a path and click "Check Now".
    membership
    Outcome
    The Outcome panel will now report on the number of inconsistencies. You will be invited to "Click for a full report".

    File Transfers

    The File Transfer panel shows the movement of data coming into the zone.
    membership

    Repair

    The repair tab provides a tool for repairing an inconsistency between the available zones. Run through the following procedure to perform a repair: membership

    1. Select the Source of truth from the drop-down. This will flag one of the available zones as most up-to-date / most correct in terms of stored data.
    2. Select from one of two Resolution types, Recursive or Preserve
      Recursive
      If the checkbox is ticked, this option will cause the path and all files under it to be made consistent. The default is true, but it is ignored if the path represents a file.
      Preserve
      If the checkbox is ticked, when the repair is executed in a zone that is not the source zone, any data that exists in that zone but not the source zone will be retained and not removed. The default is false, i.e., to make all replicas of the path consistent by removing all data in the non-source zone(s) that does not exist in the source.

    Checking repair status

    It's possible to generate a report on the current state of a repair. Follow the procedure outlined below:

    You can access repairs by invoking the following API mount point:

    <node-hostname>:8082/fusion/fs/repairs

    Parameters

    path
    The path for which the list of repairs should be returned. The default value is the root path, "/".
    recursive
    If true, also get repairs done on descendants of path. This option is false by default.
    showAll
    Whether or not to include past repairs for the same file. The options are "true" to show all repairs on the given path, and "false" to show only the last repair.
    sortField
    The field by which the entries in the RepairListDTO should be sorted. The options are to sort by the "startTime" or "path" property. The default value is "path".
    sortOrder
    The order in which the entries should be sorted according to the sort field. The options are to sort in ASC (ascending) or DESC (descending) order.
    return
    A RepairListDTO representing a list of repairs under path.
    Command-line only
    The Repair status tool is currently only available through the command-line. In the next release the functionality will be added to the Fusion UI.
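
    Putting the parameters together, a request might look like the following sketch (the hostname and path are hypothetical):

    curl "http://fusion01.example.com:8082/fusion/fs/repairs?path=/repl1&recursive=true&showAll=false&sortField=startTime&sortOrder=DESC"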

    Configure Hadoop

    Once WD Fusion has been installed and set up, you will need to modify your Hadoop applications so that when appropriate, they write to your replicated folder.

    configure hadoop

    Configure Hadoop applications to write to the replicated file space.

    Configure for High Availability Hadoop

    If you are running Hadoop in a High Availability (HA) configuration then you should run through the following steps for WD Fusion:

    1. Enable High Availability on your Hadoop clusters. See the documentation provided by your Hadoop vendor, i.e. - Cloudera (via QJM) or Hortonworks.
      The HA wizard does not set the HDFS dependency on ZooKeeper
      Workaround:
      • Create and start a ZooKeeper service if one doesn't exist.
      • Go to the HDFS service.
      • Click the Configuration tab.
      • In the Service-Wide category, set the ZooKeeper Service property to the ZooKeeper service.
    2. Edit WD Fusion configuration element 'fusion.underlyingFs' to match the new nameservice ID in the cluster-wide core-site.xml in your Hadoop manager.
      E.g., change:
      <property>
              <name>fusion.underlyingFs</name>
              <value>hdfs://vmhost08-vm0.cfe.domain.com:8020</value>
      </property>
      To:
      <property>
              <name>fusion.underlyingFs</name>
              <value>hdfs://myCluster</value>
      </property>
      
    3. Click Save Changes to commit the changes.
    4. If Kerberos security is installed make sure the configurations are there as well: Setting up Kerberos with WD Fusion.
    5. You'll need to restart all Fusion and IHC servers once the client configurations have been deployed.

    Known issue on failover

    Where High Availability is enabled for the NameNode and WD Fusion, when the client attempts to fail over to the Standby NameNode it generates a stack trace that outputs to the console. As the WD Fusion client can only delegate the method calls to the underlying FileSystem object, it isn't possible to properly report that the connection has been reestablished. Take care not to assume that a client has hung; it may, in fact, be in the middle of a transfer.

4.3 Reporting

The following section details the reporting tools that WD Fusion currently provides.

4.3.1 Consistency Check

The consistency check mechanism lets you verify that replicated HDFS data is consistent between sites. Read about Handling file inconsistencies.

Consistency Checks through WD Fusion UI

Username Translation
If any nodes that take part in a consistency check have the Username Translation feature enabled, then inconsistencies in the "user" field will be ignored.

NameNode Settings

Replication Rules table - indicates if inconsistencies are detected.

Consistency

Consistency Status
A status which links to the consistency check report. It can report Check Pending, Inconsistent, Consistent or Unknown.
Last Check:
Shows the time and date of the check that produced the current status. By default, Consistency checks are automatically started every 24 hours.
Next Check:
Shows the time and date of the next automatically scheduled Consistency Check. Remember, you don't need to wait for this automatic check, you can trigger a consistency check at any time through the Consistency Check tool.

Click on the report link to get more information about the current consistency check results.

Fix inconsistencies with the Consistency Check tool

WD Fusion's Consistency Check tool includes a feature for resolving any inconsistencies that are detected across the distributed file system. Use the following procedure to resolve any such inconsistencies:

  1. Start by completing a fresh Consistency Check. Select the inconsistent object using the corresponding check box, then click on the Consistency Check button. After a few moments you'll get an up-to-date report on inconsistency. NameNode Settings

    Consistency Check

  2. To fix an inconsistency, click on the Inconsistent link in the Consistency column.
    NameNode Settings

    Inconsistent

  3. The inconsistency is shown in terms of object properties. NameNode Settings

    Consistency Check

    Path:
    The absolute path for the object.
    Length:
    The size of the object.
    Is a directory:
    Identifies if the object is a directory (true) or a file (false).
    Owner:
    System account that owns the object.
    Group:
    System group associated with the object(s)
    Permission:
    File permissions for the object.
  4. Compare the various states of the inconsistent element across your cluster. You need to decide which zone(s) have a correct/up-to-date copy of the element, then select the zone under the Source of truth column. Click Resolve.
    NameNode Settings

    Confirm Consistency Check

  5. You'll get a confirmation prompt that will confirm which copies will be overwritten and which zone will source the file. Click Confirm to complete the fix or click Cancel to stop the process.
    NameNode Settings

    Consistency Check

  6. If you clicked Confirm then the fix operation will begin. The UI will indicate Fix requested. NameNode Settings

    Consistency Check

  7. Rechecking the Consistency will now confirm that the object is now consistent across all zones.
    NameNode Settings

    Consistency Check

  8. NameNode Settings

    Consistency Check

    NameNode Settings

    Consistency Check

4.3.2 File Transfer Report

As a file is being pulled into the local zone, the transfer is recorded in the WD Fusion server and can be monitored for progress.

Use the REST API filter by the replicated path and sort by ascending or descending "complete time" or "start time":

GET /fusion/fs/transfers?path=[path]&sortField=[startTime|completeTime]&order=[ascending|descending]
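
For example (the hostname and path are hypothetical):

curl "http://fusion01.example.com:8082/fusion/fs/transfers?path=/repl1&sortField=completeTime&order=descending"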

File transfer Report Output

Example output showing an in-progress and completed transfer:

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<fileTransfers>
    <fileTransfer>
        <startTime>1426020372314</startTime>
        <elapsedTime>4235</elapsedTime>
        <completeTime>1426020372434</completeTime>
        <username>wandisco</username>
        <familyRepresentativeId>
            <nodeId>dconefs5-1</nodeId>
            <dsmId>93452fe3-c755-11e4-911e-5254001ba4b1</dsmId>
        </familyRepresentativeId>
        <file>/tmp/repl/isoDEF._COPYING_</file>
        <remoteFs>hdfs://vmhost5-vm4.frem.wandisco.com:8020</remoteFs>
        <origin>dc1</origin>
        <size>4148166656</size>
        <remaining>4014477312</remaining>
        <bytesSec>3.3422336E7</bytesSec>
        <percentRemaining>96.77714626516683</percentRemaining>
        <state>in progress</state>
    </fileTransfer>
    
    <fileTransfer>
        <startTime>1426019512082</startTime>
        <elapsedTime>291678</elapsedTime>
        <completeTime>1426019803760</completeTime>
        <username>wandisco</username>
        <familyRepresentativeId>
            <nodeId>dconefs5-1</nodeId>
            <dsmId>93452fe3-c755-11e4-911e-5254001ba4b1</dsmId>
        </familyRepresentativeId>
        <file>/tmp/repl/isoABC</file>
        <remoteFs>hdfs://vmhost5-vm4.frem.wandisco.com:8020</remoteFs>
        <origin>dc1</origin>
        <size>4148166656</size>
        <remaining>0</remaining>
        <bytesSec>1.4221733E7</bytesSec>
        <percentRemaining>0.0</percentRemaining>
        <state>complete</state>
    </fileTransfer>
</fileTransfers>		

Output key with data type

Username
System user performing the transfer. (String)
File name
Name of the file being transferred. (String)
Remote FS
The filesystem of the originating node. (URI)
Origin
The file's originating Zone. (String)
Size
The cumulative size of data transferred. (Long)
Appends
The number of appends that have been made to the file being transferred. (Long)
AppendSize
The size of the latest append.
Remaining
Remaining bytes still to be transferred for the latest append. (Long)
Percent remaining
Percentage of the file still to be transferred. (Double)
Bytes/Sec
The current rate of data transfer, i.e. Amount of file downloaded so far / elapsed download time. (Long)
State
One of "in progress", "incomplete", "completed", "appending", "append complete", "deleted" or "failed". (TransferState)
In progress: means we are performing an initial pull of the file.
Appending: means data is currently being pulled and appended to the local file.
Append completed: means all available data has been pulled and appended to the local file, although more data could be requested later.

Note: files can be renamed, moved or deleted while we pull the data, in which case the state will become "incomplete".
When the remote file is closed and all of its data has been pulled, the state will then change to "Complete".
If a file is deleted while we are trying to pull the end state will be "deleted".
If the transfer fails the state will be "failed".
Start Time
The time when the transfer started. (Long)
Elapsed Time
Time that has so far elapsed during the transfer. Once the transfer completes it is then a measure of the time between starting the transfer and completing. (Long)
Complete Time
During the transfer this is an estimate for the complete time based on rate of through-put so far. Once the transfer completes this will be the actual time at completion. (Long)
Delete Time
If the file is deleted then this is the time the file was deleted from the underlying filesystem. (Long)

Record retention

Records are not persisted and are cleared up on a restart. The log records are truncated to stop an unbounded use of memory, and the current implementation is as follows:
For each state machine, if there are more than 1,000 entries in its list of transfers we remove the oldest transfers, sorted by complete time, which are in a terminal state ("completed", "failed" or "deleted") until the size of the list is equal to 1,000. The check on the number of records in the list is performed every hour.

4.4 Deleting memberships

It is currently not possible to delete memberships that are no longer required. Currently, removing memberships would potentially break the replication system.

4.5 Bandwidth management

For deployments that are run under an enterprise license, additional tools are available for monitoring and managing the amount of data transferred between zones.

Enterprise License only The Bandwidth Management tools are only enabled on clusters that are running on an Enterprise license. See the Deployment Checklist for details about License Types.

Overview

The bandwidth management tools provide two additional areas of functionality to support Enterprise deployments.

  • Limit the rate of outgoing traffic to each other zone.
  • Limit the rate of incoming traffic from each other zone.

Any applicable bandwidth limits are replicated across your nodes and applied on a per-zone basis.

Fusion11

Fusion Nodes - when Enterprise license is in use.

The Fusion Nodes screen will display current incoming traffic for the local zone. You will need to log in to the WD Fusion UI on a node within each Zone to see all incoming traffic levels.

Setting up bandwidth limits

Use this procedure to set up bandwidth limits between your zones.

  1. Click on the Set bandwidth limit button for each corresponding zone.
    Fusion11
  2. The Maximum bandwidth dialog will open. For each remote zone you can set maximum Outgoing to and Incoming from values. Entered values are in Megabits per second; these are converted into Gigabytes per hour and displayed in brackets after each entry field.
    Fusion11

    Maximum bandwidth entry dialog.

    Outgoing to
    The provided value will be used as the bandwidth limit for data being sent to the target zone.
    Incoming from
    As it is only possible to actually limit traffic at source, the Incoming from value is applied at the target zone as the Outgoing to limit for data being sent to the present zone.
  3. When you have set your bandwidth values, click Update to apply these settings to your deployment.
    Fusion11

    Maximum bandwidth entry dialog.

5. Settings

    Set up a Custom Disk Monitor

    Use this procedure to set up a custom monitor in WD Fusion UI's Disk Monitor tool.

    The Monitoring Data tool monitors the disk usage of the WD Fusion software, providing a basic level of protection against it consuming all disk space. The tool also lets you set up your own monitors for user-selected resources.

    Disk Monitor - not intended as a final word in system protection
    The disk monitor is no substitute for dedicated, system-wide monitoring tools. Instead, it is intended to be a 'last stand' against possible disk space exhaustion that could lead to data loss or corruption.

    Read our Recommendations for system-wide monitoring tools.
    1. Log in to the WD Fusion UI. Click on the Settings tab.
    2. Click on Disk Monitoring at the top of the side menu.
      NameNode Settings

      Settings - Disk monitor

    3. Click Create.
      NameNode Settings

      Settings - Disk monitor

    4. Enter the required details for setting up a disk monitor.
      NameNode Settings

      Settings - Disk monitor

      File system path
      Enter the full path of the system directory that will be monitored for disk usage.
      Severity level
      Select a system log severity level (Severe, Warning, Info or Debug) that will correspond with the Disk Capacity Threshold.

      Caution: Assigning a monitor with the Severe level will impact operation should its Disk Capacity Threshold trigger be met. The affected WD Fusion node will immediately shut down to protect its file system from corruption. Ensure that Severe level monitors are set up with a threshold that corresponds to serious risk; set the threshold too low and you may find that WD Fusion nodes are shut down needlessly.

      Disk Capacity Threshold (bytes)
      The maximum amount of data that can be consumed by the selected system path before the monitor sends an alert message to the log file.
      Message
      A human-readable message that will be sent to the log at the point that the Disk Capacity Threshold is reached.
    5. You can set a monitor to have multiple trigger points. Click + Add another severity monitor and add an additional Severity level, Disk Capacity Threshold and Message. You can have a separate monitor for each Log level.
      Monitor Settings

      Settings - Additional Disk monitors

    Edit a Disk Monitor

    You can make changes to an existing custom monitor by clicking on the Edit link for the monitor.

    Monitor Settings

    Settings - Change it

    Caution: You can't delete or modify the default monitor, which protects the system from disk space exhaustion caused by the temporary files created in the WANdisco replication directory /DConE/consensusNode.

    Delete a Disk Monitor

    You can delete a custom monitor by clicking on the Edit or Remove link on the existing custom monitor.

    Remove Settings

    Settings - Remove it


    On the edit screen, click Remove Monitor to remove the entire custom monitor. It is possible to remove individual rules from the monitor, although you need to remove them in reverse order of severity using the Remove bottom monitor button.
    Remove Settings

    Settings - "Remove Monitor"

    Change the UI Settings

    You can change how you interact with WD Fusion UI through the browser. Use the following procedure to change either the HTTP or HTTP SSL port that is used to view the UI through a browser.

    1. Log in to the WD Fusion UI. Click on the Settings tab.
    2. Click on UI Settings link on the side menu.
    3. Enter a new HTTP Port or HTTP SSL Port.
      Change UI Settings 1

      Settings - Change it

    4. Click Update. You may need to update the URL in your browser to account for the change you just made.

    Changing the WD Fusion server settings

    The server settings give you control over traffic encryption between WD Fusion and IHC servers.

    Server Settings

    Enable SSL for WD Fusion

    The following procedure is used for setting up SSL encryption for WD Fusion. The encryption will be applied between all components: Fusion servers, IHC servers and clients.

    The procedure must be followed for each WD Fusion server in your replication system, in turn.

    1. Login to WD Fusion UI, click on the Settings tab.
    2. Click the Enable SSL for WD Fusion checkbox.
      Server Settings
    3. Enter the details for the following properties: Server Settings
      KeyStore Path
      Path to the keystore.
      e.g. /opt/wandisco/ssl/keystore.ks
      KeyStore Password
      Encrypted password for the KeyStore.
      e.g. ***********
      Key Alias
      The Alias of the private key.
      e.g. WANdisco
      Key Password
      Private key encrypted password.
      e.g. ***********
      TrustStore Path
      Path to the TrustStore.
      e.g. /opt/wandisco/ssl/keystore.ks
      TrustStore Password
      Encrypted password for the TrustStore.
      e.g. ***********
    4. Click Update to save the settings. Repeat the steps for all WD Fusion servers.
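    If you don't already have a keystore at the KeyStore Path and TrustStore Path entered above, you can create one with Java's keytool. A minimal sketch (the path, alias and password are placeholders that should match the values you enter in this screen):
      keytool -genkeypair -keyalg RSA -alias wandisco -keystore /opt/wandisco/ssl/keystore.ks -storepass <YOUR PASSWORD> -validity 3650
    Because the keystore and TrustStore can be the same file, a single keystore created this way can serve both roles.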

    Changing SSL Settings

    Any changes that you make to the SSL settings must be applied manually, through the UI, on every other WD Fusion node. An update to the SSL settings will apply changes to the core-site file via the management endpoint (Cloudera Manager, Ambari, etc.). You may be required to make manual changes to configuration files and restart some services.

    Setting up SSL

    What follows is a manual procedure for setting up SSL. In most cases it has been superseded by the above Fusion UI-driven method. If you make changes using the following method, you will need to restart the WD Fusion server in order for the changes to appear on the Settings tab.

    1. Create the keystores / truststores. Every Fusion Server and IHC server should have a keystore with a private key entry / certificate chain for encrypting and signing. Every Fusion Server and Fusion Client must also have a truststore for validating certificates in the path specified in "fusion.ssl.truststore". The keystores and truststores can be the same file and may be shared amongst the processes.

    2. Fusion Server configuration

      To configure Server-Server or Server-Client SSL, add the following configuration to the application.properties file, e.g.

      ssl.enabled=true
      ssl.key.alias=socketbox
      ssl.key.password=***********
      ssl.keystore=/etc/ssl/key.store
      ssl.keystore.password=**************
      Server-Server or Server-Client
      Configure the keystore for each server:
      Key | Value | Default | File
      ssl.key.alias | alias of private key/certificate chain in key store | NA | application.properties
      ssl.key.password | encrypted password to key | NA | application.properties
      ssl.keystore | path to keystore | NA | application.properties
      ssl.keystore.password | encrypted password to key store | NA | application.properties
      Server-to-Server or Server-to-IHC

      Configure the truststore for each server:

      Key | Value | Default | File
      ssl.truststore | path to truststore | Default | application.properties
      ssl.truststore.password | encrypted password to trust store | Default | application.properties
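      For example, the truststore entries in application.properties might look like the following (the path and password are placeholders):
      ssl.truststore=/etc/ssl/key.store
      ssl.truststore.password=**************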
    3. Fusion client configuration Server-Client only

      Configure the truststore for each client:

      Key | Value | Default | File
      fusion.ssl.truststore | path to truststore | NA | core-site.xml
      fusion.ssl.truststore.password | encrypted password for truststore | NA | core-site.xml
      fusion.ssl.truststore.type | JKS, PKCS12 | JKS | core-site.xml
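      As an illustration, the corresponding entries in core-site.xml might look like this (path, password and type are placeholders):
      <property>
          <name>fusion.ssl.truststore</name>
          <value>/etc/ssl/key.store</value>
      </property>
      <property>
          <name>fusion.ssl.truststore.password</name>
          <value>***********</value>
      </property>
      <property>
          <name>fusion.ssl.truststore.type</name>
          <value>JKS</value>
      </property>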
    4. IHC Server configuration (Server-IHC SSL only)

      Configure the keystore for each IHC server:

      Key | Value | Default | File
      ihc.ssl.key.alias | alias of private key/certificate chain in keystore | NA | .ihc
      ihc.ssl.key.password | encrypted password to key | NA | .ihc
      ihc.ssl.keystore | path to keystore | NA | .ihc
      ihc.ssl.keystore.password | encrypted password to keystore | NA | .ihc
      ihc.ssl.keystore.type | JKS, PKCS12 | JKS | .ihc
    5. Enable SSL:

      The following configuration is used to turn on each type of SSL encryption:

      Type | Key | Value | Default | File
      Fusion Server - Fusion Server | ssl.enabled | true | false | application.properties
      Fusion Server - Fusion Client | fusion.ssl.enabled | true | false | core-site.xml
      Fusion Server - Fusion IHC Server | fusion.ihc.ssl.enabled | true | false | .ihc

      Enable SSL (HTTPS) for the WD Fusion Server

      The manual steps for getting WD Fusion Server to support HTTPS connections:

      1. You need to add the following property to application.properties.
        Type | Key | Value | Default | File
        Enable HTTPS support for Fusion core | fusion.http.policy | HTTP_ONLY, HTTPS_ONLY or BOTH_HTTP_HTTPS. If you enable HTTPS_ONLY, you need to make some matching changes to the WD Fusion UI server so that it is able to communicate with the core Fusion server. | HTTP_ONLY | application.properties
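        For example, to restrict the Fusion core to HTTPS only, you would add the following line to application.properties (HTTPS_ONLY is just the illustrative choice here):
        fusion.http.policy=HTTPS_ONLY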

        Enable HTTPS for Fusion UI

        Note that if you enable the Fusion Server to communicate over HTTPS-only, then you must also make the following changes so that the Fusion UI matches up:

        target.ssl true
        target.port 443 (This is the port that Fusion Server uses for accepting REST requests, over HTTPS).
      2. Advanced Options

        Only apply these options if you fully understand what they do.
        The following advanced options provide a number of low level configuration settings that may be required for installation into certain environments. The incorrect application of some of these settings could cause serious problems, so for this reason we strongly recommend that you discuss their use with WANdisco's support team before enabling them.

        URI Selection

        The default behavior for WD Fusion is to fix all replication to the Hadoop Distributed File System / hdfs:/// URI. Setting the hdfs scheme provides the widest support for Hadoop client applications, since some applications can't support the available "fusion:///" URI, or they can only run on HDFS instead of the more lenient HCFS. Each option is explained below:

        Use HDFS URI with HDFS file system
        URI Option A
        This option is available for deployments where the Hadoop applications support neither the WD Fusion URI nor the HCFS standards. WD Fusion operates entirely within HDFS.

        This configuration will not allow paths with the fusion:// URI to be used; only paths starting with hdfs://, or paths with no scheme, that correspond to a mapped path will be replicated. The underlying file system will be an instance of the HDFS DistributedFileSystem, which will support applications that aren't written to the HCFS specification.
        Use WD Fusion URI with HCFS file system
        URI Option B
        This is the default option that applies if you don't enable Advanced Options, and was the only option in WD Fusion prior to version 2.6. When selected, you need to use fusion:// for all data that must be replicated over an instance of the Hadoop Compatible File System. If your deployment includes Hadoop applications that are either unable to support the Fusion URI or are not written to the HCFS specification, this option will not work.
        Use Fusion URI with HDFS file system
        URI option C
        This differs from the default in that while the WD Fusion URI is used to identify data to be replicated, the replication is performed using HDFS itself. This option should be used if you are deploying applications that can support the WD Fusion URI but not the Hadoop Compatible File System.


        Use Fusion URI and HDFS URI with HDFS file system
        URI Option D
        This "mixed mode" supports all the replication schemes (fusion://, hdfs:// and no scheme) and uses HDFS for the underlying file system, to support applications that aren't written to the HCFS specification.

        Setting up Node Location

        WD Fusion is designed to fit into deployments that have far-flung data centers. The Node Location setting is used to identify where in the world the data center is situated, using standard global positioning system coordinates. These coordinates will be used by any connected WD Fusion nodes to correctly place the node's location on the world map.

        location

        WD Fusion setting server location.

        Set up email notifications

        This section describes how to set up notification emails that will be triggered if one of the tracked system resources reaches a defined threshold.

        Important: Email notification is disabled by default. You must complete the following steps before any messages will be sent.

        Email Settings

        Email Notification Settings are located in the Zone section of the settings

        Complete the following steps to enable email notification:

        1. Enter your SMTP properties in the Server configuration tab.
        2. Enter recipient addresses in the Recipients tab.
        3. Tick the Enable check-box for each trigger-event for which you want an email notification sent out.
        4. [Optionally] You can customize the messaging that will be included in the notification email message by adding your own text in the Templates tab.

Notification emails

The following triggers support email notification. See the Templates section for more information.

Consistency Check Failing
Email sent if a consistency check fails.
CPU Load Threshold Hit
The threshold set on the Dashboard graph for CPU Load has been reached. See Dashboard Graphs Settings.
HDFS Usage Threshold Hit
The threshold set on the Dashboard graph for database partition disk usage has been reached. See Dashboard Graphs Settings.
Java Heap Usage Threshold Hit
The system's available Java Heap Threshold has been reached. See Dashboard Graphs Settings.
License Expiring
The deployment's WANdisco license is going to expire.
Node Down
One of the nodes in your deployment is down.
Quorum Lost
One of the active replication groups is unable to continue replication due to the loss of one or more nodes.

Server config

The server config tab contains the settings for the SMTP email server that you will use for relaying your notification emails. You need to complete these details and check that they are correct before your notification emails can be enabled.

SMTP Settings

Email Notification Settings are located in the Zone section of the settings

SMTP Host
The hostname or IP address for your email relay server.
SMTP Port
The port used by your email relay service. SMTP default port is 25.
Connection Encryption:
Drop-down for choosing the type of encryption that the mail server uses; None, SSL or TLS are supported. If SSL or TLS is selected, you should make sure that you adjust the SMTP port value, if required.
Authentication
Checkbox for indicating that a username and password are required for connecting to the mail server. If you tick the checkbox additional entry fields will appear.
SMTP Username
A username for connecting to the email server.
SMTP Password
A password for connecting to the email server.
From
Optional field for adding the sender email address that will be seen by the recipient.
To
Optional field for entering an email address that can be used for testing that the email setup will work.
Update Settings
Button, click to store your email notification entries.
Reset Changes
Reloads the saved settings, undoing any changes that you have made in the template that have not been saved.
Send Test Email
Button, click to send a test email to the address entered in the To field, so that you can check your settings work.

Recipients

The recipients tab is used to store one or more email addresses that can be used when sending out notification emails. You can enter any number of addresses, although you will still need to associate an entered address with a specific notification before it will be used. See Adding recipients below.
NameNode Settings

Email Notification Settings - Adding recipients

Adding recipients

  1. Enter a valid email address for a recipient who should receive a notification email from WD Fusion.
  2. Click the Add button.

You can repeat the procedure as many times as you like. You can send each notification to a different recipient (by associating that recipient's address with the particular trigger), or you can send a single notification email to multiple recipients (by associating multiple addresses with the notification email).

Enable Notification Emails

Once you have working server settings and valid recipient email addresses, you can start to enable notification emails from the Alerts tab.

  1. Go to the Alerts tab and select a notification trigger for which you would like to send emails. For example Consistency Check Failing. Tick the Enabled checkbox.

    Important: If a trigger is not enabled, no email notification will ever be sent. Likewise, an enabled trigger will not send out notification emails unless recipients are added.

    NameNode Settings
  2. From the Add More Recipients window, click on one or more of the recipients that you entered into the Recipients tab. Once you have finished selecting recipients, click Add.
    NameNode Settings
  3. The email notification is now set up. You can choose to change/add additional recipients, review or customize the messaging by clicking on the Edit Template link.
    NameNode Settings

Templates

The Templates tab gives you access to the email default text, allowing you to review and customize with additional messaging.

Email Settings

Email templates

Consistency Check Failing
This is the trigger system event for which the notification email will be sent.
Subject
The email's subject line. A default value is set for each of the triggers, however, you can reword these by changing the text in the template.
Custom Message
This entry box lets you add your own messaging to the notification. This could be anything that might be useful to an on-duty administrator such as links to related documentation or contact details for the next level of support, etc.
Message Body
The message body contains the fixed payload of the notification email; you can't edit this element and it may contain specific error messaging taken from logs.

Example Notification Email

This is what an email notification looks like:

From: cluster-admin@organization.com
Date: Mon, Jan 4, 2016 at 3:49 PM
Subject: WANdisco Fusion UI - Consistency Check Failing
To: admin@organization.com

Here is a custom message.
 - Custom messaging entered in the Template

Consistency Check Failing triggered a watch event, any relevant error message will appear below.
 - Default Message

The following directory failed consistency check:  

  /repl1
- Specific error message

==================== NODE DETAILS =====================  
Host Name     : xwstest-01.your.organization.com
IP address    : 10.0.0.146
IP port       : 6444
-------------------------------------------------------
Node Id       : wdfs1
Node Name     : wdfs1
Node status   : LOCAL
Node's zone   : zone1
Node location : location1
Node latitude : 11.0
Node longitude: 119.0
-------------------------------------------------------
Memory usage  : 0.0%
Disk usage    : 0.0%
Last update   : 2016.Jan.04 at 15:49:28 GMT
Time Now      : 2016.Jan.04 at 15:49:48 GMT
=======================================================
 - Standard footer
		

Setting up Kerberos

If the Hadoop deployment is secured using Kerberos you need to enable Kerberos in the WD Fusion UI. Use the following procedure:

  1. Look to the security procedures of your particular Hadoop platform.
  2. Running with a unified or per-service principal:

    Unified
    Some Hadoop platforms are Kerberized under a single hdfs user; this is common in Cloudera deployments. For simplicity, this is what we recommend.
    • Generate a keytab for each of your WD Fusion nodes using the hdfs service; for clarity, the steps below present a manual setup:
      ktadd -k fusion.keytab -norandkey hdfs/${hostname}@${krb_realm}

    Per-service
    • If your deployment uses separate principals for each HDFS service then you will need to set up a principal for WD Fusion.
    • On the KDC, using kadmin.local, create new principals for WD Fusion user and generate keytab file, e.g.:
      > addprinc -randkey hdfs/${hostname}@${krb_realm} 
      > ktadd -k fusion.keytab -norandkey hdfs/${hostname}@${krb_realm}
  3. Copy the generated keytab to a suitable filesystem location on the WD Fusion server, e.g. /etc/wandisco/security/, that will be accessible to your controlling system user ("hdfs" by default). See the example commands below.

    Note: We don't recommend storing the keytab in Hadoop's own Kerberos configuration directory /etc/hadoop/conf, given that this is overwritten by the cluster manager.
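    For example (a sketch only; adjust the paths, ownership and permissions to suit your environment and controlling system user):
      mkdir -p /etc/wandisco/security
      cp fusion.keytab /etc/wandisco/security/
      chown hdfs:hdfs /etc/wandisco/security/fusion.keytab
      chmod 600 /etc/wandisco/security/fusion.keytab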

  4. Setting up handshake tokens

    By default, handshake tokens are created in the user's working directories, e.g. /user/jdoe. It is recommended that you create them elsewhere, using the following procedure:

    1. Open the core-site.xml file and add the following property:
      <property>
      	  <name>fusion.handshakeToken.dir</name>
      	  <value>/some/token/dir</value>
        </property>
      fusion.handshakeToken.dir
      This is the location where you want handshake tokens to be created for the cluster. E.g. if for DC1 you configure "fusion.handshakeToken.dir" to be "/repl1/tokens/", then handshake tokens will be written to "/repl1/tokens/.fusion/.token_$USERNAME_$UUID", where $USERNAME is the username of the connecting user and $UUID is a random UUID.

      Important requirement: All WD Fusion system users must have read and write permissions for the location.
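      For example, using the illustrative path above, you could create the token directory and open up its permissions as follows (a sketch only; apply permissions that match your own security policy):
      sudo -u hdfs hadoop fs -mkdir -p /repl1/tokens
      sudo -u hdfs hadoop fs -chmod 777 /repl1/tokens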

    Important: Known issue running Teragen and Terasort
    There are known problems running Teragen and Terasort with FusionHdfs or FusionHcfs configurations. Some required directories are currently missing and will cause Terasort to hang. You can work around the problem by creating the following directories, then making sure that the yarn and mapred users are added and that they have access to the directories. E.g.:

    sudo -u hdfs hadoop fs -mkdir /user/yarn
    sudo -u hdfs hadoop fs -chown yarn /user/yarn
    sudo -u hdfs hadoop fs -mkdir /user/mapred
    sudo -u hdfs hadoop fs -chown mapred /user/mapred

    Set up Kerberos single KDC with Ambari

    The following procedure illustrates how to install Kerberos, running with a single Key Distribution Center, under Ambari.

    When to use kadmin.local and kadmin?
    When performing the Kerberos commands in this procedure you can use kadmin.local or kadmin depending on your access and account:

    • If you can log on to the KDC host directly and have root access or a Kerberos admin account, use the kadmin.local command.
    • When accessing the KDC from a remote host, use the kadmin command. From any host, run one of the following:
      $ sudo kadmin.local
      or
      $ kadmin

    Setup Procedure

    1. Before you start, download and install the Java Cryptographic Extension (JCE) Unlimited Strength Jurisdiction Policy Files 7.
      unzip UnlimitedJCEPolicyJDK7.zip -d  /usr/jdk64/jdk1.7.0_67/jre/lib/security/
    2. Install the Kerberos server:
      yum install -y krb5-server krb5-libs krb5-auth-dialog krb5-workstation
    3. Edit /etc/krb5.conf and replace "EXAMPLE.COM" with your realm. E.g.
      sed -i "s/EXAMPLE.COM/DOMAIN.COM/g" /etc/krb5.conf /var/kerberos/krb5kdc/kdc.conf /var/kerberos/krb5kdc/kadm5.acl
      [logging]
       default = FILE:/var/log/krb5libs.log
       kdc = FILE:/var/log/krb5kdc.log
       admin_server = FILE:/var/log/kadmind.log
       
      [libdefaults]
       default_realm = DOMAIN.COM
       dns_lookup_realm = false
       dns_lookup_kdc = false
       ticket_lifetime = 24h
       renew_lifetime = 7d
       forwardable = true
       
      [realms]
       DOMAIN.COM = {
        kdc = host15-vm0.cfe.domain.com
        admin_server = host15-vm0.cfe.domain.com
       }
       
      [domain_realm]
       .wandisco.com = DOMAIN.COM
       wandisco.com = DOMAIN.COM				
      
    4. Edit /var/kerberos/krb5kdc/kdc.conf:
      
      [kdcdefaults]
       kdc_ports = 88
       kdc_tcp_ports = 88
        
      [realms]
       DOMAIN.COM = {
        #master_key_type = aes256-cts
        acl_file = /var/kerberos/krb5kdc/kadm5.acl
        dict_file = /usr/share/dict/words
        admin_keytab = /var/kerberos/krb5kdc/kadm5.keytab
        max_life = 24h 0m 0s
        max_renewable_life = 7d
       supported_enctypes = aes256-cts:normal aes128-cts:normal
      des3-hmac-sha1:normal arcfour-hmac:normal des-hmac-sha1:normal
      des-cbc-md5:normal des-cbc-crc:normal
       }				
      
    5. Edit /var/kerberos/krb5kdc/kadm5.acl and replace EXAMPLE.COM with your realm.
    6. To create a database, run
      /usr/sbin/kdb5_util create -s
    7. Start Kerberos service:
      /sbin/service krb5kdc start			
      /sbin/service kadmin start		
      
    8. Prepare your kerberos clients. Run
      
      yum install -y  krb5-libs krb5-workstation
      Repeat this on all other machines in the cluster to make them kerberos workstations connecting to the KDC. E.g.
      for i in {1..4}; do ssh root@vmhost17-nfs$i.cfe.domain.com 'yum install -y  krb5-libs krb5-workstation';done	
      
    9. Copy the /etc/krb5.conf file from the kerberos server node to all kerberos client nodes
      for i in {1..5}; do scp /etc/krb5.conf root@vmhost17-vm$i.cfe.domain.com:/etc/;done
      
    10. Create a user on all nodes: useradd -u 1050 testuser
      for i in {0..4}; do ssh root@vmhost17-nfs$i.cfe.domain.com 'useradd -u 1050 testuser';done
    11. Create principal and password for user (testuser):
      [root@vmhost17-vm0 ~]# kadmin.local
      Authenticating as principal root/admin@DOMAIN.COM with password.
      kadmin.local:  addprinc testuser/admin
      WARNING: no policy specified for testuser/admin@DOMAIN.COM; defaulting to no policy
      Enter password for principal "testuser/admin@DOMAIN.COM":
      Re-enter password for principal "testuser/admin@DOMAIN.COM":
      Principal "testuser/admin@DOMAIN.COM" created.
      kadmin.local:  exit
      [root@vmhost01-vm1 ~]# su - testuser
      [testuser@vmhost01-vm1 ~]$ kinit
      Password for testuser/admin@DOMAIN.COM:
      [testuser@vmhost01-vm1 ~]$ klist
      Ticket cache: FILE:/tmp/krb5cc_519
      Default principal: testuser/admin@DOMAIN.COM
      Valid starting     Expires            Service principal
      04/29/15 18:17:15  04/30/15 18:17:15  krbtgt/DOMAIN.COM@DOMAIN.COM renew until 04/29/15 18:17:15
      
    12. WD Fusion installation step

      During the WD Fusion Installation's Kerberos step, set the configuration for an existing Kerberos setup.

      Set up Kerberos single KDC on CDH cluster

      The following procedure illustrates how to install Kerberos, running with a single Key Distribution Center, under CDH.

      Set up a KDC and Default Domain

      When to use kadmin.local and kadmin?
      When performing the Kerberos commands in this procedure you can use kadmin.local or kadmin depending on your access and account:

      • If you can log on to the KDC host directly and have root access or a Kerberos admin account, use the kadmin.local command.
      • When accessing the KDC from a remote host, use the kadmin command. From any host, run one of the following:
        $ sudo kadmin.local
        or
        $ kadmin

      Setup Procedure

      1. Before you start, download and install the Java Cryptographic Extension (JCE) Unlimited Strength Jurisdiction Policy Files 7.
        unzip UnlimitedJCEPolicyJDK7.zip -d  /usr/jdk64/jdk1.7.0_67/jre/lib/security/
      2. Install the Kerberos server:
        yum install -y krb5-server krb5-libs krb5-auth-dialog krb5-workstation
      3. Edit /etc/krb5.conf and replace "EXAMPLE.COM" with your realm. E.g.
        sed -i "s/EXAMPLE.COM/DOMAIN.COM/g" /etc/krb5.conf /var/kerberos/krb5kdc/kdc.conf /var/kerberos/krb5kdc/kadm5.acl
        [logging]
         default = FILE:/var/log/krb5libs.log
         kdc = FILE:/var/log/krb5kdc.log
         admin_server = FILE:/var/log/kadmind.log
         
        [libdefaults]
         default_realm = DOMAIN.COM
         dns_lookup_realm = false
         dns_lookup_kdc = false
         ticket_lifetime = 24h
         renew_lifetime = 7d
         forwardable = true
         
        [realms]
         DOMAIN.COM = {
          kdc = host15-vm0.cfe.domain.com
          admin_server = host15-vm0.cfe.domain.com
         }
         
        [domain_realm]
         .wandisco.com = DOMAIN.COM
         wandisco.com = DOMAIN.COM				
        
      4. Edit /var/kerberos/krb5kdc/kdc.conf:
        
        [kdcdefaults]
         kdc_ports = 88
         kdc_tcp_ports = 88
          
        [realms]
         DOMAIN.COM = {
          #master_key_type = aes256-cts
          acl_file = /var/kerberos/krb5kdc/kadm5.acl
          dict_file = /usr/share/dict/words
          admin_keytab = /var/kerberos/krb5kdc/kadm5.keytab
          max_life = 24h 0m 0s
          max_renewable_life = 7d
         supported_enctypes = aes256-cts:normal aes128-cts:normal
        des3-hmac-sha1:normal arcfour-hmac:normal des-hmac-sha1:normal
        des-cbc-md5:normal des-cbc-crc:normal
         }				
        
      5. Edit /var/kerberos/krb5kdc/kadm5.acl and replace EXAMPLE.COM with your realm.
      6. To create a database, run
        /usr/sbin/kdb5_util create -s
      7. Start Kerberos service:
        /sbin/service krb5kdc start			
        /sbin/service kadmin start		
        
      8. Prepare your kerberos clients. Run
        
        yum install -y  krb5-libs krb5-workstation
        Repeat this on all other machines in the cluster to make them kerberos workstations connecting to the KDC. E.g.
        for i in {1..4}; do ssh root@vmhost17-nfs$i.cfe.domain.com 'yum install -y  krb5-libs krb5-workstation';done	
        
      9. Copy the /etc/krb5.conf file from the kerberos server node to all kerberos client nodes
        for i in {1..5}; do scp /etc/krb5.conf root@vmhost17-vm$i.cfe.domain.com:/etc/;done
        
      10. Create a user on all nodes: useradd -u 1050 testuser
        for i in {0..4}; do ssh root@vmhost17-nfs$i.cfe.domain.com 'useradd -u 1050 testuser';done
      11. Create principal and password for user (testuser):
        [root@vmhost17-vm0 ~]# kadmin.local
        Authenticating as principal root/admin@DOMAIN.COM with password.
        kadmin.local:  addprinc testuser/admin
        WARNING: no policy specified for testuser/admin@DOMAIN.COM; defaulting to no policy
        Enter password for principal "testuser/admin@DOMAIN.COM":
        Re-enter password for principal "testuser/admin@DOMAIN.COM":
        Principal "testuser/admin@DOMAIN.COM" created.
        kadmin.local:  exit
        [root@vmhost01-vm1 ~]# su - testuser
        [testuser@vmhost01-vm1 ~]$ kinit
        Password for testuser/admin@DOMAIN.COM:
        [testuser@vmhost01-vm1 ~]$ klist
        Ticket cache: FILE:/tmp/krb5cc_519
        Default principal: testuser/admin@DOMAIN.COM
        Valid starting     Expires            Service principal
        04/29/15 18:17:15  04/30/15 18:17:15  krbtgt/DOMAIN.COM@DOMAIN.COM renew until 04/29/15 18:17:15
        
      12. Create the HDFS principal:
        kadmin.local:  addprinc hdfs@DOMAIN.COM
      13. Create hdfs.keytab and move the hdfs.keytab file into the /etc/cloudera-scm-server/ directory on the host where you are running the Cloudera Manager Server. Make sure that the hdfs.keytab file has readable permissions for all users:
        kadmin: xst -k hdfs.keytab hdfs@DOMAIN.COM
        mv hdfs.keytab /etc/cloudera-scm-server/
        chmod +r /etc/cloudera-scm-server/hdfs.keytab

      Create a Kerberos Principal and Keytab File for the Cloudera Manager Server

      The following sequence is an example procedure for creating the Cloudera Manager Server principal and keytab file for MIT Kerberos.

      1. In the kadmin.local or kadmin shell, type in the following command to create the Cloudera Manager Service principal:
        kadmin: addprinc -randkey cloudera-scm/admin@DOMAIN.COM
      2. Create the Cloudera Manager Server cmf.keytab file:
        kadmin: xst -k cmf.keytab cloudera-scm/admin@DOMAIN.COM

        Important: The Cloudera Manager Server keytab file must be named cmf.keytab because that name is hard-coded in Cloudera Manager.

      Deploying the Cloudera Manager Server Keytab

      After obtaining or creating the Cloudera Manager Server principal and keytab, follow these instructions to deploy them:

      1. Move the cmf.keytab file to the /etc/cloudera-scm-server/ directory. This is the directory on the host where you are running the Cloudera Manager Server.
        $ mv cmf.keytab /etc/cloudera-scm-server/
      2. Ensure that the cmf.keytab file is only readable by the Cloudera Manager Server user account cloudera-scm.
        sudo chown cloudera-scm:cloudera-scm /etc/cloudera-scm-server/cmf.keytab
        
        sudo chmod 600 /etc/cloudera-scm-server/cmf.keytab
        
      3. Add the Cloudera Manager Server principal (cloudera-scm/admin@DOMAIN.COM) to a text file named cmf.principal and store the cmf.principal file in the /etc/cloudera-scm-server/ directory on the host where you are running the Cloudera Manager Server.
      4. Make sure that the cmf.principal file is only readable by the Cloudera Manager Server user account cloudera-scm.
        sudo chown cloudera-scm:cloudera-scm /etc/cloudera-scm-server/cmf.principal
        
        sudo chmod 600 /etc/cloudera-scm-server/cmf.principal
        

        Note: For Single KDC copy cmf.keytab and cmf.principal to another CM node:

        scp /etc/cloudera-scm-server/cmf* vmhost17-vm0.bdfrem.wandisco.com:/etc/cloudera-scm-server/
            

        Configure the Kerberos Default Realm in the Cloudera Manager Admin Console

        1. In the Cloudera Manager Admin Console, select Administration > Settings.
        2. Click the Security category, and enter the Kerberos realm for the cluster in the Kerberos Security Realm field that you configured in the krb5.conf file.
        3. Click Save Changes.

        Adding Gateway roles to all YARN hosts.

        1. From the Services tab, select your YARN service.
        2. Click the Instances tab.
        3. Click Add Roles and choose Gateway role.
        4. Select all hosts and click Install.

        Enable Hadoop Security

        You can do this by hand: see CM Enable Security.

        Cloudera Manager Kerberos Wizard

        After configuring kerberos, you now have a working Kerberos server and can secure the Hadoop cluster. The wizard will do most of the heavy lifting; you just have to fill in a few values.

        1. To start, log into Cloudera Manager by going to http://your_hostname:7180 in your browser. The user ID and Password are the same as those used for accessing your Management Endpoint (Ambari or Cloudera Manager, etc.) or, if you're running without a manager, such as with a cloud deployment, they will be set in a properties file.
        2. There are lots of productivity tools here for managing the cluster but ignore them for now and head straight for the Administration > Kerberos wizard.

        3. Click on the "Enable Kerberos" button.
        4. Check each KRB5 Configuration item and select Continue.
          kerberos CM configuration screen
        5. The Kerberos Wizard needs to know the details of what the script configured. Fill in the entries as follows:
          • KDC Server Host KDC_hostname
          • Kerberos Security Realm: DOMAIN.COM
          • Kerberos Encryption Types: aes256-cts-hmac-sha1-96
          Click Continue.
        6. You want Cloudera Manager to manage the krb5.conf files in your cluster, so check "Yes" and then select "Continue".
        7. Enter the credentials for the account that has permissions to create other principals.
          User: testuser/admin@DOMAIN.COM
          Password: password for testuser/admin@DOMAIN.COM
          
        8. The next screen provides good news. It lets you know that the wizard was able to successfully authenticate.
        9. In this step, the setup wizard will create Kerberos principals for each service in the cluster.
        10. You're ready to let the Kerberos Wizard do its work. You should select I'm ready to restart the cluster now and then click Continue.
        11. Successfully enabled Kerberos.
        12. You are now running a Hadoop cluster secured with Kerberos.
        13. WD Fusion installation step

          You should enter the paths to the /etc/krb5.conf file and to the hdfs.keytab file, and then select the hdfs principal.

          Kerberos and HDP's Transparent Data Encryption

          There are some extra steps required to overcome a class loading error that occurs when WD Fusion is used with at-rest encrypted folders. Specifically, the cluster config changes described below are required:

          <property>
          <name>hadoop.kms.proxyuser.fusion.users</name>
          <value>*</value>
          </property>
                
          <property>
          <name>hadoop.kms.proxyuser.fusion.groups</name>
          <value>*</value>
          </property>
           
          <property>
          <name>hadoop.kms.proxyuser.fusion.hosts</name>
          <value>*</value>
          </property>

          Setting up SSL encryption for DConE traffic

          WD Fusion supports the use of Secure Socket Layer encryption (SSL) for securing its replication traffic. To enable this encryption you need to generate a keypair that must be put into place on each of your WD Fusion nodes. You then need to add some variables to the application.properties file.

          1. Open a terminal and navigate to <INSTALL_DIR>/etc/wandisco/config.

          2. Within /config make a new directory called ssl.
            mkdir ssl

          3. Navigate into the new directory.
            cd ssl

          4. Copy your private key into the directory. If you don't already have keys set up you can use JAVA's keygen utility, using the command:
            keytool -genkey -keyalg RSA -keystore wandisco.ks -alias server -validity 3650 -storepass <YOUR PASSWORD>

            Read more about the Java keystore generation tool in the KB article - Using Java Keytool to manage keystores

            Ensure that the system account that runs the WD Fusion server process has sufficient privileges to read the keystore files.

            Java keytool options

            Option | Description
            -genkey | Switch for generating a key pair (a public key and associated private key). Wraps the public key into an X.509 v1 self-signed certificate, which is stored as a single-element certificate chain. This certificate chain and the private key are stored in a new keystore entry identified by the alias.
            -keyalg RSA | The key algorithm; in this case RSA is specified.
            -keystore wandisco.ks | The file name for your keystore, which will be stored in the current directory.
            -alias server | Assigns the alias "server" to the key pair. Aliases are case-insensitive.
            -validity 3650 | Validates the key pair for 3650 days (10 years). The default would be 3 months.
            -storepass <YOUR PASSWORD> | Provides the keystore with a password.

            If no password is specified on the command, you'll be prompted for it. Your entry will not be masked so you (and anyone else looking at your screen) will be able to see what you type.

            Most commands that interrogate or change the keystore will need to use the store password. Some commands may need to use the private key password. Passwords can be specified on the command line (using the -storepass and -keypass options).
            However, a password should not be specified on a command line or in a script unless it is for testing purposes, or you are on a secure system.

            The utility will prompt you for the following information

            What is your first and last name?  [Unknown]:  
            What is the name of your organizational unit?  [Unknown]:  
            What is the name of your organization?  [Unknown]:  
            What is the name of your City or Locality?  [Unknown]:  
            What is the name of your State or Province?  [Unknown]:  
            What is the two-letter country code for this unit?  [Unknown]:  
            Is CN=Unknown, OU=Unknown, O=Unknown, L=Unknown, ST=Unknown, C=Unknown correct?  [no]:  yes
            
            Enter key password for <mykey> (RETURN if same as keystore password):
          5. With the keystore now in place, you'll need to add variables to the application.properties file.

            SSL DConE Encryption Variables for application.properties

            Variable Name | Example | Description
            ssl.enabled | true | Requires a "true" or "false" value. When the value is set to false, none of the other variables will be used.
            ssl.debug | true | Requires a "true" or "false" value. When set to true, debugging mode is enabled.
            ssl.keystore | ./properties/wandisco.ks | The path to the SSL private keystore file that is stored on the node. By default this is called "wandisco.ks".
            ssl.key.alias | wandisco | The assigned alias for the key pair. Aliases are case-insensitive.
            ssl.keystore.password | <a password> | The SSL key password. This is described in more detail in Setting a password for SSL encryption.
            ssl.truststore | ./properties/wandisco.ks | The path to the SSL private truststore file that is stored on the node. By default this is called "wandisco.ks" because, by default, the keystore and truststore are one and the same file, although they don't have to be.
            ssl.truststore.password | "bP0L7SY7f/4GWSdLLZ3e+" | The truststore password. The password should be encrypted.

            Changes to any of these values require a restart of the DConE service. Any invalid value will cause the replicator to restart and no DConE traffic will flow.
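            Putting these together, a DConE SSL fragment of application.properties might look like the following (paths, alias and passwords are placeholders):
            ssl.enabled=true
            ssl.debug=false
            ssl.keystore=./properties/wandisco.ks
            ssl.key.alias=wandisco
            ssl.keystore.password=***********
            ssl.truststore=./properties/wandisco.ks
            ssl.truststore.password=***********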

          Setting the server key

          In the keystore, the server certificate is associated with a key. By default, we look for a key named "server" to validate the certificate. If you use a key for the server with a different name, enter this name in the SSL settings.

          SSL Troubleshooting

          A complete debug of the SSL logging will be required to diagnose the problems. To capture the debugging, ensure that the variable debugSsl is set to "true".

          To enable logging of the SSL implementation layer, set logging to FINEST for the 'com.wandisco.platform.net' package.

          Enable SSL for Hadoop Services

          This section shows you how to enable SSL encryption for Hadoop's native services such as HDFS, Yarn or MapReduce.

          1. On ALL nodes create key directories:
            /etc/security/serverKeys and /etc/security/clientKeys
          2. On all nodes, create keystore files:
            cd /etc/security/serverKeys
            keytool -genkeypair -alias $HOSTNAME -keyalg RSA -keysize 2048 -dname CN=$HOSTNAME,OU=Dev,O=BigData,L=SanRamon,ST=ca,C=us -keypass $PASSWORD -keystore $HOSTNAME.ks -storepass $PASSWORD
            
            For further explanation of what these options do, see the Java keytool options table above.
          3. On all nodes export the certificate public key to a certificate file:
            cd /etc/security/serverKeys
            keytool -exportcert -alias $HOSTNAME -keystore $HOSTNAME.ks -rfc -file $HOSTNAME.crt -storepass $PASSWORD
          4. On all nodes, import the certificate into truststore file:
            cd /etc/security/serverKeys
            keytool -importcert -noprompt -alias $HOSTNAME -file $HOSTNAME.crt -keystore $HOSTNAME.trust -storepass $PASSWORD
            
          5. Create a single truststore file containing the public keys from all of the certificates (this will be used by clients). Start on node1:
            cd /etc/security/serverKeys
            Copy the truststore file from the current node to the next one and redo the import step above, so that each node's certificate is added in turn.
          6. From the last node, copy the truststore, which now contains all of the certificates, to all servers as /etc/security/clientKeys/all.jks
          7. On all nodes, copy the keystore to "service".ks (e.g. hdfs.ks). See the sketch below.
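          The following shell sketch illustrates steps 5 to 7; the host names and the rolling truststore file name (all.jks) are placeholders, and you would repeat the copy/import cycle for every node in the cluster:
            # on node1, seed the shared truststore with this host's certificate
            keytool -importcert -noprompt -alias $HOSTNAME -file $HOSTNAME.crt -keystore all.jks -storepass $PASSWORD
            # copy the growing truststore to the next node, then repeat the import there
            scp all.jks root@node2.example.com:/etc/security/serverKeys/
            # from the last node, distribute the completed truststore to every server
            scp all.jks root@node1.example.com:/etc/security/clientKeys/all.jks
            # finally, on each node, copy the host keystore to the per-service name
            cp $HOSTNAME.ks hdfs.ks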

          Keystores are used in two ways:

          • The keystore contains private keys and certificates used by SSL servers to authenticate themselves to SSL clients. By convention, such files are referred to as keystores.
          • When used as a truststore, the file contains certificates of trusted SSL servers, or of Certificate Authorities trusted to identify servers. There are no private keys in the truststore.
          Most commonly, cert-based authentication is only done in one direction server->client. When a client also authenticates with a certificate this is called mutual authentication.

          While all SSL clients must have access to a truststore, it is not always necessary to create and deploy truststores across a cluster. The standard JDK distribution includes a default truststore which is pre-provisioned with the root certificates of a number of well-known Certificate Authorities. If you do not provide a custom truststore, the Hadoop daemons load this default truststore. Therefore, if you are using certificates issued by a CA in the default truststore, you do not need to provide custom truststores. However, you must consider the following before you decide to use the default truststore:

          If you choose to use the default truststore, it is your responsibility to maintain it. You may need to remove the certificates of CAs you do not deem trustworthy, or add or update the certificates of CAs you trust. Use the keytool utility to perform these actions.

          Security Considerations

          keystores contain private keys. truststores do not. Therefore, security requirements for keystores are more stringent:

          • Hadoop SSL requires that truststores and the truststore password be stored, in plaintext, in a configuration file that is readable by all.
          • Keystore and key passwords are stored, in plaintext, in a file that is readable only by members of the appropriate group.

          These considerations should guide your decisions about which keys and certificates you will store in the keystores and truststores that you will deploy across your cluster.

          Keystores should contain a minimal set of keys and certificates. Ideally you should create a unique keystore for each host, which would contain only the keys and certificates needed by the Hadoop SSL services running on the host. Usually the keystore would contain a single key/certificate entry. However, because truststores do not contain sensitive information you can safely create a single truststore for an entire cluster. On a production cluster, such a truststore would often contain a single CA certificate (or certificate chain), since you would typically choose to have all certificates issued by a single CA.

          Important: Do not use the same password for truststores and keystores/keys. Since truststore passwords are stored in the clear in files readable by all, doing so would compromise the security of the private keys in the keystore.

          SSL roles for Hadoop Services

          Service | SSL Role
          HDFS | server and client
          MapReduce | server and client
          YARN | server and client
          HBase | server
          Oozie | server
          Hue | client

          SSL servers load the keystores when starting up. Clients then take a copy of the truststore and use it to validate the server's certificate.

          Configure SSL for HDFS, YARN and MapReduce

          Before you begin

          Ensure keystores/certificates are accessible on all hosts running HDFS, MapReduce or YARN. As these services also run as clients they also need access to the truststore. (As mentioned, it's okay to put the truststores on all nodes as you can't always determine which hosts will be running the relevant services.)

          keystores must be owned by the hadoop group and have permissions 0440 (readable by owner and group). truststores must have permission 0444 (readable by all).
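          For example, assuming the file names used earlier in this section (the owning user here is illustrative):
            chown hdfs:hadoop /etc/security/serverKeys/hdfs.ks
            chmod 0440 /etc/security/serverKeys/hdfs.ks
            chmod 0444 /etc/security/clientKeys/all.jks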

          You'll need to specify the absolute paths to keystore and truststore files - these paths need to be valid for all hosts - this translates into a requirement for all keystore file names for a given service to be the same on all hosts.

          Multiple daemons running on a host can share a certificate. For example, in case there is a DataNode and an Oozie server running on the same host, they can use the same certificate.

          Configuring SSL for HDFS

          1. In Ambari, navigate to the HDFS service and edit the configuration.
          2. Type SSL into the search field to show the SSL properties.
          3. Make edits to the following properties:
            Property | Description
            SSL Server Keystore File Location | Path to the keystore file containing the server certificate and private key.
            SSL Server Keystore File Password | Password for the server keystore file.
            SSL Server Keystore Key Password | Password that protects the private key contained in the server keystore.
          4. If you don't plan to use the default truststore, configure SSL client truststore properties:
            Property | Description
            Cluster-Wide Default SSL Client Truststore Location | Path to the client truststore file. This truststore contains certificates of trusted servers, or of Certificate Authorities trusted to identify servers.
            Cluster-Wide Default SSL Client Truststore Password | Password for the client truststore file.
          5. We recommend that you also enable web UI authentication for the HDFS service, providing that you have already secured the HDFS service. Enter web consoles in the search field to bring up Enable Authentication for HTTP Web-Consoles property. Tick the check box to enable web UI authentication.
            Property | Description
            Enable Authentication for HTTP Web-Consoles | Enables authentication for hadoop HTTP web-consoles for all roles of this service.
          6. Now the necessary edits are complete, click Save Changes.
          7. Follow the next section for setting up SSL for YARN/MapReduce.

          Configuring SSL for YARN / MapReduce

          Follow these steps to configure SSL for YARN or MapReduce services.
          1. Navigate to the YARN or MapReduce service and click Configuration.
          2. In the search field, type SSL to show the SSL properties.
          3. Edit the following properties according to your cluster configuration:
            Property | Description
            SSL Server Keystore File Location | Path to the keystore file containing the server certificate and private key.
            SSL Server Keystore File Password | Password for the server keystore file.
            SSL Server Keystore Key Password | Password that protects the private key contained in the server keystore.
          4. We recommend that you also enable web UI authentication for the HDFS service, providing that you have already secured the HDFS service. Enter web consoles in the search field to bring up Enable Authentication for HTTP Web-Consoles property. Tick the check box to enable web UI authentication.
            Property | Description
            Enable Authentication for HTTP Web-Consoles | Enables authentication for hadoop HTTP web-consoles for all roles of this service.
          5. Click Save Changes.
          6. Navigate to the HDFS service and in the search field, type Hadoop SSL Enabled. Click the value for the Hadoop SSL Enabled property and select the checkbox to enable SSL communication for HDFS, MapReduce, and YARN.
            Property | Description
            Hadoop SSL Enabled | Enable SSL encryption for HDFS, MapReduce, and YARN web UIs, as well as encrypted shuffle for MapReduce and YARN.
          7. Restart all affected services (HDFS, MapReduce and/or YARN), as well as their dependent services.