Documentos de Académico
Documentos de Profesional
Documentos de Cultura
EMC RecoverPoint
for Virtual Machines
Release 4.2
Administrators Guide
P/N 302-001-541
REV 01
November 17, 2014
This document contains instructions for using EMC RecoverPoint for Virtual Machines
release 4.2. Topics include:
Revision history
Revision history
The following table presents the revision history of this document:
Revision
Date
Description
01
November 2014
RecoverPoint for VMs splitter: The RecoverPoint for VMs splitter is proprietary software
installed on every ESXi server in an ESXi cluster involved in RecoverPoint replication
or running virtual RPAs. The RecoverPoint for VMs splitter splits every write to the
VMDK and sends it first to the vRPA.
vRPA cluster: A vRPA cluster is a group of 2-8 vRPAs that work together to replicate and
protect data. There are one or two clusters in a RecoverPoint for VMs system; one
cluster is used for local replication and two are used for remote replication. To
scale-up and support a higher throughput rate, you can have up to eight vRPAs in your
clusters.
RecoverPoint for VMs system: A RecoverPoint for VMs system includes all
RecoverPoint for VMs components at all clusters in one installation.
vSphere Web Client Plug-in: The RecoverPoint for VMs plug-in within the vSphere Web
Client. This is the user interface for managing VM replication. Access the vSphere Web
Client at: https://<vCenter-ip-address>:<port>/vsphere-client/
Where <vCenter-IP> is the IP address of the vCenter and <port> is the port selected in
the installation of the web client plug-in. The default port is 9443.
RecoverPoint replication
The following logical entities constitute the replication environment:
Consistency groups: A consistency group is a container for virtual machines and all
their copies whose application data needs to be replicated to a consistent point in
time. For instance, if you replicate a database and its transaction log, you need both
files, and you always need both files to be at the exact same point in time. You can
achieve this by placing the virtual machine running the database and the virtual
machine running the transaction log in one consistency group or in one group set. The
consistency group comprises virtual machines, their copies, and their journals.
Copies: In RecoverPoint for VMs, a copy is an image of a virtual machine and its
application data accessible at a vRPA cluster.The application data is immediately
usable. The following types of copies exist:
Production copy: The production copy consists of all of the virtual machines with
their application data that are the source for replication (that is, protected by
RecoverPoint).
Local copy: A local copy is a copy of production virtual machines with their
application data that is accessible on the vRPA cluster running production. The
local copy is used for continuous data protection, such as recovering from logical
errors and data corruption. A virtual machine that is replicated to a local copy is
identified by the .copy extension at the end of the virtual machine name.
Remote copy: A remote copy is a copy of production virtual machines with their
application data that is accessible on a remote vRPA cluster. (Remote denotes
clusters that are connected only by WAN.) The remote copy is used for disaster
recovery. A virtual machine that is replicated to a remote copy is identified by the
.copy extension at the end of the virtual machine name.
Shadow VM: For internal RecoverPoint use only. A shadow VM is identified by the
period at the front of the virtual machine name and the .recoverpoint extension at the
end of the virtual machine name.
Journals: Each copy of a consistency group must contain a resource pool that is
dedicated to holding marking information or point-in-time history.
Production journal: Production journals store information about the replication
process that makes synchronization between the production and copies more
efficient. After failing over, the production journal becomes a copy journal.
Copy journal: The copy journal receives successive writes written to production.
Since the write-order is maintained, it is possible to apply or undo writes so that
the copy image can reflect any point in time.
Snapshot: A snapshot is the difference between one consistent image of stored data
and the next. Snapshots are stored in the copy journal.
Bookmark: A bookmark is a manual snapshot with a label you apply to it to identify it.
Parallel bookmarks are bookmarks with the same name applied at the same time to
multiple consistency groups in a group set.
Group set: A group set is a collection of consistency groups to which the system
applies parallel bookmarks at a user-defined frequency. Group sets are useful for
consistency groups that are dependent on one another or that must work together as
a single unit.
Link: A link is the communication connection between RecoverPoint copies. When the
link is open, data can be transferred between copies.
Policy templates: Policy templates are defined protection policies that are saved and
can be reused. Three default policy templates are included in RecoverPoint for VMs:
Default local link; default remote link; default copy.
Image access: Image access mode allows you read/write-access to a selected point in
time at a copy. Image access mode is used to test a copy to verify that it is a reliable
and consistent replica of production. It is also the first step in restoring production
from the copy, rolling back to a previous point in time, recovering data, and failing
over to the copy.
Failover and disaster recovery: Failing over a consistency group to a local or remote
copy sets that copy to be the production and the original production becomes the
copy. Failing over to a copy can be used to recover from a disaster at the production
site and allows system operations to continue from the copy. Failing over erases the
copy journal. During failover, transfer is paused and access to the original production
is blocked. System operations can be resumed at the original production source by
failing back.
Recovering production: Recovering production restores the production from the copy
at the selected point in time.
vMotion:
Host and Storage vMotion are supported.
Applies to both vRPAs and protected VMs.
Source VM deletion:
When the RecoverPoint for VMs detects a deletion of a VM, it recognizes it as a
disaster for all relevant VMDKs which were lost.
The copy VM remains intact.
EMC RecoverPoint for Virtual Machines Administrators Guide
Taking VMware snapshots is supported only on production virtual machines and not
on copy virtual machines. Using the VM Restore operation to restore a production
virtual machine from a VMware snapshot or clone will cause a full sweep.
Changing the configuration of virtual machines that are being replicated is not
supported, except changing network connections, which is supported.
To clone a copy VM, it must be powered off. If it is not powered off, a VMware snapshot
will be created, which causes permanent data corruption.
The resources (memory and CPU) of a running vRPA can be increased but not
decreased. The changes will take effect only when the vRPA is rebooted.
Collecting vSphere logs from a running vRPA (for instance, using vSphere File > Export >
Export System Logs command) may cause inconsistencies in the datastore or other
unexpected behavior.
Before creating a VMware snapshot of a vRPA, it must be detached from the vRPA
cluster.
Upgrading VMware tools on the vRPAs is strictly unsupported and may lead to
undesired results.
Any online or offline porting of vRPA is supported without affecting high availability,
except for suspend or resume operations, which can lead to corruption (especially if
the vRPA is the cluster leader for the consistency groups).
Licensing is enforced per vRPA cluster, using the RPA cluster ID.
When reaching license limit, users will not be able to protect new VMs. but replication
of existing VMs will continue.
d. In the Add Machine dialog box, enter a new machine name, and click Save.
Note: A machine name is like a folder, it is used to group items together logically. A
unique machine name must be specified for each entitlement.
e. In the Register screen, verify the machine name, and click Next.
f. In the Activate screen, enter the Locking ID, and click Next.
Note: The Locking ID is the field that is displayed under the Machine Information
column; its value is the entity the license is enforced to, namely, the RPA Cluster ID.
To find the RPA Cluster ID value, create an SSH connection to the cluster
management IP, log in as an admin user (user: admin password: admin), and run
the get_internal_cluster_uids command or call EMC Licensing support.
g. In the Confirm screen, enter the email addresse(s) of the recipients of the license
file in the Email to field of the Additional Email Options section, and click Finish.
Separate multiple email addresses with commas.
h. In the Complete screen, click Save to File to download the license file and save the
file locally. The resulting license file has a *.lic extension and is in plain text format
(can be opened in any text editor).
i. Redo this procedure for all inactive entitlement in each LAC email.
3. After you have turned all of your entitlements into license files, physically transfer the
license file(s) to the computer from which you will be running RecoverPoint for VMs.
4. Install your license file(s) in RecoverPoint for VMs:
a. In the vSphere Web Client home page, click the RecoverPoint for VMs Management
icon > Administration tab > Licenses tab.
b. Select the vRPA cluster to which you wish to add a license. Click Add.
c. In the Add license dialog box, enter the location of the license file or click Browse
to locate the file. Click OK.
Enabling support
To enable support, configure the system reports and alerts mechanism (SyR) to provide
communication between your RecoverPoint for VMs system and the EMC System Reports
database (ESRS).
Before you configure system reports or alerts, note the following:
To send system reports and alerts, a permanent RecoverPoint license must exist in the
system; system reports and alerts will not work with a temporary license.
Best practice is to keep both system reports and alerts, and compression and
encryption enabled.
System reports and alerts will only be sent provided a valid method of transfer (SMTP,
ESRS, or FTPS) is configured.
To transfer system reports and alerts using SMTP or ESRS, ensure that port 25 is open
and available for SMTP traffic.
To transfer system reports and alerts using FTPS, ensure that ports 990 and 989 are
open and available for FTPS traffic.
Protecting VMs
Before protecting VMs:
Register at least one datastore for the copy and production journals.
Limitations
Two VMs in one consistency group cannot have the same name, even if one is in the
production cluster and the other is a replica.
When RecoverPoint for VMs automatically creates a VM at the target, the following
limitations apply:
All VMDKs will be mapped to a single datastore. You can select the datastore, but
you cannot select different datastores for different VMDKs.
You cannot use the Protection wizard to select different networks for each NIC in
the replica VM. VM network connections can be changed afterwards in the vSphere
GUI.
If changes are made to the memory or CPU resources of a protected (production)
VM, those changes will not be replicated to the copy VM. The same changes can be
made manually at the copy VM.
9. In the Configure journal provisioning screen, define the minimum journal size for the
consistency group and select the datastore to use as the journal:
a. Select a copy from the table of copies.
b. Define the minimum journal size for that copy.
All datastores at that copy with at least as much available space as you defined is
listed in the list of datastores.
c. Choose to automatically select the optimal datastore for the journal from the list or
to manually select a datastore from the list.
d. If you want to register additional datastores, click the Add button and select the
datastores you wish you register from the candidate datastores table.
You cannot proceed to the next step until all copy icons in the copy list show a green
check mark to indicate that a journal has been provisioned for each copy.
10. In the Summary screen, review previously selected values. To complete the wizard and
start replication from the production to the copy, click Finish.
The Manage > RecoverPoint for VM screen appears and displays a graphical
representation of the consistency group. The consistency group begins an
initialization process and its progress is displayed. After initialization, the consistency
group becomes active.
Stopping protection
To stop replicating the production VM:
1. In the vSphere Web Client home page, click the RecoverPoint for VMs Management
icon > Protection tab. Click Virtual Machines.
2. Select the VM you wish to stop replicating. Click the Unprotect icon:
Replication stops and the VM is removed from its consistency group. The copy VM is
not automatically deleted. If there are no other VMs in the consistency group, the
consistency group is removed.If other VMs remain in the consistency group, the
journal will be lost.
11
3. In the Add Group Set dialog box, enter a name for the group set.
4. Choose the vRPA cluster from which to select consistency groups.
5. Select one or more consistency groups to add to the group set.
6. To enable parallel bookmarking, select Enable Parallel Bookmarking and set the
bookmarking frequency value. Click OK.
You cannot enable parallel bookmarking for a group set if any of the groups in the
group set are part of another group set that has parallel bookmarking enabled.
12
Applying a bookmark
A bookmark is a manual snapshot with a label you apply to it to identify it for future use.
Bookmarks can be applied to consistency groups or group sets.
To apply a bookmark:
1. In the vSphere Web Client home page, click the RecoverPoint for VMs Management
icon > Protection tab. Click Consistency Groups to apply a bookmark to a consistency
group or Group Sets to apply a bookmark to a group set.
2. Select the consistency group or group set to which you wish to apply a bookmark.
Click the Add bookmark icon:
Description
Bookmark name
Mark as
Crash-Consistent:
Consolidation policy
Daily:
Weekly:
Monthly:
13
2. Select the group set you wish to edit. Click the Edit Group Set icon:
3. In the Edit Group Set dialog box, if desired, modify the group set name.
4. Select or clear the checkboxes of consistency groups to include or exclude them from
the group set.
5. Enable or disable parallel bookmarking by selecting or clearing the Enable Parallel
Bookmark checkbox. Click OK.
You cannot enable parallel bookmarking for a group set if any of the groups in the
group set are part of another group set that has parallel bookmarking enabled.
14
Description
Name
Primary RPA
The vRPA that you prefer to replicate the consistency group. When
the primary vRPA is not available, the consistency group will switch
to another vRPA in the vRPA cluster. When the preferred vRPA
becomes available, the consistency group will switch back to it.
Group priority
4. Click on the Edit link policy link to edit the link policy protection settings:
Table 3 Link policy options
Options
Description
Replication Mode
Dynamic by Latency
15
Description
Dynamic by
Throughput
RPO
Compression
To load an existing link policy template, click on the Load link policy from template
link and select the template.
16
4. Click on the Edit copy policy link to edit the copy policy protection settings:
Table 4 Copy policy options
Option
Description
Journal Compression
Default = none
Compresses snapshots in the journal so that more images can be
saved in the same journal capacity. Best practice is to compress
the journal when forcing synchronous replication.
Compression impacts the CPU resources of the target vRPA of the
consistency group.
Enabling journal compression while a consistency group is enabled
will result in the loss of all snapshots in the journal.
The protection window indicates how far in time the copy image
can be rolled back.
Select this to define a required protection window and specify the
length of the required window. You will be notified if the current
window is less than the required window.
Enable RecoverPoint
Snapshot
Consolidation
Do not consolidate
any snapshots for at
least
Default = 2 days
Define the period during which snapshot data is not to be
consolidated. If no daily or weekly consolidations are specified,
the remaining snapshots are consolidated monthly.
Consolidate
snapshots that are
older than x to one
snapshot per day for
y days
Default = 5 days
Snapshots are consolidated every 24 hours.
Select Indefinitely to consolidate all subsequent snapshots in
24-hour intervals. If Indefinitely is not selected, and no weekly
consolidations are specified, the remaining snapshots are
consolidated monthly. If Indefinitely is selected, weekly and
monthly consolidations are disabled, and the remaining snapshots
are consolidated daily.
Consolidate
snapshots that are
older than x to one
snapshot per week
for y weeks
Default = 4 weeks
Snapshots are consolidated every seven days.
Select Indefinitely to consolidate all subsequent snapshots in
seven-day intervals. If Indefinitely is not selected, the remaining
snapshots are consolidated monthly. If Indefinitely is selected,
monthly consolidations are disabled, and the remaining snapshots
are consolidated weekly.
To load an existing copy policy template, click on the Load copy policy from template
link and select the template.
17
18
Transfer State
Description
Active
Active (Synchronized)
Paused by system
Init (n%)
Description
High-load (n%)
High-load
N/A
Data is not being transferred to a copy, because the copy has been
disabled by the user
Hover your mouse over a transfer state in the chart to see the names of the consistency
groups in that state.
19
These recovery actions are directed by the Recovery Wizard and can be performed on
consistency groups or group sets. The initial steps of the wizard, including selecting the
image to access and testing the network, are the same for all recovery actions. The actual
recovery action is performed at the end of the wizard, after testing the copy.
When you select the latest image, the latest available image of each relevant group
copy will be accessed.
When you select an image from the image list, the images of the first group in the
group set are displayed. When you select an image from the list, RecoverPoint for VMs
constructs a search query containing the parameters of the first groups image and
queries the rest of the groups in the group set according to those parameters.
When you select a specific point in time by entering advanced search options, the
same image search query is sent for all groups in the group set.
Testing a copy
To enter image access mode and test a copy:
1. In the vSphere Web Client home page, click the RecoverPoint for VMs Management
icon > Protection tab. Click the Test Copy icon:
20
Setting
Description
Description
Description
Isolated network
Shared network
5. In the Enable Access screen, detailed information about the selected copy is
displayed. Activate image access mode by clicking Enable Access.
The Enabling progress bar will indicate the progress of image access. You can close
the wizard without interfering with the process. You can reopen the wizard from the
Recovery Activities widget on the Dashboard.
After image access is enabled, the buffer progress bar indicates how long you can
access the copy image before the image access log is full and all writes to the copy fail.
When in image access mode, data is not distributed from the journal to the copy. When
image access is enabled, there is a pause in distribution while the system rolls to the
specified image. The length of the delay depends on your storage capabilities and how far
the selected image is from the current image.
21
Failing over
The Recovery wizard guides you through the process of selecting a copy image, testing it,
and failing over to the copy or failing back to the production from the selected image.
To enter image access mode and then fail over to the copy:
1. In the vSphere Web Client home page, click the RecoverPoint for VMs Management
icon > Protection tab. Click the Fail Over icon:
Description
22
Description
Isolated network
Shared network
5. In the Enable Access screen, detailed information about the selected copy is
displayed. Activate image access mode by clicking on the Enable Access button.
The Enabling progress bar will indicate the progress of image access. You can close
the wizard without interfering with the process. You can reopen the wizard from the
Recovery Activities widget on the Dashboard.
Once image access is complete, click Finish to activate failover.
After failover:
The production and copy VM names do not change, but the VM roles do. Therefore,
after failover, the new production VM will still be <YourVMName>.copy and the new
copy VM name will still be <YourVMName>.
The marking information in the production journal is deleted, the copy journal is
deleted, and the consistency group undergoes a full sweep synchronization.
Before you fail back to the production, the Recovery wizard is displayed to enable you
to select an image at the production that predates your failover, and to verify the
image before permanently selecting it as the image you want to fail back to.
Recovering production
Use Recover Production to correct file or logical corruption by rolling the production back
to a previous point-in-time. The Recovery wizard guides you through the process of
selecting a copy image, testing it, and recovering the production from the selected image.
To recover production:
1. In the vSphere Web Client home page, click the RecoverPoint for VMs Management
icon > Protection tab. Click the Recover Production icon:
23
3. In the Select Image screen, select the image to access. You may want to start with the
last image known to be valid.
Table 10 Select Image options
Setting
Description
Description
Isolated network
Shared network
5. In the Enable Access screen, detailed information about the selected copy is
displayed. Activate image access mode by clicking on the Enable Access button.
The Enabling progress bar will indicate the progress of image access. You can close
the wizard without interfering with the process. You can reopen the wizard from the
Recovery Activities widget on the Dashboard.
Once image access is complete, click Finish to activate production recovery. During
production recovery, host access to storage is blocked.
24
The group undergoes a short initialization process to synchronize the new production
data at the copy.
25
Troubleshooting
Troubleshooting
This section describes troubleshooting issues and procedures for solving them in
RecoverPoint for VMs.
Detecting bottlenecks
Bottleneck detection returns statistics about RecoverPoint for VMs system performance,
by consistency group, vRPA, and vRPA cluster. Bottleneck detection analyzes the system
data in an effort to detect the existence of any of the predefined problem types, called
bottlenecks. The types of bottlenecks are presented in Table 12.
Table 12 Bottleneck detection
Type
Output/Notes
RPAs are not balanced., with data on the load handled by each vRPA at the
cluster.
Note: vRPA balance is checked only if the time period defined is greater than
30 minutes.
Compression
Compression level is too high. The RPA resources cannot handle the current
level.
SAN target
RPA may be regulating the application. Consider reducing RPA load., with
data on the total amount of incoming data, the number of writes, and the
amount of incoming data per write.
RPA utilization
Writing to the local journal volume was slow during this period., with data
on the delay factor.
Journal phase 1
Journal is unable to handle the incoming data rate., with the required I/O
rates for the journal and the replication volumes at local or remote copies, for
both normal and fast-forward distribution modes.
Journal phase 2
Journal and replication volumes are unable to handle the incoming data
rate., with data on the required I/O rates for the journal and the replication
volumes at local or remote copies, for both normal and fast-forward
distribution modes.
Journal regulation
Remote storage is too slow to handle incoming data rate and regulate the
distribution process. with data on the required I/O rates for the journal and
the replication volumes at local or remote copies, for both normal and
fast-forward distribution modes.
Unknown
distribution
problem
26
Target cluster cannot handle the incoming data rate., with the required I/O
rates for the journal and the replication volumes at the remote cluster, for
both normal and fast-forward distribution modes.
Troubleshooting
Output/Notes
Slow WAN
WAN is too slow., with data on total throughput for the cluster, the identity
of the RPAs at which the problem appeared, and the throughput of that RPA
(or RPAs).
Note: A slow WAN bottleneck is detected by group, but generates data by
cluster and vRPA.
Link utilization
To detect bottlenecks:
1. Use an SSH client to connect to the vRPA management IP address, and enter your
RecoverPoint username and password to log into the CLI.
2. Run the detect_bottlenecks command to check for any bottlenecks. To view command
parameters that can refine your search, run:
detect_bottlenecks ?
Load balancing
Load balancing is the process of assigning preferred vRPAs to consistency groups so that
the preferred vRPA will perform data transfer for that group. This is done to balance load
across the system and to prevent the system from entering a high-load state.
You should perform load balancing:
When a new consistency group is added to the system. Wait one week after the new
group is added to the system for a long enough traffic history to accumulate before
you perform load balancing.
When a new vRPA is added to a cluster. Perform load balancing immediately after the
vRPA is added.
If the system enters high load frequently. When load balancing is required, the event
logs will display a message indicating so. When you see this message, perform load
balancing.
If the bottleneck detection tool recommends it. When load balancing is required, the
detect_bottlenecks CLI command returns RPAs are not balanced. When you see this
message, perform load balancing.
Periodically. To ensure that your system is always handling distributing loads evenly, a
script can be created to periodically perform load balancing.
27
Troubleshooting
1. Use an SSH client to connect to the vRPA management IP address, and enter your
RecoverPoint username and password to log into the CLI.
2. Run the balance_load command to balance the load. To view command parameters
that can refine your search, run:
balance_load ?
System alerts
To view system errors and warnings, in the vSphere Web Client home page, click the
RecoverPoint for VMs Management icon > Dashboard tab > Alerts widget.
28
Getting help
Getting help
EMC support, product, and licensing information can be obtained as follows:
Product information For documentation, release notes, software updates, or
information about EMC products, go to EMC Online Support at:
https://support.emc.com
Technical support Go to EMC Online Support and click Service Center. You will see
several options for contacting EMC Technical Support. Note that to open a service request,
you must have a valid support agreement. Contact your EMC sales representative for
details about obtaining a valid support agreement or with questions about your account.
Copyright 2014 EMC Corporation. All rights reserved. Published in the USA.
Published November 2014
EMC believes the information in this publication is accurate as of its publication date. The
information is subject to change without notice.
The information in this publication is provided as is. EMC Corporation makes no representations or
warranties of any kind with respect to the information in this publication, and specifically disclaims
implied warranties of merchantability or fitness for a particular purpose. Use, copying, and
distribution of any EMC software described in this publication requires an applicable software
license.
EMC2, EMC, the EMC logo, and EMC RecoverPoint are registered trademarks or trademarks of EMC
Corporation in the United States and other countries. All other trademarks used herein are the
property of their respective owners.
For the most up-to-date regulatory document for your product line, go to EMC Online Support
(https://support.emc.com).
29