I have 2 Site DAG. In normal mode quorum is in the Main Site. PAM is also in the Main Site. Datacenter Activation mode – DAG Only.
![]()
After maintenance and reboot of server in DR Site, servers can’t determinate database status of each other. But replication and replaying logs works fine. Server in DR site can’t connect to PAM.
Get-DatabaseAvailabilityGroup -Status -Identity MYDAG | fl name,primaryActiveManager in DR Site takes very long time and can’t determinate PAM in the end. After a long time output is:
WARNING: Unable to get Primary Active Manager information due to an Active
Manager call failure. Error: An Active Manager operation failed. Error: An
Active Manager operation encountered an error. To perform this operation, the
server must be a member of a database availability group, and the database
availability group must have quorum. Error: Automount consensus not reached.
[Server: DRExch.mydomain.local]
Test-replicationhealth in DR Site
Server Check Result
Error
------ -----
------ -----
DRExch ClusterService Passed
DRExch ReplayService Passed
DRExch ActiveManager *FAILED* Active Manager is in an unknown state on server 'DRExch'. Basic database
administrative operations such as mounting or dismounting an…
DRExch TasksRpcListener Passed
DRExch TcpListener Passed
DRExch ServerLocatorService Passed
DRExch DagMembersUp Passed
DRExch DatabaseRedundancy *FAILED* Failures:...
DRExch DatabaseAvailability *FAILED* Failures:...
Get-MailboxDatabaseCopyStatus in DR Site.
Get-MailboxDatabaseCopyStatus * | where {$_.name -like "*DRExch" -or $_.name -like "*MAINExch"} | ft
Name
Status CopyQueue ReplayQueue LastInspectedLogTime
Length Length
----
------ --------- ----------- --------------------
DB1\MAINExch ServiceDown 0
0
DB2\MAINExch ServiceDown 0
0
DB3\MAINExch ServiceDown 0 0
DB4\MAINExch ServiceDown 0 0
DB5\MAINExch ServiceDown 0
0
DB6\MAINExch ServiceDown 0 0
DB7\MAINExch ServiceDown 0
0
DB5\DRExch Healthy 0
0 5/22/2016 11:57:37 AM
DB2\DRExch Healthy 0
0 5/22/2016 11:59:56 AM
DB4\DRExch Healthy 0
0 5/22/2016 11:49:19 AM
DB1\DRExch Healthy 0
0 5/22/2016 11:54:42 AM
DB3\DRExch Healthy 0
0 5/22/2016 11:55:45 AM
DB6\DRExch Healthy 0
0 5/22/2016 11:54:11 AM
DB7\DRExch Healthy 0
0 5/22/2016 11:50:58 AM
In main Site this command works well and says that the server in main site is PAM.
Get-DatabaseAvailabilityGroup -Status -Identity MYDAG | fl name,primaryActiveManager in MAIN Site
Get-DatabaseAvailabilityGroup -Status -Identity MYDAG | fl name,primaryactivemanager
Name : MYDAG
PrimaryActiveManager : MAINExch
Test-replicationhealth in MAIN Site
Server Check
Result Error
------ -----
------ -----
MAINExch ClusterService Passed
MAINExch ReplayService Passed
MAINExch ActiveManager Passed
MAINExch TasksRpcListener Passed
MAINExch TcpListener Passed
MAINExch ServerLocatorService Passed
MAINExch DagMembersUp Passed
MAINExch MonitoringService Passed
MAINExch ClusterNetwork Passed
MAINExch QuorumGroup Passed
MAINExch FileShareQuorum Passed
MAINExch DatabaseRedundancy *FAILED* Failures:...
MAINExch DatabaseAvailability *FAILED* Failures:...
Get-MailboxDatabaseCopyStatus in MAIN Site.
Name
Status CopyQueue ReplayQueue LastInspectedLogTime
Length Length
----
------ --------- ----------- --------------------
DB1\MAINExch Mounted 0
0
DB2\MAINExch Mounted 0
0
DB3\MAINExch Mounted 0 0
DB4\MAINExch Mounted 0
0
DB5\MAINExch Mounted 0
0
DB6\MAINExch Mounted 0 0
DB7\MAINExch Mounted 0 0
DB5\DRExch ServiceDown 0
0
DB2\DRExch ServiceDown 0
0
DB4\DRExch ServiceDown 0
0
DB1\DRExch ServiceDown 0
0
DB3\DRExch ServiceDown 0
0
DB6\DRExch ServiceDown 0
0
DB7\DRExch ServiceDown 0
0
MYDAG configuration is below
Get-DatabaseAvailabilityGroup MYGAG | fl
RunspaceId : b4e31455-78d2-43ee-bd92-e9f570403ce9
Name
: MYGAG
Servers
: {EXCHMain, EXCHDr}
WitnessServer : srv02cas15p.domain.local
WitnessDirectory : C:\MYGAG
AlternateWitnessServer : srv01cas15p.domain.local
AlternateWitnessDirectory : C:\MYGAG
NetworkCompression : InterSubnetOnly
NetworkEncryption : InterSubnetOnly
ManualDagNetworkConfiguration : False
DatacenterActivationMode : DagOnly
StoppedMailboxServers : {}
StartedMailboxServers : {EXCHDr.domain.local, EXCHMain.domain.local}
DatabaseAvailabilityGroupIpv4Addresses : {10.110.128.126, 10.100.128.126}
DatabaseAvailabilityGroupIpAddresses : {10.110.128.126, 10.100.128.126}
AllowCrossSiteRpcClientAccess : False
OperationalServers :
PrimaryActiveManager :
ServersInMaintenance :
ServersInDeferredRecovery :
ThirdPartyReplication : Disabled
ReplicationPort : 64327
NetworkNames : {}
WitnessShareInUse :
DatabaseAvailabilityGroupConfiguration :
AutoDagSchemaVersion : 1.0
AutoDagDatabaseCopiesPerDatabase : 1
AutoDagDatabaseCopiesPerVolume : 1
AutoDagTotalNumberOfDatabases : 0
AutoDagTotalNumberOfServers : 0
AutoDagDatabasesRootFolderPath : C:\ExchangeDatabases
AutoDagVolumesRootFolderPath : C:\ExchangeVolumes
AutoDagAllServersInstalled : False
AutoDagAutoReseedEnabled : True
AutoDagDiskReclaimerEnabled : True
AutoDagBitlockerEnabled : False
AutoDagFIPSCompliant : False
ReplayLagManagerEnabled : False
MailboxLoadBalanceMaximumEdbFileSize :
MailboxLoadBalanceRelativeLoadCapacity :
MailboxLoadBalanceOverloadedThreshold :
MailboxLoadBalanceUnderloadedThreshold :
MailboxLoadBalanceEnabled : False
AdminDisplayName :
ExchangeVersion : 0.10 (14.0.100.0)
DistinguishedName : CN=MYGAG,CN=Database Availability Groups,CN=Exchange
Administrative Group
(FYDIBOHF23SPDLT),CN=Administrative Groups,CN=domain,CN=Microsoft Exchange,
CN=Services,CN=Configuration,DC=domain,DC=local
Identity
: MYGAG
Guid
: 61920fc9-6496-4bc3-ab78-341eaedb2917
ObjectCategory : domain.local/Configuration/Schema/ms-Exch-MDB-Availability-Group
ObjectClass : {top, msExchMDBAvailabilityGroup}
WhenChanged : 09.02.2016
11:06:01
WhenCreated : 09.02.2016
9:55:54
WhenChangedUTC : 09.02.2016 8:06:01
WhenCreatedUTC : 09.02.2016 6:55:54
OrganizationId :
Id
: MYGAG
OriginatingServer : dcp.domain.local
IsValid
: True
ObjectState : Unchanged
There are errors in Microsoft - Exchange - Managed availability - monitoring on both sides for all databases. Example
The RPC to retrieve the status of database copy 'DB1\DRExch' failed. Error: A server-side administrative operation has failed. The Microsoft Exchange Replication service may not be running on server DRExch.domain.local. Specific RPC error message: Error
0x71a (The remote procedure call was cancelled) from RpccGetCopyStatusEx4
Connectivity between servers is OK. MS failover cluster also shows that servers are UP.
What can be the reason of this strange behavior and how can I fix it?