Exchange 2010 DAG - 1 Active Node and 2 passive nodes

July 9, 2013, 2:34 am

≫ Next: resynchronizing database copy fails! High Copy Queue Length

≪ Previous: Restore Exchange Backup on Different DAG

Hi All,

I currently have Exchange 2010 DAG with 2 nodes and no load balancer. An active node is on-site and the passive node (with manual failover) is at our DR site. Each Server holds all the Exchange roles.

I'm looking at installing a new passive node on-site (containing all the roles).

Questions:

Do I need a load balancer if third node is a passive node?
Has anyone got this setup and anything that I need to consider?

Thanks in advance for your time and help! :)
Amar

↧

resynchronizing database copy fails! High Copy Queue Length

July 11, 2013, 8:35 am

≫ Next: Exchange 2013 in a Windows 2012 Hyper-V Cluster - What are my High availability options?

≪ Previous: Exchange 2010 DAG - 1 Active Node and 2 passive nodes

Exchange 2010 SP2 2 host DAG.

The mounted database has a clean shutdown state.

I reseeded the database copy and that was sucessful.

However when it tries to do resynchronizing the copy queue length continues to grow and it fails.

I'm also seeing this in the logs

The active copy of Mailbox Database Faculty has a missing or corrupted log file (E030042A9B0.log). To keep this copy healthy, replication will restart at generation 4368817. A full backup must be performed before an incremental backup is possible.

Should I delete the database copy and create a new one and let it reseed?

↧

Exchange 2013 in a Windows 2012 Hyper-V Cluster - What are my High availability options?

July 12, 2013, 6:41 pm

≫ Next: DAG multi-site design with 2 DAG member in one site and one DAG member on the recovery site

≪ Previous: resynchronizing database copy fails! High Copy Queue Length

Hi All.

I have a few questions regarding a deployment that we are planning to do in a new installation. Currently we are running a 4 node Hyper-V 2012 cluster connected to iSCSI storage. The general details are:

Nodes in cluster: 4 / Physical OS: Windows 2012 Datacenter Edition / Storage: iSCSI

iSCSI Storage Network: 172.128.x.x (isolated, not routed)

Production Network LAN: 10.252.x.x

Amount of mailboxes to create: 1000

Exchange Server version: 2013

Virtual Machines: All data is stored on the iSCSI SAN. No local storage can/will be used.

VMs Running Exchange 2013 Mailbox Role (desired): One in blade1 and One in blade2

VMs Running CAS role(desired): One in blade3 and one in blade4

Goals:

1- To balance the load of the users in two MBX VMs (500 mailboxes each)

2- To use 2 CAS VMs to balance load

3- To protect each MBX VM databases in case of failure (with a database copy??)

4- (future) To create a replica/failover on another physical distant site.

Questions:

Should I use DAGs even if I am using a cluster and all data will reside on the iSCSI SAN?

What kind of recommendations exist for such scenarios?

I´ve only seen documents talking about hyperv2008 and exchange 2010 and since there are so many changes in both products, I better ask before i begin setup.

Thanks in advance.

↧

DAG multi-site design with 2 DAG member in one site and one DAG member on the recovery site

July 15, 2013, 11:03 am

≫ Next: Mailboxdatabase gets ContentIndexState AutoSuspended when moved

≪ Previous: Exchange 2013 in a Windows 2012 Hyper-V Cluster - What are my High availability options?

Hi,

As the issue description states, I have a DAG design of two MBX servers (DAG members) in the primary site and one MBX server (DAG member) in the recovery site. The FWS is currently in the primary site.

I want to make sure that if, for example, the primary site is no longer reachable, the single DAG member on the recovery site will take over the task of serving mailboxes and that the mailbox stores will be mounted correctly. Should I create an alternate FSW in the recovery site? What is the best procedure?

Thanks

Jean D.

↧

Mailboxdatabase gets ContentIndexState AutoSuspended when moved

June 25, 2013, 6:02 am

≫ Next: Exchange 2010 DAG across sites - different subnets vs stretched VLAN

≪ Previous: DAG multi-site design with 2 DAG member in one site and one DAG member on the recovery site

When i move an active database with "Move-ActiveMailboxDatabase -MountDialOverride:None" in my Exchange 2013 CU1 DAG the source servers ContentIndex gets "AutoSuspended"

If i try to "update-mailboxdatabasecopy -catalogonly" i get:
WARNING: Seeding of content index catalog for database 'DB02' failed. Please verify that the Microsoft Search
(Exchange) and the Host Controller service for Exchange services are running and try the operation again. Error: There
was no endpoint listening at
net.tcp://localhost:3863/Management/SeedingAgent-ABF80EC7-D4D5-4D12-93D8-6B13D8996EC712/Single that could accept the
message. This is often caused by an incorrect address or SOAP action. See InnerException, if present, for more
details..

Restarting services does not help.
Nor did: http://support.microsoft.com/kb/2807668

The only thing making the ContentIndex Healty is rebooting the server...

Any clues?

↧

Exchange 2010 DAG across sites - different subnets vs stretched VLAN

July 16, 2013, 12:47 pm

≫ Next: exchange 2007, how to force a mailbox server to use a transport server in a different ad site.

≪ Previous: Mailboxdatabase gets ContentIndexState AutoSuspended when moved

In the scenario where you have an Exchange 2010 DAG (using Windows 2008 R2 failover clustering) with members servers in two AD sites where "Site A" is the normally active site and "Site B" is the DR site, does Microsoft actually publish any guidance or recommendations as far as using separate subnets in the two sites vs stretching a "Site A" subnet via VLAN?

I think most implementations use separate subnets however I'm aware that some have implemented this using a stretched VLAN. Seems to me that the latter approach is more complicated as well as introducing potential issues during a switchover to "Site B" during a DR - i.e. the DAG member at "Site B" belongs to a subnet (AD site) from the primary data center and all servers (including domain controllers) at "Site A" are down.

Is anyone aware of any specific guidance or recommendations from Microsoft regarding these two approaches.

Not looking for speculation, just some specifics from Microsoft, thanks.

Sam

↧

exchange 2007, how to force a mailbox server to use a transport server in a different ad site.

July 16, 2013, 12:22 pm

≫ Next: Exchange logs not flushing after backup

≪ Previous: Exchange 2010 DAG across sites - different subnets vs stretched VLAN

We have two ad sites, each with a mailbox/hub server and a cas/hub server. The cas/hub in each location is setup with the only send connector so the mailbox/hub in each location send all traffic through them and out through the send connector. This works currently and all is well. However, if the hub fails on the hub/cas server then mail flow in that site comes to a halt. How can I force the mailbox/hub to redirect traffic to the cas/hub in the other ad site so that mail will continue to flow? I tried setting the SubmissionServerOverrideList of the mailbox server and it didn't appear to work.

↧

Exchange logs not flushing after backup

July 16, 2013, 1:39 pm

≫ Next: Minimum rights for backing up Exchange

≪ Previous: exchange 2007, how to force a mailbox server to use a transport server in a different ad site.

Problem: Exchange logs are not flushing after backup exec runs full backup.

The backup in backup exec 2010 R3 Is successful. It is also set to do full backup/flush logs

The logs fill up to around 50,000 files or more. How do you fix this?

ENV: exchange 2010 SP2 Rollup 4 Dag.

I see the following in the logs on database server when backup exec is complete.

Information Store (2972) Mailbox Database Faculty: The backup procedure has been successfully completed.

Information Store (2972) There were 75299 log file(s) not found in the log range (D:\Program Files\Microsoft\Exchange Server\V14\Mailbox\Mailbox Database Faculty\E030003BA9C.log - D:\Program Files\Microsoft\Exchange Server\V14\Mailbox\Mailbox Database Faculty\E030004EA5F.log) that we attempted to truncate.

↧

Minimum rights for backing up Exchange

July 15, 2013, 5:36 am

≫ Next: copyqueuelength continues to grow on database copy!

≪ Previous: Exchange logs not flushing after backup

Hi,

What is the bare minimum rights required to backup an exchange database? Everywhere I read, I always come across KBs saying that the account should either be Organization Admin or even worse, Domain Admin. I have a really hard time accepting this as the minimum rights as it would grant way too much power to a single account used by the backup administrator.

I am certain that would provision a new security group with just the proper required cmdlets but I can't seem to find anything regarding this.

Anyone has any insight?

Thanks,

Michel

↧

copyqueuelength continues to grow on database copy!

July 17, 2013, 6:48 am

≫ Next: DAG Member used for Voting ONLY?

≪ Previous: Minimum rights for backing up Exchange

Exchange 2010 Sp2 Rollup 4

Just rebuilt a database copy.

The status shows "resynchronizing" but the copy queue length is 510255 and growing.

Eventually the status will go to failed.

How do you force the copy queue length to go down?

The mounted database is healthy and has a clean shutdown state!

↧

DAG Member used for Voting ONLY?

July 18, 2013, 2:20 am

≫ Next: Exchange 2013 DAG _ High Availability

≪ Previous: copyqueuelength continues to grow on database copy!

Hey there,

I currently have a 2 member DAG using a File Share Witness Quorum. I have already experienced the file share go offline and cause problems to my cluster, resulting in a node failure... This may or may not have been completely the fault of the fsw going offline, however it did get me thinking of the alternate quorum methods.

Would it be wise/problematic/other to have a 3 DAG member (in 1 site), but the 3rd member will NOT have any database copies, it will be used for node majority only?

If you can help me understand why this would be a good or bad idea, please do.

ps. a short blunt sentence either way is not helpful.

Environment:
Hardware Load Balancer
Exchange 2010 SP3
2x CAS servers in CAS Array
2x MB servers in DAG
WS2008R2
HyperV VMs
iSCSI Storage

Andrew Huddleston | Hillsong Church | Sydney

↧

Exchange 2013 DAG _ High Availability

July 18, 2013, 6:18 am

≫ Next: Exchange 2013 CAS loadbalancing

≪ Previous: DAG Member used for Voting ONLY?

I am in the process of setting up an Exchange 2013 Cluster in our organisation.

Already I have a MBX/CAS server on a stand-alone basis.

I now want to create a DAG system, so that we have 2 servers with mailbox server roles. One server shall be active while the other passive.

Should the active server fail for whatever reason, then it should failover to the passive server.

I did some research on how to set this up and have been unable to decide how best tp approach thsi scenario.

I would like to know this - do I just go ahead and create a second MBX/CAS exchange server, then create a DAG on the two?

Or do I need to do it the way we did in Exchange 2010 - 1 cas server, 2 mailbox database servers?

Any help on this shall be appreciated.

This question may sound trivial, but I am not an exchange guru.

Thanks

↧

Exchange 2013 CAS loadbalancing

March 29, 2013, 2:32 am

≫ Next: Exchange 2010 Cross-Site DAG

≪ Previous: Exchange 2013 DAG _ High Availability

Hi

Just a little question to verify if I have understood things correct :-)

In 2013, CAS is automatic loadbalanced via AD for Outlook clients? Or?
Only for OWA, ECP, ActiveSync devices I can loadbalancing via eighter, DNS RR, WNLB or a HWLB?

I have set up a testlab at home with 2 cas and 2 mb servers, and the DAG is as expected, but I am in doubt what goes for the CAS, since we don't have a CAS array object anymore. I don't know how Outlook finds the CAS?

BR
Steen

↧

Exchange 2010 Cross-Site DAG

July 20, 2013, 8:28 am

≫ Next: Cluster issue Exchange 2010 Sp3

≪ Previous: Exchange 2013 CAS loadbalancing

Hi,

I am looking to see is someone can confirm my theory on my setup for my Exchange 2010 DAG setup. I am currently building an active/passive setup. It consists of a two site datacenter setup with site A the active site made up of one mailbox server, two CAS/Hub servers which are load balanced. Site B which is the passive/DR site which too is also made up of one mailbox server, two CAS/Hub servers which are load balanced.

Now the idea here is to span a DAG across the two sites and the two mailbox servers. Now from what I read the FSW should be located in the primary site in this case A. Now being the FSW is in site A if there is a WAN issue or Power Outage quorum would be retained there assuming the servers came back up and or didn't go down. While site B would just unmount its database since it can’t talk to the FSW and thus cannot hold quorum. But if just the database fails on site A then in theory it should activate in site B.

Now something suggested by a colleague of mine was to locate the FSW in another site completely in this case site C. With the theory being that if site A fails i.e. power failure/WAN outage. It loses quorum and because site B is still online and can talk to the FSW it now holds quorum and the databases would mount. But reading the following Exchange team blog:

http://blogs.technet.com/b/exchange/archive/2011/05/31/exchange-2010-high-availability-misconceptions-addressed.aspx

It seems to suggest that would not work. Further to that I was also reading up on DAC mode and came accross the following article:

http://www.exchangeinbox.com/article.aspx?i=172

After reading it I starting to think that I was misinterpreting how my setup would behave. And that because both exchange setups are in completely different sites i.e. in Active Directory/physical locations. This automatic switch over in the event of a full site failure that we were looking to achieve would in fact not happen. That would then lead us to have to perform a data center switch over as outline in countless other technet and blogs posted online.

In in the end is my conclusion correct? Because of the two site set up the whole “automatic” fail over wouldn't occur regardless of the placement of the FSW server? And that the whole automatic failover only applies to mailbox servers within the same site?

↧

Cluster issue Exchange 2010 Sp3

July 23, 2013, 12:43 am

≫ Next: Exchange SCR target lost disk. How to recover?

≪ Previous: Exchange 2010 Cross-Site DAG

All,

Issue while online the cluster resource.

Error Below

Cluster IP address resource 'IPv4 Static Address 2 (Cluster Group)' cannot be brought online because the cluster network 'Cluster Network 2' is not configured to allow client access. Please use the Failover Cluster Manager snap-in to check the configured properties of the cluster network.

I select the "Allow clients to connect through this network" but it automatically unticked again. I am thinking to change the Role value from 1 to 3.

Before doing it via registry is there any way to resolve it.

↧

Exchange SCR target lost disk. How to recover?

July 23, 2013, 6:37 am

≫ Next: Server Crash Restore EDB Mailboxes

≪ Previous: Cluster issue Exchange 2010 Sp3

I have a 250gb Exchange 2007 mailbox that has an offsite SCR target. Replication was healthy until yesterday when I lost my iscsi connection on the target's disk. I have this back now but test-replicationhealth states that log files are missing and I need to use the update-storagegroupcopy cmdlet.

Is there a way to do this without clearing out the entire target and reseeding over the WAN?

↧

Server Crash Restore EDB Mailboxes

July 23, 2013, 2:58 am

≫ Next: Exchange 5.5 not starting

≪ Previous: Exchange SCR target lost disk. How to recover?

Ok, so my server at a well known webhost died for the second time, as they continue to refuse to replace the faulty hard drive I've moved to a new host, and got the new server up and running.

The backup catalogue of the old server was corrupt so can't restore the backup, I was lucky enough to grab the Exchange DB + Logs and have managed to mount it on the new server, whilst the new server is the same domain, obviously AD is seeing things with different GUID etc.

I have managed to get all users back online with email etc, however I need to recover their old email, 1 in particular, with the mailbox mounted and all healthy, how can I get Exchange 2013 to extract the Mailboxes from the old mounted EDB either by coping the mailbox or exporting as a PST. The mailboxes are not disconnected or deleted, they are in effect hanging in space. I've tried New-MailboxExportRequest (or what ever it is again) but that is only exporting the New Mailbox from the new EDB.

I am trying to do this for one mailbox only, so do not want to invest in 3rd party tools.

↧

Exchange 5.5 not starting

July 23, 2013, 7:42 am

≫ Next: event id 2038 on Exchange 2007 CCR

≪ Previous: Server Crash Restore EDB Mailboxes

I have Exchange 5.5 running on a NT 4.0 machine; exchange is not starting up and in event log I am getting an Event ID 124 (263) Unable to read the log header. Error -1022.

I do not have a good back-up

Thank you,

Matthew

↧

event id 2038 on Exchange 2007 CCR

July 23, 2013, 6:28 pm

≫ Next: Multi Site DAG (Can't add 2nd Node)

≪ Previous: Exchange 5.5 not starting

Hi all,

I encountered this problem every night after my scheduled backup ( we are using wbadmin.exe ) has completed. It appeared 7 times in the application log, which coincide with the number of storage group I have. Naturally, the logs are not truncated and hence filling up my drive.

It all began after we have rebuilt our passive node Exchange 2007 server. We are running on W2008 Ent 64-bit with sp2 and the Exchange 2007 is also on sp2. The same configuration also applies to the active node of the CCR configuration.

Any advice how to get rid of this error?

Thanks

↧

Multi Site DAG (Can't add 2nd Node)

July 23, 2013, 3:35 pm

≫ Next: Exchange server goes down... Export EDB into PST?

≪ Previous: event id 2038 on Exchange 2007 CCR

Hi,

I've trying to create a Exchange 2013 DAG, but can't had the second node. I'm using a single NIC in both servers. I know its not the best solution but, it is supported.

One server is a VM on VMWARE the other is a VM on HYPER-V. Can this be the problem, since the virtual nics are different?

I've looked at everything I can find online. I've tried to add the 2nd node throw the Failover Cluster Manager, but no luck.

Can someone help me?

Thanks

I get the following log file:

[2013-07-23T22:17:44] Working
[2013-07-23T22:17:44] GetRemoteCluster() for the mailbox server failed with exception = An Active Manager operation failed. Error: An error occurred while attempting a cluster operation. Error: Cluster API '"OpenCluster(EXCH13FIG01.private.euroatlantic.pt) failed with 0x6d9. Error: There are no more endpoints available from the endpoint mapper"' failed.. This is OK.
[2013-07-23T22:17:44] Ignoring previous error, as it is acceptable if the cluster does not exist yet.
[2013-07-23T22:17:44] DumpClusterTopology: Opening remote cluster EXCH13DAG01.
[2013-07-23T22:17:45] Dumping the cluster by connecting to: EXCH13DAG01.
[2013-07-23T22:17:45] The cluster's name is: EXCH13DAG01.
[2013-07-23T22:17:45] Groups
[2013-07-23T22:17:45] group: Available Storage [not a CMS]
[2013-07-23T22:17:45] OwnerNode: EXCH13BLR01.private.euroatlantic.pt
[2013-07-23T22:17:45] State: Offline
[2013-07-23T22:17:45] group: Cluster Group [Cluster Main Group]
[2013-07-23T22:17:45] OwnerNode: EXCH13BLR01.private.euroatlantic.pt
[2013-07-23T22:17:45] State: Online
[2013-07-23T22:17:45] Resource: Cluster IP Address [Online, type = IP Address, PossibleOwners = EXCH13BLR01 ]
[2013-07-23T22:17:45] Address = [10.0.0.27]
[2013-07-23T22:17:45] EnableDhcp = [0]
[2013-07-23T22:17:45] Network = [Cluster Network 1]
[2013-07-23T22:17:45] Resource: Cluster Name [Online, type = Network Name, PossibleOwners = EXCH13BLR01 ]
[2013-07-23T22:17:45] NetName = [EXCH13DAG01]
[2013-07-23T22:17:45] Nodes
[2013-07-23T22:17:45] node: EXCH13BLR01.private.euroatlantic.pt [ state = Up ]
[2013-07-23T22:17:45] Subnets
[2013-07-23T22:17:45] Name(Cluster Network 1), Mask(10.0.0.0/23), Role(ClusterNetworkRoleInternalAndClient)
[2013-07-23T22:17:45] NIC 10.0.0.9 on Node EXCH13BLR01 in State=Up
[2013-07-23T22:17:45] Opening the cluster on nodes [exch13blr01].
[2013-07-23T22:17:45] Other mailbox servers in the DAG are already members of cluster 'EXCH13DAG01'
[2013-07-23T22:17:45] The server EXCH13FIG01 does not belong to a cluster, and the other servers belong to EXCH13DAG01.
[2013-07-23T22:17:45] Successfully resolved the servers based on the stopped servers list.
[2013-07-23T22:17:45] The following servers are in the StartedServers list (The list is the StartedServers property of the DAG in AD):
[2013-07-23T22:17:45] The following servers are in the StoppedServers list:
[2013-07-23T22:17:45] Verifiying that the members of database availability group 'EXCH13DAG01' are also members of the cluster.
[2013-07-23T22:17:45] Verifying that the members of cluster 'EXCH13DAG01' are also members of the database availability group.
[2013-07-23T22:17:45] According to GetNodeClusterState(), the server EXCH13FIG01 is NotConfigured.
[2013-07-23T22:17:45] The CNO is currently Online.
[2013-07-23T22:17:45] InternalValidate() done.
[2013-07-23T22:17:47] Updated Progress 'Adding server 'EXCH13FIG01' to database availability group 'EXCH13DAG01'.' 6%.
[2013-07-23T22:17:47] Working
[2013-07-23T22:17:47] Updated Progress 'Adding server 'EXCH13FIG01' to the cluster.' 8%.
[2013-07-23T22:17:47] Working
[2013-07-23T22:24:07] The operation wasn't successful because an error was encountered. You may find more details in log file "C:\ExchangeSetupLogs\DagTasks\dagtask_2013-07-23_22-17-37.824_add-databaseavailabiltygroupserver.log".
[2013-07-23T22:24:07] WriteError! Exception = Microsoft.Exchange.Cluster.Replay.DagTaskOperationFailedException: A server-side database availability group administrative operation failed. Error The operation failed. CreateCluster errors may result from incorrectly configured static addresses. Error: An error occurred while attempting a cluster operation. Error: Cluster API '"AddClusterNode() (MaxPercentage=100) failed with 0x5b4. Error: This operation returned because the timeout period expired"' failed.. ---> Microsoft.Exchange.Cluster.Replay.AmClusterApiException: An Active Manager operation failed. Error: An error occurred while attempting a cluster operation. Error: Cluster API '"AddClusterNode() (MaxPercentage=100) failed with 0x5b4. Error: This operation returned because the timeout period expired"' failed. ---> System.ComponentModel.Win32Exception: This operation returned because the timeout period expired
--- End of inner exception stack trace ---
at Microsoft.Exchange.Cluster.ClusApi.AmCluster.AddNodeToCluster(AmServerName nodeName, IClusterSetupProgress setupProgress, IntPtr context, Exception& errorException, Boolean throwExceptionOnFailure)
at Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNodeInternal(HaTaskStringBuilderOutputHelper output, AmServerName mailboxServerName, String& verboseLog)
--- End of inner exception stack trace (Microsoft.Exchange.Cluster.Replay.AmClusterApiException) ---
at Microsoft.Exchange.Cluster.Replay.DagHelper.ThrowDagTaskOperationWrapper(Exception exception)
at Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNodeInternal(HaTaskStringBuilderOutputHelper output, AmServerName mailboxServerName, String& verboseLog)
at Microsoft.Exchange.Cluster.Replay.NodeActionTracker.PerformNodeAction(ITaskOutputHelper output, AmServerName nodeName, NodeAction nodeAction, Action clusterAction)
at Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNode(AmServerName mailboxServerName, String& verboseLog)
at Microsoft.Exchange.Data.Storage.Cluster.HaRpcExceptionWrapperBase`2.RunRpcServerOperation(String databaseName, RpcServerOperation rpcOperation)
--- End of stack trace on server (EXCH13BLR01.private.euroatlantic.pt) ---
at Microsoft.Exchange.Data.Storage.Cluster.HaRpcExceptionWrapperBase`2.ClientRethrowIfFailed(String databaseName, String serverName, RpcErrorExceptionInfo errorInfo)
at Microsoft.Exchange.Cluster.Replay.ReplayRpcClientWrapper.RunAddNodeToCluster(AmServerName serverName, AmServerName newNode, String& verboseLog)
at Microsoft.Exchange.Management.SystemConfigurationTasks.AddDatabaseAvailabilityGroupServer.JoinNodeToCluster()
[2013-07-23T22:24:16] Updated Progress 'Done!' 100%.
[2013-07-23T22:24:16] COMPLETED
add-databaseavailabiltygroupserver explicitly called CloseTempLogFile().

↧