Channel: Exchange Server 2013 - High Availability and Disaster Recovery forum

Exchange 2010 DAG - 1 Active Node and 2 passive nodes

Hi All,

I currently have an Exchange 2010 DAG with 2 nodes and no load balancer. The active node is on-site and the passive node (with manual failover) is at our DR site. Each server holds all the Exchange roles.

I'm looking at installing a new passive node on-site (containing all the roles).

Questions:

  1. Do I need a load balancer if the third node is a passive node?
  2. Has anyone run this setup, and is there anything I need to consider?

Thanks in advance for your time and help! :)
Amar


AP


Resynchronizing database copy fails! High copy queue length

Exchange 2010 SP2 2 host DAG.

The mounted database has a clean shutdown state.

I reseeded the database copy and that was successful.

However, when it tries to resynchronize, the copy queue length continues to grow and the copy fails.

I'm also seeing this in the logs:

The active copy of Mailbox Database Faculty has a missing or corrupted log file (E030042A9B0.log). To keep this copy healthy, replication will restart at generation 4368817. A full backup must be performed before an incremental backup is possible.

Should I delete the database copy and create a new one and let it reseed?
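
The log file name and the generation number in that event line up, which can be checked in a few lines of Python. This is a minimal sketch, assuming the log name is a three-character prefix (here `E03`) followed by the generation in hexadecimal; the helper name `log_generation` is made up for illustration.

```python
def log_generation(file_name: str, prefix_len: int = 3) -> int:
    """Return the generation number encoded in a transaction log file name.

    Assumes names of the form <prefix><hex generation>.log, e.g. E030042A9B0.log.
    """
    stem = file_name.rsplit(".", 1)[0]   # drop the ".log" extension
    return int(stem[prefix_len:], 16)    # the remaining characters are hex digits


missing = log_generation("E030042A9B0.log")
print(missing)      # 4368816, the missing or corrupted log
print(missing + 1)  # 4368817, the first generation after it
```

The second value matches the generation at which the event says replication will restart, so the restart point is simply the first log after the damaged one.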

Exchange 2013 in a Windows 2012 Hyper-V Cluster - What are my High availability options?

Hi All.

I have a few questions regarding a deployment that we are planning to do in a new installation. Currently we are running a 4 node Hyper-V 2012 cluster connected to iSCSI storage. The general details are:

Nodes in cluster: 4 / Physical OS: Windows 2012 Datacenter Edition / Storage: iSCSI

iSCSI Storage Network: 172.128.x.x (isolated, not routed)

Production Network LAN: 10.252.x.x

Amount of mailboxes to create: 1000

Exchange Server version: 2013

Virtual Machines: All data is stored on the iSCSI SAN. No local storage can/will be used.

VMs Running Exchange 2013 Mailbox Role (desired): One in blade1 and One in blade2

VMs Running CAS role(desired): One in blade3 and one in blade4
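
One detail in the list above is worth a quick check: 172.128.x.x lies outside the RFC 1918 private block 172.16.0.0/12 (which ends at 172.31.255.255), so it is publicly assignable address space even if the iSCSI network is isolated. A small Python sketch with the standard `ipaddress` module shows this. The /16 prefix lengths are an assumption read from the "x.x" notation, not something the post states.

```python
import ipaddress

# The three RFC 1918 private ranges.
RFC1918 = [
    ipaddress.ip_network(block)
    for block in ("10.0.0.0/8", "172.16.0.0/12", "192.168.0.0/16")
]


def is_rfc1918(network: str) -> bool:
    """True if the whole network sits inside one of the RFC 1918 ranges."""
    net = ipaddress.ip_network(network)
    return any(net.subnet_of(private) for private in RFC1918)


# Prefix lengths (/16) are assumed for illustration.
for name, net in (("iSCSI", "172.128.0.0/16"), ("LAN", "10.252.0.0/16")):
    print(name, net, "private" if is_rfc1918(net) else "NOT private")
```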


Goals:

1- To balance the load of the users in two MBX VMs (500 mailboxes each)

2- To use 2 CAS VMs to balance load

3- To protect each MBX VM databases in case of failure (with a database copy??)

4- (future) To create a replica/failover on another physical distant site.

Questions:

Should I use DAGs even if I am using a cluster and all data will reside on the iSCSI SAN?

What kind of recommendations exist for such scenarios?

I've only seen documents that discuss Hyper-V 2008 and Exchange 2010, and since both products have changed so much, I'd rather ask before I begin setup.

Thanks in advance.

DAG multi-site design with 2 DAG members in one site and one DAG member in the recovery site

Hi,

As the issue description states, I have a DAG design with two MBX servers (DAG members) in the primary site and one MBX server (DAG member) in the recovery site. The FSW is currently in the primary site.

I want to make sure that if, for example, the primary site is no longer reachable, the single DAG member on the recovery site will take over the task of serving mailboxes and that the mailbox stores will be mounted correctly. Should I create an alternate FSW in the recovery site? What is the best procedure?
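
The vote counting behind this question can be sketched in Python. This is only the plain majority rule (more than half of all votes must be reachable). It assumes one vote per DAG member and that, with an odd number of members, the witness does not vote. The function name and scenario labels are illustrative.

```python
def has_quorum(votes_reachable: int, total_votes: int) -> bool:
    """Majority rule: strictly more than half of all votes must be reachable."""
    return votes_reachable > total_votes // 2


TOTAL_VOTES = 3  # two members in the primary site + one in the recovery site

scenarios = {
    "all members up": 3,
    "one primary-site member down": 2,
    "primary site unreachable": 1,
}
for label, votes in scenarios.items():
    print(f"{label}: quorum={has_quorum(votes, TOTAL_VOTES)}")
```

With the primary site gone, the recovery-site member holds 1 of 3 votes, so on vote count alone it would not keep quorum by itself.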

Thanks

Jean D.

Mailbox database gets ContentIndexState AutoSuspended when moved

When I move an active database with "Move-ActiveMailboxDatabase -MountDialOverride:None" in my Exchange 2013 CU1 DAG, the source server's ContentIndex state becomes "AutoSuspended".

If I try "Update-MailboxDatabaseCopy -CatalogOnly", I get:
WARNING: Seeding of content index catalog for database 'DB02' failed. Please verify that the Microsoft Search
(Exchange) and the Host Controller service for Exchange services are running and try the operation again. Error: There
was no endpoint listening at
net.tcp://localhost:3863/Management/SeedingAgent-ABF80EC7-D4D5-4D12-93D8-6B13D8996EC712/Single that could accept the
message. This is often caused by an incorrect address or SOAP action. See InnerException, if present, for more
details..

Restarting the services does not help.
Neither did: http://support.microsoft.com/kb/2807668

The only thing that makes the ContentIndex Healthy is rebooting the server...

Any clues?

Exchange 2010 DAG across sites - different subnets vs stretched VLAN

In a scenario where you have an Exchange 2010 DAG (using Windows 2008 R2 failover clustering) with member servers in two AD sites, where "Site A" is the normally active site and "Site B" is the DR site, does Microsoft actually publish any guidance or recommendations on using separate subnets in the two sites vs. stretching a "Site A" subnet via VLAN?

I think most implementations use separate subnets; however, I'm aware that some have implemented this using a stretched VLAN. It seems to me that the latter approach is more complicated and introduces potential issues during a switchover to "Site B" in a DR event, i.e. the DAG member at "Site B" belongs to a subnet (AD site) from the primary data center while all servers (including domain controllers) at "Site A" are down.

Is anyone aware of any specific guidance or recommendations from Microsoft regarding these two approaches?

Not looking for speculation, just some specifics from Microsoft, thanks. 

Sam

Exchange 2007: how to force a mailbox server to use a transport server in a different AD site

We have two AD sites, each with a mailbox/hub server and a CAS/hub server. The CAS/hub server in each location is set up with the only send connector, so the mailbox/hub server in each location sends all traffic through it and out through the send connector. This currently works and all is well. However, if the hub role fails on the CAS/hub server, mail flow in that site comes to a halt. How can I force the mailbox/hub server to redirect traffic to the CAS/hub server in the other AD site so that mail continues to flow? I tried setting the SubmissionServerOverrideList of the mailbox server and it didn't appear to work.

Exchange logs not flushing after backup

Problem: Exchange logs are not flushing after Backup Exec runs a full backup.

The backup in Backup Exec 2010 R3 is successful. It is also set to do a full backup and flush the logs.

The logs fill up to around 50,000 files or more. How do you fix this?

Environment: Exchange 2010 SP2 Rollup 4 DAG.

I see the following in the logs on the database server when Backup Exec is complete.

Information Store (2972) Mailbox Database Faculty: The backup procedure has been successfully completed.

Information Store (2972) There were 75299 log file(s) not found in the log range (D:\Program Files\Microsoft\Exchange Server\V14\Mailbox\Mailbox Database Faculty\E030003BA9C.log - D:\Program Files\Microsoft\Exchange Server\V14\Mailbox\Mailbox Database Faculty\E030004EA5F.log) that we attempted to truncate.
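
For scale, the two file names in that event can be turned into generation numbers to see how large the truncation range is. A rough sketch in Python, assuming the `E03` prefix is followed by the generation in hexadecimal; `generation_from_path` is a hypothetical helper, not an Exchange tool.

```python
from pathlib import PureWindowsPath

LOG_DIR = r"D:\Program Files\Microsoft\Exchange Server\V14\Mailbox\Mailbox Database Faculty"


def generation_from_path(path: str, prefix_len: int = 3) -> int:
    """Decode the hexadecimal generation from a log file path (prefix assumed to be 3 chars)."""
    stem = PureWindowsPath(path).stem    # e.g. "E030003BA9C"
    return int(stem[prefix_len:], 16)


first = generation_from_path(LOG_DIR + r"\E030003BA9C.log")
last = generation_from_path(LOG_DIR + r"\E030004EA5F.log")
in_range = last - first + 1              # inclusive range of generations
not_found = 75299                        # count reported in the event

print(first, last, in_range)
print(f"share of the range reported missing: {not_found / in_range:.1%}")
```

The range covers 77764 generations (244380 through 322143), so nearly all of the logs the truncation pass looked for were not on disk at that location.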


Minimum rights for backing up Exchange

Hi,

What are the bare minimum rights required to back up an Exchange database? Everywhere I read, I come across KBs saying that the account should be either Organization Admin or, even worse, Domain Admin. I have a really hard time accepting this as the minimum, as it would grant way too much power to a single account used by the backup administrator.

I am certain that one could provision a new security group with just the required cmdlets, but I can't seem to find anything about this.

Does anyone have any insight?

Thanks,

Michel

Copy queue length continues to grow on database copy!

Exchange 2010 SP2 Rollup 4

Just rebuilt a database copy.

The status shows "resynchronizing" but the copy queue length is 510255 and growing.

Eventually the status will go to failed.

How do you force the copy queue length to go down?

The mounted database is healthy and has a clean shutdown state!

DAG Member used for Voting ONLY?

Hey there,

I currently have a 2-member DAG using a File Share Witness quorum. I have already experienced the file share going offline and causing problems for my cluster, resulting in a node failure... This may or may not have been entirely the fault of the FSW going offline, but it did get me thinking about alternative quorum methods.

Would it be wise/problematic/other to have a 3-member DAG (in 1 site) where the 3rd member does NOT hold any database copies and is used for node majority only?

If you can help me understand why this would be a good or bad idea, please do.

P.S. A short, blunt sentence either way is not helpful.

Environment:
Hardware Load Balancer
Exchange 2010 SP3
2x CAS servers in CAS Array
2x MB servers in DAG
WS2008R2
HyperV VMs
iSCSI Storage
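
One way to compare the two designs is to count votes. The Python sketch below uses the plain majority rule and assumes one vote per DAG member plus one for the witness when it is present. It ignores everything else (database copy placement, licensing, patching overhead), and the function names are illustrative.

```python
def majority(total_votes: int) -> int:
    """Smallest number of votes that is more than half of the total."""
    return total_votes // 2 + 1


def tolerated_vote_losses(members: int, witness: bool) -> int:
    """How many votes can be lost while a majority is still reachable."""
    total = members + (1 if witness else 0)
    return total - majority(total)


print(tolerated_vote_losses(2, witness=True))   # 2 members + FSW
print(tolerated_vote_losses(3, witness=False))  # 3 members, node majority
print(tolerated_vote_losses(2, witness=False))  # 2 members with no witness vote
```

Both three-vote layouts survive the loss of exactly one vote. The difference is which component carries the third vote.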


Andrew Huddleston | Hillsong Church | Sydney

Exchange 2013 DAG - High Availability

I am in the process of setting up an Exchange 2013 Cluster in our organisation.

I already have a standalone MBX/CAS server.

I now want to create a DAG, so that we have 2 servers with the Mailbox server role. One server will be active while the other is passive.

Should the active server fail for whatever reason, it should fail over to the passive server.

I did some research on how to set this up and have been unable to decide how best to approach this scenario.

I would like to know: do I just go ahead and create a second MBX/CAS Exchange server, then create a DAG on the two?

Or do I need to do it the way we did in Exchange 2010: 1 CAS server and 2 mailbox database servers?

Any help on this would be appreciated.

This question may sound trivial, but I am not an Exchange guru.

Thanks

Exchange 2013 CAS loadbalancing

Hi

Just a little question to verify whether I have understood things correctly :-)

In 2013, is CAS automatically load balanced via AD for Outlook clients? Or not?
Is it only for OWA, ECP, and ActiveSync devices that I can load balance via either DNS RR, WNLB, or a HWLB?

I have set up a test lab at home with 2 CAS and 2 MB servers, and the DAG works as expected, but I am in doubt about what applies to the CAS, since we don't have a CAS array object anymore. I don't know how Outlook finds the CAS.

BR
Steen

Exchange 2010 Cross-Site DAG

Hi,


I am looking to see if someone can confirm my theory about my Exchange 2010 DAG setup. I am currently building an active/passive setup. It consists of two datacenter sites: site A, the active site, is made up of one mailbox server and two CAS/Hub servers, which are load balanced. Site B, the passive/DR site, is also made up of one mailbox server and two load-balanced CAS/Hub servers.


The idea here is to span a DAG across the two sites and the two mailbox servers. From what I have read, the FSW should be located in the primary site, in this case site A. With the FSW in site A, if there is a WAN issue or a power outage, quorum would be retained there, assuming the servers came back up or didn't go down, while site B would just dismount its database since it can't talk to the FSW and thus cannot hold quorum. But if just the database fails in site A, then in theory it should activate in site B.


Something suggested by a colleague of mine was to locate the FSW in another site completely, in this case site C. The theory is that if site A fails (i.e. power failure/WAN outage), it loses quorum, and because site B is still online and can talk to the FSW, it now holds quorum and the databases would mount. But reading the following Exchange team blog:


http://blogs.technet.com/b/exchange/archive/2011/05/31/exchange-2010-high-availability-misconceptions-addressed.aspx

It seems to suggest that this would not work. Further to that, I was also reading up on DAC mode and came across the following article:


http://www.exchangeinbox.com/article.aspx?i=172


After reading it, I am starting to think that I was misinterpreting how my setup would behave, and that because the two Exchange setups are in completely different sites (i.e. in Active Directory and in physical location), the automatic switchover in the event of a full site failure that we were looking to achieve would in fact not happen. That would then lead us to have to perform a datacenter switchover as outlined in countless other TechNet articles and blogs posted online.


In the end, is my conclusion correct? Because of the two-site setup, would the whole “automatic” failover not occur regardless of the placement of the FSW server? And does automatic failover only apply to mailbox servers within the same site?
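
The quorum part of this reasoning can be tabulated with a short Python sketch. It counts three votes (one mailbox server per site plus the FSW) and checks whether the votes outside a failed site still form a majority. The names `MBX-A` and `MBX-B` are placeholders, and the model covers vote counting only.

```python
# One mailbox server per site; names are placeholders for illustration.
VOTERS = {"MBX-A": "A", "MBX-B": "B"}


def votes_left(fsw_site: str, failed_site: str) -> int:
    """Count the votes that are not located in the failed site."""
    voters = dict(VOTERS, FSW=fsw_site)
    return sum(1 for site in voters.values() if site != failed_site)


def keeps_quorum(fsw_site: str, failed_site: str) -> bool:
    """True if the surviving votes are more than half of all three votes."""
    total = len(VOTERS) + 1  # two mailbox servers + the witness
    return votes_left(fsw_site, failed_site) > total // 2


for fsw_site in "ABC":
    for failed_site in "AB":
        result = keeps_quorum(fsw_site, failed_site)
        print(f"FSW in {fsw_site}, site {failed_site} fails: quorum kept = {result}")
```

This only shows whether the remaining cluster votes form a majority. It does not model DAC mode, client access and transport failover, or whether a third-site witness is a supported placement, which is what the linked articles discuss.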





Cluster issue Exchange 2010 SP3

All,

I have an issue when bringing the cluster resource online.

Error below:

Cluster IP address resource 'IPv4 Static Address 2 (Cluster Group)' cannot be brought online because the cluster network 'Cluster Network 2' is not configured to allow client access. Please use the Failover Cluster Manager snap-in to check the configured properties of the cluster network.

I select "Allow clients to connect through this network", but it is automatically unticked again. I am thinking of changing the Role value from 1 to 3.

Before doing it via the registry, is there any other way to resolve it?



Exchange SCR target lost disk. How to recover?

I have a 250 GB Exchange 2007 mailbox that has an offsite SCR target. Replication was healthy until yesterday, when I lost the iSCSI connection to the target's disk. I have it back now, but Test-ReplicationHealth states that log files are missing and that I need to use the Update-StorageGroupCopy cmdlet.

Is there a way to do this without clearing out the entire target and reseeding over the WAN?
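
To put a number on why a full reseed over the WAN is unattractive, here is a back-of-the-envelope estimate in Python. The link speeds are hypothetical (the post does not give the WAN bandwidth), 250 GB is taken as 250 × 10^9 bytes, and protocol overhead and compression are ignored.

```python
def transfer_hours(size_gb: float, link_mbps: float) -> float:
    """Hours needed to move size_gb (decimal gigabytes) over a link of link_mbps megabits/s."""
    bits = size_gb * 1e9 * 8
    seconds = bits / (link_mbps * 1e6)
    return seconds / 3600


# Hypothetical WAN speeds, for illustration only.
for mbps in (10, 50, 100):
    print(f"{mbps} Mbit/s: about {transfer_hours(250, mbps):.1f} h for 250 GB")
```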

Server Crash Restore EDB Mailboxes

OK, so my server at a well-known web host died for the second time. As they continue to refuse to replace the faulty hard drive, I've moved to a new host and got the new server up and running.

The backup catalogue of the old server was corrupt, so I can't restore the backup. I was lucky enough to grab the Exchange DB and logs and have managed to mount it on the new server. Whilst the new server is in the same domain, AD obviously sees things with different GUIDs etc.

I have managed to get all users back online with email etc. However, I need to recover their old email, one mailbox in particular. With the database mounted and all healthy, how can I get Exchange 2013 to extract the mailboxes from the old mounted EDB, either by copying the mailbox or by exporting it as a PST? The mailboxes are not disconnected or deleted; they are in effect hanging in space. I've tried New-MailboxExportRequest (or whatever it is called), but that only exports the new mailbox from the new EDB.

I am trying to do this for one mailbox only, so I do not want to invest in 3rd-party tools.

Exchange 5.5 not starting

I have Exchange 5.5 running on an NT 4.0 machine. Exchange is not starting up, and in the event log I am getting Event ID 124: (263) Unable to read the log header. Error -1022.

I do not have a good backup.

Thank you,

Matthew

Event ID 2038 on Exchange 2007 CCR

Hi all,

I encounter this problem every night after my scheduled backup (we are using wbadmin.exe) has completed. It appears 7 times in the application log, which coincides with the number of storage groups I have. Naturally, the logs are not truncated and are therefore filling up my drive.

It all began after we rebuilt our passive node Exchange 2007 server. We are running on W2008 Ent 64-bit with SP2, and Exchange 2007 is also on SP2. The same configuration also applies to the active node of the CCR configuration.


Any advice on how to get rid of this error?

Thanks

Multi Site DAG (Can't add 2nd Node)

Hi,

I'm trying to create an Exchange 2013 DAG, but can't add the second node. I'm using a single NIC in both servers. I know it's not the best solution, but it is supported.

One server is a VM on VMware, the other is a VM on Hyper-V. Could this be the problem, since the virtual NICs are different?

I've looked at everything I can find online. I've tried to add the 2nd node through the Failover Cluster Manager, but no luck.

Can someone help me?

Thanks

JP

I get the following log file:

[2013-07-23T22:17:44] Working
[2013-07-23T22:17:44] GetRemoteCluster() for the mailbox server failed with exception = An Active Manager operation failed. Error: An error occurred while attempting a cluster operation. Error: Cluster API '"OpenCluster(EXCH13FIG01.private.euroatlantic.pt) failed with 0x6d9. Error: There are no more endpoints available from the endpoint mapper"' failed.. This is OK.
[2013-07-23T22:17:44] Ignoring previous error, as it is acceptable if the cluster does not exist yet.
[2013-07-23T22:17:44] DumpClusterTopology: Opening remote cluster EXCH13DAG01.
[2013-07-23T22:17:45] Dumping the cluster by connecting to: EXCH13DAG01.
[2013-07-23T22:17:45] The cluster's name is: EXCH13DAG01.
[2013-07-23T22:17:45] Groups
[2013-07-23T22:17:45]     group: Available Storage [not a CMS]
[2013-07-23T22:17:45]         OwnerNode: EXCH13BLR01.private.euroatlantic.pt
[2013-07-23T22:17:45]         State: Offline
[2013-07-23T22:17:45]     group: Cluster Group [Cluster Main Group]
[2013-07-23T22:17:45]         OwnerNode: EXCH13BLR01.private.euroatlantic.pt
[2013-07-23T22:17:45]         State: Online
[2013-07-23T22:17:45]             Resource: Cluster IP Address [Online, type = IP Address, PossibleOwners = EXCH13BLR01 ]
[2013-07-23T22:17:45]                 Address = [10.0.0.27]
[2013-07-23T22:17:45]                     EnableDhcp = [0]
[2013-07-23T22:17:45]                     Network = [Cluster Network 1]
[2013-07-23T22:17:45]             Resource: Cluster Name [Online, type = Network Name, PossibleOwners = EXCH13BLR01 ]
[2013-07-23T22:17:45]                 NetName = [EXCH13DAG01]
[2013-07-23T22:17:45] Nodes
[2013-07-23T22:17:45]     node: EXCH13BLR01.private.euroatlantic.pt [ state = Up ]
[2013-07-23T22:17:45] Subnets
[2013-07-23T22:17:45]     Name(Cluster Network 1), Mask(10.0.0.0/23), Role(ClusterNetworkRoleInternalAndClient)
[2013-07-23T22:17:45]         NIC 10.0.0.9 on Node EXCH13BLR01 in State=Up
[2013-07-23T22:17:45] Opening the cluster on nodes [exch13blr01].
[2013-07-23T22:17:45] Other mailbox servers in the DAG are already members of cluster 'EXCH13DAG01'
[2013-07-23T22:17:45] The server EXCH13FIG01 does not belong to a cluster, and the other servers belong to EXCH13DAG01.
[2013-07-23T22:17:45] Successfully resolved the servers based on the stopped servers list.
[2013-07-23T22:17:45] The following servers are in the StartedServers list (The list is the StartedServers property of the DAG in AD):
[2013-07-23T22:17:45] The following servers are in the StoppedServers list:
[2013-07-23T22:17:45] Verifiying that the members of database availability group 'EXCH13DAG01' are also members of the cluster.
[2013-07-23T22:17:45] Verifying that the members of cluster 'EXCH13DAG01' are also members of the database availability group.
[2013-07-23T22:17:45] According to GetNodeClusterState(), the server EXCH13FIG01 is NotConfigured.
[2013-07-23T22:17:45] The CNO is currently Online.
[2013-07-23T22:17:45] InternalValidate() done.
[2013-07-23T22:17:47] Updated Progress 'Adding server 'EXCH13FIG01' to database availability group 'EXCH13DAG01'.' 6%.
[2013-07-23T22:17:47] Working
[2013-07-23T22:17:47] Updated Progress 'Adding server 'EXCH13FIG01' to the cluster.' 8%.
[2013-07-23T22:17:47] Working
[2013-07-23T22:24:07] The operation wasn't successful because an error was encountered. You may find more details in log file "C:\ExchangeSetupLogs\DagTasks\dagtask_2013-07-23_22-17-37.824_add-databaseavailabiltygroupserver.log".
[2013-07-23T22:24:07] WriteError! Exception = Microsoft.Exchange.Cluster.Replay.DagTaskOperationFailedException: A server-side database availability group administrative operation failed. Error The operation failed. CreateCluster errors may result from incorrectly configured static addresses. Error: An error occurred while attempting a cluster operation. Error: Cluster API '"AddClusterNode() (MaxPercentage=100) failed with 0x5b4. Error: This operation returned because the timeout period expired"' failed.. ---> Microsoft.Exchange.Cluster.Replay.AmClusterApiException: An Active Manager operation failed. Error: An error occurred while attempting a cluster operation. Error: Cluster API '"AddClusterNode() (MaxPercentage=100) failed with 0x5b4. Error: This operation returned because the timeout period expired"' failed. ---> System.ComponentModel.Win32Exception: This operation returned because the timeout period expired
   --- End of inner exception stack trace ---
   at Microsoft.Exchange.Cluster.ClusApi.AmCluster.AddNodeToCluster(AmServerName nodeName, IClusterSetupProgress setupProgress, IntPtr context, Exception& errorException, Boolean throwExceptionOnFailure)
   at Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNodeInternal(HaTaskStringBuilderOutputHelper output, AmServerName mailboxServerName, String& verboseLog)
   --- End of inner exception stack trace (Microsoft.Exchange.Cluster.Replay.AmClusterApiException) ---
   at Microsoft.Exchange.Cluster.Replay.DagHelper.ThrowDagTaskOperationWrapper(Exception exception)
   at Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNodeInternal(HaTaskStringBuilderOutputHelper output, AmServerName mailboxServerName, String& verboseLog)
   at Microsoft.Exchange.Cluster.Replay.NodeActionTracker.PerformNodeAction(ITaskOutputHelper output, AmServerName nodeName, NodeAction nodeAction, Action clusterAction)
   at Microsoft.Exchange.Cluster.Replay.DagHelper.AddDagClusterNode(AmServerName mailboxServerName, String& verboseLog)
   at Microsoft.Exchange.Data.Storage.Cluster.HaRpcExceptionWrapperBase`2.RunRpcServerOperation(String databaseName, RpcServerOperation rpcOperation)
   --- End of stack trace on server (EXCH13BLR01.private.euroatlantic.pt) ---
   at Microsoft.Exchange.Data.Storage.Cluster.HaRpcExceptionWrapperBase`2.ClientRethrowIfFailed(String databaseName, String serverName, RpcErrorExceptionInfo errorInfo)
   at Microsoft.Exchange.Cluster.Replay.ReplayRpcClientWrapper.RunAddNodeToCluster(AmServerName serverName, AmServerName newNode, String& verboseLog)
   at Microsoft.Exchange.Management.SystemConfigurationTasks.AddDatabaseAvailabilityGroupServer.JoinNodeToCluster()
[2013-07-23T22:24:16] Updated Progress 'Done!' 100%.
[2013-07-23T22:24:16] COMPLETED
add-databaseavailabiltygroupserver explicitly called CloseTempLogFile().
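
Two things can be read out of this log mechanically: how long the add-node step ran before it failed, and what the hexadecimal error codes are in decimal. A minimal Python sketch follows. The three strings are shortened excerpts of lines from the log above (timestamps and error fragments copied, surrounding text trimmed), and the regular expressions are assumptions about this log's layout rather than a documented format.

```python
import re
from datetime import datetime

# Shortened excerpts from the DAG task log above.
LOG_EXCERPTS = [
    "[2013-07-23T22:17:44] OpenCluster(EXCH13FIG01.private.euroatlantic.pt) failed with 0x6d9.",
    "[2013-07-23T22:17:47] Updated Progress 'Adding server 'EXCH13FIG01' to the cluster.' 8%.",
    "[2013-07-23T22:24:07] AddClusterNode() (MaxPercentage=100) failed with 0x5b4.",
]

STAMP = re.compile(r"^\[(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})\]")
HEX_CODE = re.compile(r"failed with (0x[0-9a-fA-F]+)")


def stamp(line: str) -> datetime:
    """Parse the bracketed timestamp at the start of a log line."""
    return datetime.strptime(STAMP.match(line).group(1), "%Y-%m-%dT%H:%M:%S")


started = stamp(LOG_EXCERPTS[1])   # "Adding server ... to the cluster"
failed = stamp(LOG_EXCERPTS[2])    # AddClusterNode() error
elapsed = (failed - started).total_seconds()
print(f"add-node step ran for {elapsed:.0f} s before failing")

# Map each hexadecimal code in the excerpts to its decimal value.
codes = {
    code: int(code, 16)
    for line in LOG_EXCERPTS
    for code in HEX_CODE.findall(line)
}
print(codes)
```

The step ran for 380 seconds (22:17:47 to 22:24:07). 0x5b4 is decimal 1460, the Win32 timeout error whose text appears in the exception, and 0x6d9 is decimal 1753, the endpoint mapper error the log itself marks as acceptable at that stage.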
