August 2005 - Posts

Microsoft Cluster Summit

Yep, this is a commercial...

Rodney R. Fournier and I will be in NYC enjoying the life of teaching others all about Microsoft clustering. We will cover NLB and server clustering with labs on installing SQL and Exchange server clusters.

So, if you have 4 days to spare and can get to NYC, you should sign up. See www.clusterhelp.com for more information. Check out the Schedule link to see how to sign up.

Please return to your regularly scheduled newsfeeds.

Update: Four days of cluster training can lead to bleeding from the ears. Several students left looking like they had been stunned by percussion grenades when the class was over, but all of them said it was the most useful class they have ever taken. It looks like Rod and I are on the right track.

Posted by cluster

Exchange Q&A with David Elfassy

David Elfassy IM'd me yesterday and asked me to play a game of word association. I hate word associations. It seems that word associations always lead to people thinking I am insane, and I get locked in padded rooms.
 
Anyways, he said it would really help him with some Exchange materials that he is working on, so I agreed.
 
David: SAN optimization. (First word that comes to your mind)
Russ: Banana (I was hungry and was reaching for a banana)
 
David: You *&(%$__*(&*(@#** - (wow, that was rude)  :)
Russ: Cheerios (what else do you say to that?)
 
Note: at this point, I could already hear the ambulance and the guys in white suits coming to take me away.
 
David: SAN optimization.
Russ: diskpar, multipathing (see the link for Rod Fournier's diskpar blog, and see  on the Exchange team blog on the same subject here)
 
David: SAN availability
Russ: Multipathing, SAN replication, geo clustering
 
David: Transaction log performance on SAN's
Russ: disable write back cache for the LUN if possible, provide a separate LUN, RAID 1 within the SAN
Note: David asked me to explain why I did not say anything about RAID 10 or 0+1. I told him that I can't see throwing that much at a transaction log unless you really have huge amounts of transactions that require scaling up the size of the log space to hold a large size.
David: iSCSI for Exchange
Russ: Interesting solution, not near the I/O achievable via a SAN with multipathing, though. Must use a dedicated gig/e network to achieve best performance.
 
David: WIndows Storage Server with NAS for small orgs
Russ: Good solution depending on size of org. Again, not near the I/O of an inexpensive SAN device and should again have a dedicated gig/e network for the connection.
Posted by cluster
Filed under: , ,

Unicast vs. Multicast - Original Posted Feb 21, 2005

As usual, confusion motivates me to blog some more. In this case, I have blogged this because I was confused, and I am pretty sure that I have it straight now. Comments may prove me wrong.

When designing, planning, testing, and implementing Network Load Balancing (NLB) Clustering, a choice has to be made regarding unicast vs. multicast. There are a few differences, but the main difference is in the way MAC addresses are implemented.

Unicast - Each NLB cluster node replaces its real (hard coded) MAC address with a new one (generated by the NLB software) and each node in the NLB cluster uses the same (virtual) MAC. Because of this virtual MAC being used by multiple computers, a switch is not able to learn the port for the virtual NLB cluster MAC and is forced to send the packets destined for the NLB MAC to all ports of a switch to make sure packets get to the right destination.

So, basically, the way NLB traffic is handled is kind of like this:

1. An inbound packet for IP address w.x.y.z (NLB Virtual IP) arrives
2. The ARP request is generated and is sent across all ports of the switch since there is no mapping at this point
3. All of the NLB cluster nodes respond with the same MAC
4. The switch sends the traffic to all ports because it is not able to tell which is the proper port and this leads to switch flooding

If an NLB cluster node is using unicast, NLB isn't able to tell each node apart as they all have the same MAC. Since each NLB cluster node has the same MAC, communication between NLB cluster nodes is not possible unless each NLB cluster node has an additional NIC with a unique MAC.

Multicast - NLB adds a layer 2 MAC address to the NIC of each node. Each NLB cluster node basically has two MAC addresses, its real one and its NLB generated address. With multicast, you can create static entries in the switch so that it sends the packets only to members of the NLB cluster. Mapping the address to the ports being used by the NLB cluster stops all ports from being flooded. Only the mapped ports will receive the the packets for the NLB cluster instead of all ports in the switch. If you don't create the static entries, it will cause switch flooding just like in unicast.

Flooding Solutions:

  1. Hook all NLB devices to a hub and then connect it to a port on the switch. Since all NLB nodes with the same MAC come through the same port, there is no switch port flooding.
  2. Configure a VLAN for all NLB cluster nodes to contain all NLB cluster traffic to just the VLAN and not run it over the entire switch.
  3. Use multicast and configure static mapping for the NLB cluster nodes in the switch so it only floods the mapped ports instead of the entire switch.
  4. Use port mirroring so that all ports involved in the NLB cluster mirror each other.
Posted by cluster

Costs of High Availability - Clustering Windows Server 2003 - Original Posted Jul 29, 2005

NOTE: For anyone looking for an actual cost, sorry, there isn't anything in this blog entry about the actual dollars needed.
 
I am having a flash back today, it must be the new medication. :)
 
The costs of HA seems to be a normal topic of discussion when a company looks into clustering and has sticker shock. I can't stress enough that clustering is not the end-all solution. Please do a quick read on my blog about my HA definition.
 
I was just talking to a client about how much clustering costs and how much the services cost to implement clustering. Yep, it isn't the same as just installing a standard server and multiplying it times the number of nodes. Servers with large hard drives, lots of RAM and multiple processors have come down a great deal in the last couple of years. What used to be about the cost of a 700 series BMW is now about the cost of a Chrysler 300. Really. However, when you start talking about HA, you have much more than the costs of individual nodes in a cluster.
 
The main cost issue with clustering is the cost of the additional components that are needed above and beyond the nodes themselves. For example, I keep hearing the term "disk is cheap" bandied (I love that word) about in meetings. It isn't true in all cases. Yes, a large hard drive is not that expensive. A LUN on a high-end SAN is expensive. It is even more expensive when you consider the initial costs of building the infrastructure to host that LUN.
 
OK, so back to the discussion of cost. Yes, clustering is costly, because it requires:
  • Windows Server 2003, Enterprise Edition which costs a good bit more than Standard Edition
  • Host Bus Adapters (two per server for redundancy) for the fiber fabric (yes, there are other less costly alternatives, but let's stick to mainstream right now) and the software for the HBAs
  • Fiber switches
  • SAN devices (or NAS depending on the certification of the hardware)
  • Experienced administrators (if you want it done right) to design and configure it
  • A 24/7 team for maintaining it (remember HA is not just clustering)
  • Significant documentation (in case the administrator gets hit by a bus)
  • Tried and tested processes
To achieve High Availability, an organization must implement well defiined, planned, tested, and implemented processes, software, and fault tolerant hardware. The focus is application availability. Yes, this costs money.
 
My favorite sales person used to use this phrase a great deal when we would talk to clients and potential clients about HA, "How much does it cost for the application to be down?" If it doesn't cost much, implementing clustering and instilling an HA attitude just might now be worth it. If they say it costs a fortune, then the response is simple, "if it costs you so much to be down, why are you sweating this relatively small amount to do the best job possible of keeping it up?"
 
I hate to think about how many organizations out there are gambling (yep, that is what it is) with their IT assets and the businesses that run on them. If your company will go out of business if an application fails, don't you owe it to the owners to protect that application?
 
Posted by cluster

Can't Send or Receive Email - Original Posted Apr 14, 2005

One of the most common posts when it comes to Exchange is a request for help troubleshooting Internet email traffic. This really isn't that hard as it is almost always one of a few issues that are very easy to fix.

Can't send email to the Internet - There are a few simple and easy steps to help identify the problem. It is almost always DNS or port 25 is blocked.

Log onto the Exchange server

Open a command prompt (no, it is not a DOS prompt) and type nslookup and press enter. You will then connect to a DNS server which will be shown along with its IP address. This is the DNS server that your Exchange server is configured to use because somebody configured the server to point to it. If it doesn't work, then you found your problem. Here is what it should look like (or at least look similar):

C:\>nslookup
Default Server:  h-64-105-152-186.dnvtco56.covad.net
Address:  64.105.152.186

> set type=mx
> microsoft.com
Server:  h-64-105-152-186.dnvtco56.covad.net
Address:  64.105.152.186

Non-authoritative answer:
microsoft.com   MX preference = 10, mail exchanger = mailc.microsoft.com
microsoft.com   MX preference = 10, mail exchanger = maila.microsoft.com
microsoft.com   MX preference = 10, mail exchanger = mailb.microsoft.com

microsoft.com   nameserver = ns1.msft.net
microsoft.com   nameserver = ns2.msft.net
microsoft.com   nameserver = ns3.msft.net
microsoft.com   nameserver = ns4.msft.net
microsoft.com   nameserver = ns5.msft.net
mailc.microsoft.com     internet address = 207.46.121.53
mailc.microsoft.com     internet address = 207.46.121.52
maila.microsoft.com     internet address = 131.107.3.125
maila.microsoft.com     internet address = 131.107.3.124
mailb.microsoft.com     internet address = 131.107.3.123
mailb.microsoft.com     internet address = 207.46.121.51
ns1.msft.net    internet address = 207.46.245.230
ns2.msft.net    internet address = 64.4.25.30
ns3.msft.net    internet address = 213.199.144.151
ns4.msft.net    internet address = 207.46.66.75
ns5.msft.net    internet address = 207.46.138.20

The writing in bold is what you should type. The information returned shows that Microsoft has three MX records (maila, mailb, and mailc) and each of the records point to multiple IP address (doh! it is a total of six actual server sharing the load) where Microsoft servers can accept mail.

After you have verified that your DNS works for outbound email, next you need to test that your can communicate from your server to systems on the Internet using port 25 (the port used by SMTP). Use telnet on port 25 to test connectivity.

C:\>telnet maila.microsoft.com 25

Again, type the bold text at the command prompt and if everything works correctly, you should receive a response like this one from the remote server:

220 IGS-IMC-03.northamerica.corp.microsoft.com <Inbound SMTP Virtual Server> Thu, 14 Apr 2005 20:55:51 -0700

This shows that the remote mail server received your connection attempt and responded. Guess what that means? Yes, good guess if you said that it means that Exchange can see and connect to remote systems without anything blocking any ports (you are also right if you said that I like my eggs with bacon).

Yep, you just solved 90% of all outgoing email problems if the DNS resolution step failed (if it failed, fix it - duh!) or the connection failed using port 25 (again, fix it if it failed).

 You can fully test that the destination received your email by continuing with the following steps:

 After connecting using telnet to port 25, you can perform the following steps at the magic blinking cursor:

  1. Type HELO or EHLO (EHLO is for the enhanced version of SMTP) 
  2. Type MAIL FROM: yourname@YourDomain.com 
  3. You should get a response message of "250 ok"
  4. Type RCPT TO: someotherguy@TheirDomain.com
  5. You should get a response message of "250 ok"
  6. Enter your subject info by typing Subject: Test Message (or whatever subject you want)
  7. Type DATA
  8. Enter your message
  9. Continue entering your message with multiple lines if needed
  10. End the message by put a period on a line by itself and hit enter one last time  

If all goes well, the recipient on the remote email server should get the message as if it were sent by an email client.

Can't receive email from the Internet - There are a few simple and easy steps to help identify the problem.

From another system on the Internet, verify that remote systems are able to find your MX records for your company using the following:

C:\>nslookup
Default Server:  h-64-105-152-186.dnvtco56.covad.net
Address:  64.105.152.186

> set type=mx
> yourcomanyname.com
Server:  h-64-105-152-186.dnvtco56.covad.net
Address:  64.105.152.186

This should provide a response with the name and IP address of your email server. If it doesn't, that that should tell you that I also like sausage with my eggs (and that your DNS server is either not properly configured on the Internet with proper MX records or DNS records for your company are not available at all.

If this does work, then you should (from a remote system on the Internet) try to telnet to your server on port 25 by typing:

C:\>telnet hostname.yourcompany.com 25

If you get a nice response like this one,

220 hostname.yourcompany.com Microsoft ESMTP MAIL Service, Version: 6.0.3790.211 ready at  Thu, 14 Apr 2005 22:40:02 -0600

then others can connect to your Exchange server without any problems. If it doesn't work, then you probably have some firewall or router problems that need to be fixed.

Yes, it really is that simple. Remember SMTP stands for simple mail transfer protocol.

 

Posted by cluster
Filed under:

Clustering is not the solution all the time - Original Posted Feb 8, 2005

Everyone repeat after me, "I will not waste money trying to cluster every freaking service offered by Microsoft servers."

I just feel the need to scream this out loud today. That there are often simpler and easier ways to provide redundancy for some services than using Microsoft server clusters. A couple of quick examples (please write these down) include:

  • Domain Controllers (and Global Catalog servers, too) - It is simple. Install more than one. If one goes down, you still have more. Yes, users can log into a different DC. Yes, users can even log into a different DC in a different physical location. FSMO roles can be seized if the holder of the roles falls out of the server rack and catches on fire.
  • DHCP - THere are lots of great articles on how to split DHCP scopes among more than one server so if one server fails, clients can still get IP addresses.
  • WINS - Think: Primary and Secondary. If the secondary fails, nobody will care or notice (unless it is engulfed in flames). If the primary fails, systems will start using the secondary.
  • DNS - uummmm, use Active Directory integrated DNS. It doesn't get much more HA than that.
  • Yes, the list goes on and on, but here are some common ones that I see that just make my blood boil

You don't need multiple nodes and a SAN to provide redundancy all the time. Remember, your company has other needs than have 8 way SMP WINS clusters with 4 TB of shared storage. Please, save a little money and please use some common sense.

Posted by cluster

Exchange Cluster - Is Active/Active worth it? - Original Posted Feb 5, 2005

This is kind of a regurgitation of a couple of threads on the microsoft.public.exchange.clustering newsgroup. In the threads, there are questions regarding the whole Active/Active issue. Several people, including a couple of good friends and a couple of top-notch Microsofties pointed out the evils of Active/Active. To be clear, Microsoft supports A/A for Exchange but does not recommend it. Best practices are developed based on the experiences of Microsoft's internal usage (often referred to as eating their own dog food), the early deployment programs, and through trouble reports and the experiences of many customers as reported and tracked though PSS.

Over the years, I have explained to my students that Active/Passive is the best practice when it comes to clustering Exchange. Almost always a student will protest stating that their managers and others don't want a wasted node so they want to know why A/A is such a problem. I point out that the store.exe is well known for sucking up all the RAM it can get. So, if you have two servers (node1 and node2) both running store.exe and consuming a very large amount of RAM on each node, then you can expect problems with the failover of one resource hog to a node where another resource hog lives. According to all of the literature, the store.exe on the surviving node should give up enough memory for the store.exe on the failing node to exist along with it as both store.exe's will basically drain down (this is a real high level summary and not the term normally used, but I think it helps to understand what is happening) so they will both have smaller memory footprints and can coexist. In practice, this process is less than smooth. Another concern that is well documented is that if both Exchange Virtual Servers (EVSs) life on the same node, their stores and storage groups add together and apply to constraints in Exchange. For example, if EVS1 has three storage groups and EVS2 has three storage groups, when you combine them, they exceed the limits for Exchange (a max of 4 storage groups) and they both will not function on the same node.

Anyways, the issue in this discussion was around performance. With two active nodes, their memory, their CPUs, and their disk spindles should (according to some basic logic) provide better overall performance than one active node with the same resources. At first glance this makes a great deal of sense.

When you dig deeper, this common sense stops making sense. Wow, did I just type that? Try to follow me here (it should be easy, I am a pretty big guy).

According to Microsoft, if you use Exchange in a best practice configuration, you should manage the resource consumption so that you don't exceed 80% of CPU. This is for a single server. If you consider that two nodes are active in an A/A cluster, and since there is a need to failover to a single node, then in order to maintain best practice configurations each node should be only utilized up to 40% of CPU utilization. This is basic math in that 40+40=80. This is discussed in 815180 here http://support.microsoft.com/default.aspx?scid=kb;en-us;815180&product=exch2003. This article also discusses the limit of 1,900 concurrent users per node. This article, however, doesn't address the added scalability of multiple server backplanes, multiple fiber adapters, and multiple spindles. So, the argument then becomes, do you really get enough benefit out of the additional I/O provided with an A/A cluster while still strictly limiting CPU? I would probably go so far as to say yes, but only because it is very clear from all of the Exchange work that I have done that the disk I/O is the limiting factor for higher performance.

So to summarize the arguments/discussions:

Pro A/A

  • Provides greater disk I/O, but this is assuming that you would not use the same number of spindles for a similar A/P configuration. I am not sure if this is a fair assumption. I can say from experience, it is easier to ask for more spindles from the storage group when you have multiple active nodes. I don't feel that most organizations would find using the increased number of spindles for an A/P configuration to be within reason.
  • Provides more RAM for two store.exe's when not in a failed state which results in better performance.
  • Provides greater throughput using additional HBAs and additional server backplanes when not in a failed state which results in better performance.

Pro A/P

  • Provides the same CPU as an A/A based on the 40/40 rule previously discussed.
  • Provides the same performance whether in a failed state or not.
  • Is a best practice, and is the recommended configuration.

There are other issues to consider, for example:

  • Does the inter-Exchange messaging (email from node1 to node2) and the loss of single instance storage override any performance gains from A/A?
  • With two A/A nodes fully subscribed at 40% of CPU, are they hitting I/O bottle necks and thus utilizing the additional RAM, additional spindles, and the greater backplane bandwidth?
  • Is there a tendency to over subscribe the two A/A nodes in most organizations so they are well over 40% CPU utilization?
  • Is there a tendency to over subscribe a single node in A/P as well?

The reason I bring up this whole topic in this blog entry is that the A/A vs A/P issue really isn't as cut and dried as many of us would like to believe.

However, it is strongly discouraged for a reason, and those reasons are because of HA impacts. Therefore, Active-Active is evil.  :)

Posted by cluster
Filed under:

OK, now I am ready to start fresh

I finally decided that MSN Spaces was too structured for me, and it has has serious performance problems for me. So, I just spend the last 30+ minutes copying all of my old posts from MSN Spaces to here.

OK, no, I didn't copy them all. I left some of the personal ones, like my cigar ones, on the old blog and did not bring them here.

I think, for the near future, I will maintain both blogs, but will keep my personal bits of stuff over on MSN and try to keep this blog focused on professional stuff only.

Posted by cluster

E-mail Reputation Score - Original Posted Aug 3, 2005

From the MS Exchange Blog post by Chris Meirick...
 
You can send email to reputation@ironport.com and receive a reputation score that evaluates the possibility that your email is spam. I highly recommend that you read Chris' blog entry for more information about Ironport and his evaluation of the product.
 
Based on his evaluation, I have to say that Ironport looks like a good product and it should be strongly considered for email filtering.
Posted by cluster
Filed under:

Wish: New Exchange 12 Feature - Original Posted Jul 7, 2005

I want to make this clear, this is just a wish that I have. There is nothing, that I am aware of, in the works to make my silly dream come true.
 
Scenario: Company has Exchange Server running with Outlook 2003 cached mode clients. The Exchange Server database becomes corrupted (yes, this does happen in the real world) and needs to be restored.
 
My dream solution: You delete the existing .stm and .edb files. You right-click the properties of the storage group, you can then select the previous store name and select the option "Rebuild Store from Cached Clients" and the Exchange creates and mounts a new store with the previous name. The Exchange Server then sucks all of the messaging data, all free-busy info, and all resource info (after all, resources are called out in meeting requests) out of the Outlook 2003 .ost files on the client computers and rebuilds as much of the information as it can in the new store.
 
Yep, a reverse switch for the cache mode. Instead of the clients synching with the server, the server synchs with the clients.
 
Somebody, make that happen, please. :)
Posted by cluster
Filed under:

Exchange Server 2003 /3GB and /USERVA=3030 - Original Posted Jul 13, 2005

One more time...
 
I am sure it has been posted before, but it doesn't seem to be getting out there to everyone. Hopefully this will hit at least one person that hasn't read it.
 
There are many documents out there that say if your Exchange Server 2003 server has 1 GB of RAM or more, you should edit your boot.ini to include the /3GB and /USERVA=3030 switches in the boot configuration. What seems to get missed is that you should only do this if the Exchange server is a:
  • Mailbox Server
  • Public Folder Server
  • Connector Bridgehead (MTA, x.400, GroupWise, Notes, etc)
  • SMTP Gateway/Bridgehead (only when using Envelope Journaling - otherwise don't use the switches)
You should NOT use these switches if the Exchange server is a:
  • Front End Server
  • SMTP Gateway/Bridgehead (see exception above)
What it really comes down to is that the store.exe benefits from these switches, and Front End and SMTP Gateway/Bridgehead servers don't utilize the store.exe. The exception is when using Envelope Journaling because it does use the store.exe.
 
In all cases, you should NOT use these switches if your server has less than 1GB of RAM.
 
For more information on the /3GB and /USERVA switches, refer to KB 823440 and KB 810371.
Posted by cluster
Filed under:

MOM 2005 and SQL Server 2000 SP4 - Original Posted Jun 27, 2005

What in the world was Microsoft thinking?
 
I was installing MOM 2005 and found out that it is not possible to install the MOM 2005 Database on a SQL Server 2000 server with SP4. WTF? That just doesn't make sense.
 
Supposedly this will be fully supported with MOM 2005 SP1. Umm, that just doesn't work for me. So, time to research. Thanks to Neil Hobson, a MOM MVP, a workaround has been published in the MOM 2005 public newsgroup. It is a registry hack, but at least it is a supported registry hack. What needs to be done is that you have to alter the CSDVersion entry located at:

HKEY_Local_Machine\SOFTWARE\Microsoft\MSSQLServer\MSSQLServer\CurrentVersion

If SP4 is installed, the version number will be 8.00.2039. To get the MOM 2005 Database to install, you need to change the version number to 8.00.761 which is the same one for SP3A and then change it back again after the installation is completed.
 
Somebody needs to be slapped over this one.
 
Updated 7/28/05 - Microsoft provides another workaround for this same problem. You can use the msiexec command with the PREREQ_COMPLETED=1 switch to skip the MOM Prerequisite Checker. See KB 902803 for more information.
Posted by cluster
Filed under: ,

MOM 2005 and Firewalls - Original Posted Jun 30, 2005

I had to laugh at myself yesterday. It just kills me how a simple search will save you tons of effort and reading...
 
I am installing MOM 2005 as part of my testing. I ran the discovery and tried to deploy agents to all the computers on my network. Most of the agents installed perfectly. However, I notice that all of the ones that didn't install are Windows XP computers with Service Pack 2 (no, I am not really going to try to manage them, but I thought it would be interesting to see their data as I get fully up to speed on MOM 2005). Hmmm, interesting. So, I put it aside and figure I will attack it later.
 
As I am reading through tons of whitepapers, I see the process for manually installing agents using the MOM 2005 install media. Yep, just like MOM 2001, so this is nothing new to me. I am thinking to myself that I will probably need to do a manual install of the agents on a couple of the XP computers that failed and test them out. As I am thinking about this in the back of my mind and reading further, I notice that one of the reasons for doing a manual install of the agents from media is because of firewall issues where the MOM 2005 Managment Server can't access the computer to install agents because port 1270 needs to be open.
 
Wait... I am adding in my head, 2 + 2 = 4... There it is, I got it... Firewall issue... XP SP2... yes, there it is, I got it. SP2 for XP installs the firewall by default. I disable the firewall, and all is well. Of course, I could have just created the exception rule, too. I will do that later. I am pretty sure that I will need to open up more than 1270 as RPCs are needed as well.
 
So, today, I am doing some more digging and I stumble on the KB article 885726 which has been out for a good while now. It pretty much says that MOM 2005 will not be able to install agents on XP with SP2 when the firewall is in use and doesn't allow port 1270.
 
One of these days I will learn to do a little searching before I start experimenting.
Posted by cluster
Filed under:

Ethernet MAC for Virtual PC and Virtual Server - Original Posted Jun 17, 2005

Since the issue keeps coming up, I have decided to add it to my blog.

Many people like to create a basic VPC or VS image and then copy the vhd and vmc files and rename them to build a new virtual machine. Sysprep is often used so the copied image isn't exactly like the original. It sure works nice, except, the ethernet address is not changed by sysprep and TCP/IP has lots of problems when there are multiple systems with the exact same MAC.

Paul Adare has discussed this about 8 bazillion times (he is getting tired of it, too). The answer is to go into the vmc file (make sure the virtual image is not in use) and remove the hex code from the xml tags. Search for ethernet_card_address and then remove the information between the opening and closing tag.

When you restart the virtual machine, it will automatically generate a new random address so there will be no conflicts.

Posted by cluster

SharePoint Portal Server - 500 Error - Original Posted Jun 1, 2005

I have been using SPS for a good while at work. Yesterday, it started to act up on me. Our SPS site is in a different domain (no trust between them) than our desktops. Normally, a separate set of credentials is required to access the portal or any of the team sites. The symptoms were:

  • Anytime I tried to access the portal or any team sites from my desktop, I would get a 500 error
  • If I tried from another desktop, it worked fine
  • If anyone tried from my desktop, it worked fine

It sounded like a profile problem to me in that it was only my account on my desktop that didn't work. My account on other systems worked properly.

I tried some steps to help with the problem. I tried:

  • Clearing my cookies
  • Clearing my Internet temp files
  • Clearing my SSL State
  • Closing all browser sessions each time I tried something
  • Doing a magic reboot

Nothing worked, so I did the one thing I didn't want to do, I killed my profile and rebuilt it. Everything works again.

Updated Jun 17, 2005.

I ran into this issue again. I was bound and determined to not have to recreate my profile yet again, so I did some more digging. This time I was able to fix it. I went to Start, Settings, Control Panel, User Accounts. I selected my account name, clicked on the Advanced Tab, clicked on the Manage Passwords button, selected the name of the SPS site, and then clicked remove. After closing up the dialong boxes and open windows, I tried the site again, it prompted me for my password information, and all was well.

Posted by cluster
Filed under:

Exchange Server SP2 - Original Posted Jun 14, 2005

I can't believe this, but I was incredibly psyched last week while in Orlando. No, it wasn't golf, and the damn rain really ruined by chick watching.

SP2 for Exchange got me all wound up. Microsoft announced some of the improvements coming forth in SP2. I was really happy to see the improvements to managing mobile devices. In particular, I love the ability to wipe a mobile device. "What? Joe got a better job than me and quit?" No problem, I can wipe the email from his mobile device before he gets home.

The one item that I found to be pretty nice as well is that MAPI can be enabled/disabled per user just like other protocols. Very nice!

There are other improvements, but I will leave it to everyone to get those from the web links above and the many blogs out there describing each feature in detail. I intend on going home and taking a nap real soon.

Posted by cluster
Filed under:

SharePoint Training - Original Posted May 19, 2005

I installed SPS and got it up and running properly. The SQL connection was a major problem as I was trying to connect it to a SQL cluster with multiple SQL instances installed. When I tried to configure SPS to connect, I used the cluster name but kept forgetting to put the instance name along with it so it kept failing. Anyways, I finally figured it out and SPS was up in a few minutes after that.

SPS is new to the company, and I really had no desire to teach everyone how to use it. So, I downloaded the SharePoint Portal Server (SPS) 2003 Training Kit from Microsoft's website at http://www.microsoft.com/downloads/details.aspx?FamilyID=bca45a99-e420-47fd-8aea-a8743735c710&DisplayLang=en.

I tried to run the install and kept getting odd errors about how it could not connect to the SPS server. Finally, I turned off SSL, and it installed properly. Somebody ought to fix that.

After a little fussing, I finally sent out invitations to everyone and made sure to mention that the training location was up and available for them to do some digging around and learn how to use SPS. I also sent everyone an email and put a link on the main SPS page for the Windows SharePoint Services (WSS) training on Microsoft's website at http://www.microsoft.com/windowsserver2003/evaluation/demos/wss/wss.html.

The end result? I had very few end users (have I mentioned that I hate end-users as they keep screwing up my network?) come to me with any questions, and the company is just loving SPS now that it is available. I must say that I have never seen such quick uptake on a new product before. It is a testament to the great work of the SPS team at Microsoft to put together all of the online training and to make the product so easy to use.

Posted by cluster
Filed under:

Exchange Cluster Support KBs - Original Posted May 25, 2005

I am not sure what I would do without Evan Dodd being around. If his blog isn't in your list, you need to add it. Anyways, I decided that Evan saves me at least a few hours a month by him finding and reporting on issues before I run into them in my life (which seems to be more of a sit-com than a drama lately). Thanks Evan.

Anyway, the reason for this entry is that Evan reported some changes to a very important KB for Exchange server clustering. KB 810986 was recently re-written with a narrower focus on Microsofts support for 3rd party clustering software (to sum it up, you may need to remove 3rd party cluster software during troubleshooting efforts).

The concern, as Evan pointed out, is that the old KB used to contain some other very important information regarding Exchange server clustering support. Evan was nice enough to also point out that this older, and still very important information, was spun off into its own KB (I bet it gets good ratings and will not be cancelled this season). The new KB 898634 clearly spells out that installing Exchange server cluster nodes on domain controllers is not supported. This has been the case for a good while having been included as a couple of sentences in the old KB. Now, however, this issue is much clearer and will get much more visibility.

I know most of the Exchange and Cluster crowd already reads Evan's blog. In the future, I will try not to rehash Evan's posts.

Posted by cluster
Filed under:

Exchange 2000 Cluster to 2003 Upgrade - Original Posted May 11, 2005

Rod Fournier and I were talking the other day about how much we hate the idea of upgrading servers, especially clusters.

Upgrading any server bothers me on several levels. First, I can never be certain that all of the old operating system and all of the old application are gone (they linger in small corners just waiting to screw with my day). Second, I worry about the supportability of the environment and worry about it doing its job. Upgrading clustered applications bothers me even more because it just doesn't lend itself to providing High Availability (HA) for vital services. It goes deeper, but I just don't get a warm and fuzzy feeling knowing that there is code on my server that not only isn't needed, but it also doesn't belong there.

So what? Yeah, I heard to you say that! Well, the whole point is that rolling upgrades of existing clusters do not allow for "clean" installations. I really want clean installations for my important applications, especially when talking about a vital application like my messaging platform.  KB842427 outlines a clean upgrade scenario that is fully supported by Microsoft (Thanks, Evan, I never read that one!) for the move from an Exchange 2000 server cluster to an Exchange Server 2003 server cluster.

I previously did some testing of my own, and my steps are pretty close to the ones in this KB. Since this is fully supported, I have chopped out a bunch of stuff from the previous blog entry and updated it accordingly. According to the KB, this process is not supported for an Active/Active cluster, but I am pretty sure it would work if you added the steps for the additional EVS. Of course, I have to add the disclaimer that Active/Active is not recommended and if you are doing an upgrade it is a good time to convert to Active/Passive.

While I like the process in the KB, I have to say that I will probably avoid it for my production environment. Instead, I will use this alternate solution:

Scenario: NodeA is the Active node, and NodeB is the passive node.

  1. Borrow a server with enough disk space to hold all of your cluster's mailboxes.
  2. Back up the Exchange cluster just in case you need it (if you don't back it up, you will need to do a restore - it is the way of the jungle).
  3. Install the new server with Windows Server 2003 and Exchange Server2003 (both Standard Editions) into the existing Exchange Organization. Name it whatever you want.
  4. Apply latest service packs and patches.
  5. Move all mailboxes to the new server.
  6. Migrate public folders to the new server. This is the tough part. To make this work, you will need to wait several days until the outlook profiles are properly updated to point to the temp server.
  7. Uninstall Exchange 2000 from NodeB.
  8. Stop the cluster service on NodeB. Evict NodeB from the cluster.
  9. Use KB 833396 to manually remove Exchange 2003 and to clean up active directory from NodeA.
  10. Wipe and reinstall both NodeA and NodeB (use new names, i.e. NodeC and NodeD to avoid any name issues in case you missed something and need to do more work in AD to clean it up) with Windows Server 2003 Enterprise Edition and apply the latest service pack and patches.
  11. Create a new cluster with NodeC and NodeD.
  12. Install Exchange Server 2003 on NodeC and NodeD and create the EVS in the existing Exchange org (do not use the old EVS network name here).
  13. Install AV, content scanning, and other Exchange applications on the cluster.
  14. Move all mailboxes from the Temp Exchange server to the new cluster.
  15. Migrate the public folder structure to the new cluster.
  16. Uninstall the Temp Exchange server. Again, you will probably want to wait several days so that the outlook profiles are properly updated.

This entry was updated on 5/13/05 per info from Evan Dodd.

Posted by cluster
Filed under:

Exchange Server 2003 and /3GB - Original Posted May 1, 2005

I have discussed the /3GB and /PAE switches previously in this entry: http://spaces.msn.com/members/russkaufmann/Blog/cns!1pwuGkyvTDx37q1_Y3JQ_E6g!229.entry

There seems to be a problem with the Exchange Best Practice Analyzer in that it incorrectly recommends that some Exchange servers have the /3GB switch implemented.

Refer to http://support.microsoft.com/kb/815372/ 

The key in this article is the note: "For Exchange Server computers that do not contain any mailboxes or public folders (such as mail gateways), we do not recommend setting the /3GB switch in boot.ini, independently of installed physical memory size."

Also, in http://support.microsoft.com/kb/823440/ it says:

"When you install Exchange Server 2003 on a Microsoft Windows Server 2003-based computer that has 1 gigabyte (GB) or more of physical random access memory (RAM) installed, and that is home to mailboxes or public folders, you must edit the Boot.ini file to optimize the virtual memory usage of the Information Store (I added the bolding) service."

The store.exe utilizes and leverages the addition RAM available in user mode when you use the /3gb switch. However, front-end (OWA) and IMR servers (which do not have information stores) actually suffer when using the /3gb switch and they should not use the /3gb switch.

Posted by cluster
Filed under:
More Posts Next page »