AK_Network_Geek: router

Showing posts with label router. Show all posts

Thursday, August 30, 2018

Troubleshooting Dial-up T1 Lines

I had an interesting trouble ticket land in my lap a few years ago. My employer at the time was one of the few service providers still using various and sundry Cisco AS5300 routers to provide dial-up (!) Internet service to customers. In one location where we had one of these AS5300 routers, the CO tech was notified that his telephone switch was seeing "Remote Made Busy" alarms from my AS5300, and after some initial troubleshooting, he escalated the ticket to me to investigate from the router side.

Unfortunately, when I logged in to the router, I found nothing wrong:


as2.blah#sho run | begin controller

controller T1 0

 framing esf

 clock source line primary

 linecode b8zs

 cablelength short 133

 ds0-group 0 timeslots 1-24 type e&m-fgb dtmf dnis

 description HC 09201 tg#ISP2 trk 1-24, DTC 00-07, #xxx-1005

!

as2.blah#sho controller t1 0

T1 0 is up.

  Applique type is Channelized T1

  Cablelength is short 133

  Description: HC 09201 tg#ISP2 trk 1-24, DTC 00-07, #xxx-1005

  No alarms detected.

  alarm-trigger is not set

  Version info of slot 0:  HW: 1, PLD Rev: 11

  Framer Version: 0x8

<...snip...>

  Total Data (last 24 hours)

     1 Line Code Violations, 1 Path Code Violations,

     0 Slip Secs, 0 Fr Loss Secs, 1 Line Err Secs, 1 Degraded Mins,

     1 Errored Secs, 0 Bursty Err Secs, 0 Severely Err Secs, 0 Unavail Secs

as2.blah#sho caller ip

  Line           User       IP Address      Local Number    Remote Number   <->

as2.blah#

You can manually busy-out a trunk, as shown on Controller T1 2:


as2.blah#sho run | begin ontroller

<...snip...>
controller T1 2

 framing esf

 clock source line secondary 2

 linecode b8zs

 cablelength short 133

 ds0-group 0 timeslots 1-24 type e&m-fgb dtmf dnis

 ds0 busyout 1-24 soft

 description 45.ISP.001119..8901 tg#ISP2 trk 49-72, DTC 04-01, #xxx-4108/xxx-1199

!

<...snip...>

See the line that says, "ds0 busyout 1-24 soft?" That tells the router to busy-out (disable, but only once an individual DS-0 goes inactive) the individual voice channels inside the T1. However, that line didn't exist on Controller T1 0, so no-one had intentionally busied-out the trunk.

Once I had verified that there was nothing obviously wrong with the T1, I bounced the T1 line by running a shut/no shut on Controller T1 0. No change. Then, I rebooted the router. Again, no change. I called the CO tech, who confirmed that he was still seeing the "Remote Made Busy" alarm on the T1, meaning that from his equipment's perspective, my router had busied-out the individual lines on the T1.

Eventually, I called a co-worker of mine who had been a Cisco AS5x00 guru back in the day, who showed me another troubleshooting tip:


as2.blah# sho controllers t1 0 call-counters

T1 0:

  DS0's Active: 0

  DS0's Active High Water Mark: 2

  TimeSlot   Type   TotalCalls   TotalDuration

      1       cas           6       00:36:48

      2       cas           7       01:19:29

      3       cas           7       00:24:16

      4       cas           7       00:30:35

      5       cas           7       00:15:49

      6       cas           6       02:33:36

      7       cas           7       03:06:59

      8       cas           7       00:23:25

      9       cas           7       03:01:43

     10       cas           5       04:03:10

     11       cas           6       00:38:36

     12       cas           7       01:08:50

     13       cas           5       05:33:33

     14       cas           6       01:36:16

     15       cas           5       00:16:07

     16       cas           6       01:06:34

     17       cas           5       01:06:48

     18       cas           5       00:09:15

     19       cas           6       00:05:20

     20       cas           6       02:12:24

     21       cas           6       01:25:18

     22       cas           5       00:27:50

     23       cas           5       00:42:23

     24       cas           6       01:47:45



Total DS0's Active High Water Mark: 3

Total Calls since System Bootup: 178

as2.blah#

Ideally, under the "TotalCalls" column, we would see an even distribution of calls -- that is, each individual timeslot in the T1 trunk would have approximately the same number of received calls -- and in fact, in this case, the distribution turns out to be pretty even, with between 5 and 7 calls on each DS-0 (controller T1 1 looks even better with almost exactly six calls per DS-0). Also, the last column, "TotalDuration," shouldn't show any unusually low counts, where "unusual" is determined entirely by context. In this case, the router had been rebooted recently, so fairly low numbers for call duration were to be expected. However, if most of the individual timeslots had total call durations of 20-30 hours, and one (or two, or...) timeslots had call durations of, say, 30 minutes, then that's a pretty good indication of a problem on that DS-0, especially if the router had not been rebooted in quite a while (the longer it has been running, the more even the call duration distribution should be).

Eventually, the engineer I called agreed with my assessment: there did not appear to be anything wrong with the router or the T1 lines. Our best guess was that, at some point in the last ten years or so since this router had been installed, our documentation in the controller description had diverged from what was actually plugged in to the router, meaning that controller T1 0 was not the one we really should have been troubleshooting. Unfortunately, by the time I got that far with the troubleshooting process, the problem had mysteriously corrected itself, and as a result, I didn't get a chance to verify the controller descriptions. That's a bit of a mixed blessing. To the engineer in me, it was disappointing not to have found a definitive cause of the problem, but at least everything was working properly once again.

Sunday, January 8, 2017

Advanced Cisco Routing -- BGP MED (Multi-Exit Discriminator)

Suppose we have two connections to our upstream ISP: a high-speed link from Cust-A to ISP-1, and a low-speed link from Cust-A to ISP-2 (Cust-B is just a random Internet host):

Here are the subnets in use on this network:

Subnet	Endpoint A	Endpoint A Interface	Endpoint B	Endpoint B Interface
10.254.254.1	ISP-1	Lo0	N/A	N/A
10.254.254.2	ISP-2	Lo0	N/A	N/A
100.64.1.254	Cust-A	Lo0	N/A	N/A
100.64.2.254	Cust-A	Lo0	N/A	N/A
10.0.0.0/30	ISP-1	Gig-E 1/0	Cust-A	Gig-E 1/0
10.0.0.4/30	ISP-1	Gig-E 2/0	Cust-B	Gig-E 1/0
10.0.0.8/30	ISP-1	Gig-E 3/0	ISP-2	Gig-E 3/0
10.0.0.12/30	ISP-2	Gig-E 1/0	Cust-B	Gig-E 2/0
100.64.1.0/26	Cust-A	Fast-E 0/0	Knoppix-32	Eth 1/0
100.64.2.0/26	Cust-B	Fast-E 0/0	CentOS7_1	Eth 1/0

Obviously, we would typically want traffic to flow across the high-speed link rather than the low-speed link. However, BGP doesn't consider bandwidth when determining the "best" path from one host to another:

As you can see, BGP has selected a route via the low-speed circuit from the host Knoppix-32 PC to the CentOS7_1 web server in Cust_B's network. To solve this problem, it's easy enough to set a weight on the outbound link to force traffic to use the circuit connected to ISP-1. All we have to do is set a sufficiently high metric on the route we want to take:

Cust-A


router bgp 65512

 neighbor 10.0.0.1 weight 30

Since higher weights take priority over lower weights, this will force outbound traffic to use ISP-1 rather than ISP-2. However, that only has an effect on our outbound traffic. BGP may still provide a route from Cust-B back to us through ISP-2 (the low-bandwidth circuit). This potentially causes two problems: first, we'd rather have our traffic go through the faster circuit (for obvious reasons); and second, this can cause "asymmetric routing." Some applications and network devices (stateful firewalls, for example) really don't like asymmetric routing. Unfortunately, trying to troubleshoot a problem caused by asymmetric routing can be a real PITA, and no, not the tasty kind :( To force other networks to prefer the path via ISP-1, we will adjust BGP's "MED" ("Multi Exit Discriminator"), one of the metrics that BGP uses to calculate the "best" route between endpoints. First, on our router, we'll create an access list to identify our internal networks:


Cust-A(config)#ip access-list standard BGP_Internal_Nets

Cust-A(config-std-nacl)#permit 100.64.1.0 0.0.0.63

Cust-A(config-std-nacl)#permit host 100.64.1.254

Next, we create a route map:


Cust-A(config)#route-map BGP_MED 10 

Cust-A(config-route-map)#match ip addr BGP_Internal_Nets

Cust-A(config-route-map)#set metric 110

Finally, we apply the route map to the LESS-PREFERRED neighbor (ISP-2) in our BGP configuration:


Cust-A(config)#router bgp 65512

Cust-A(config-router)#neighbor 172.16.0.1 route-map BGP_MED out

Cust-A(config-router)#exit

Cust-A(config)#exit

Cust-A#clear ip bgp 65511

Unlike weight, a lower MED is preferable to a higher MED, and therefore, by advertising a higher-than-default MED to ISP-2's BGP process, we are effectively telling it to prefer an alternate route to our network.

After BGP re-converges, we should see that both ISP-1 and ISP-2 are using the higher-bandwidth link via ISP-1 to reach 100.64.1.x:


ISP-1#sho ip bgp | inc 65512

*  10.0.0.0/30      10.0.0.2                 0             0 65512 i

*> 100.64.1.0/26    10.0.0.2                 0             0 65512 i

*> 100.64.1.254/32  10.0.0.2                 0             0 65512 i

*  172.16.0.0/30    10.0.0.2                 0             0 65512 i

ISP-1#

...and...:


ISP-2(config)#do sho ip bgp | inc 65512

*>i100.64.1.0/26    10.0.0.2                 0    100      0 65512 i

*                   172.16.0.2             110             0 65512 i

*>i100.64.1.254/32  10.0.0.2                 0    100      0 65512 i

*                   172.16.0.2             110             0 65512 i

ISP-2(config)#

Perfect! Both routers are now advertising a preferred route via ISP-1, just as we wanted (">" indicates a preferred route). You can verify this by a traceroute from CentOS7_1:

By setting the MED in our BGP config, we have redundant links to our ISP, but will still prefer the high-bandwidth circuit unless there is a problem. I'll leave testing fail-over as an exercise for the reader ;)

Advanced Cisco Routing -- BGP and OSPF Part 2

Quite a while ago, I created a post on using BGP and OSPF together on Cisco routers. In that particular example, I used OSPF to route within an internal area and BGP to peer with another provider's area, then redistributed OSPF into BGP and BGP into OSPF. If you'll recall, one of the reasons I gave for using BGP when service providers peer with each other is that the Internet's routing tables are too large to incorporate into an interior gateway protocol like OSPF.

This raises a question, however. How can you redistribute BGP into OSPF if OSPF isn't capable of handling that many routes?

In this lab, I'll show one way of addressing this problem. We'll start by creating the following network:

Warning: I am using publicly routable addresses in this lab! DO NOT try to build this lab on real hardware that is connected to an actual Internet connection, as the potential exists to conflict with real IP addresses actually in use, or to propagate bogus routes into your network!

In this lab, the routers R1 through R6, the routers below the switch in the diagram, are all maintained by various other service providers, and therefore all exist in separate Autonomous Systems (AS's). Meanwhile, the routers above the switch, that is, R7 through R10, are under your control. Because I'm lazy (I've mentioned that before, haven't I?), I simply used loopback interfaces on R1 through R6 to simulate various networks in use on each of the AS's 65512 through 65517. Here is the relevant portions of the config from one of these routers:


interface Loopback0

 ip address 141.5.17.1 255.255.255.192

!

interface Loopback1

 ip address 141.5.17.65 255.255.255.192

!

interface Loopback2

 ip address 141.5.17.129 255.255.255.192

!

interface FastEthernet0/0

 ip address 7.7.7.2 255.255.255.240

 duplex auto

 speed auto

!

router bgp 65513

 no synchronization

 bgp router-id 141.5.17.1

 bgp log-neighbor-changes

 network 7.7.7.0 mask 255.255.255.240

 network 141.5.17.0 mask 255.255.255.192

 network 141.5.17.64 mask 255.255.255.192

 network 141.5.17.128 mask 255.255.255.192

 neighbor 7.7.7.1 remote-as 65512

 neighbor 7.7.7.3 remote-as 65514

 neighbor 7.7.7.4 remote-as 65515

 neighbor 7.7.7.5 remote-as 65516

 neighbor 7.7.7.6 remote-as 65517

 neighbor 7.7.7.7 remote-as 65518

 no auto-summary

!

One thing I didn't mention in my earlier posts on BGP: the "network" statement in BGP does not operate like the "network" statement in IGP's like OSPF or EIGRP. In this case, the network statement tells BGP what networks you wish to advertise; in an IGP, they enable the routing protocol on the interface that is attached to that network. Consequently, this router (R2, as it happens) is advertising three /26 networks: 141.5.17.0/26, 141.5.17.64/26 and 141.5.17.128/26. It is also offering to peer with six neighbor routers, 7.7.7.1, 7.7.7.3, 7.7.7.4, 7.7.7.5, 7.7.7.6 and 7.7.7.7. So far, pretty straightforward, right?

Likewise, R9 and R10 are pretty straightforward, as well. R8, R9 and R10 are all participating in OSPF area 0.0.0.0:


interface Loopback0

 ip address 10.254.254.10 255.255.255.255

!

interface FastEthernet0/0

 ip address 194.0.0.10 255.255.255.0

 duplex auto

 speed auto

!

router ospf 42

 router-id 10.254.254.10

 log-adjacency-changes

 redistribute connected subnets

 network 194.0.0.0 0.0.0.255 area 0.0.0.0

!

Again, no surprises here. OSPF is enabled on Fa0/0, and we are redistributing the IP address of our Loopback0 interface in OSPF.

The magic in this lab happens between R7 and R8. In fact, at first glance, you might be wondering why we even put two separate routers here. Since the MTBF of a system of devices decreases with every (non-redundant) device you add to the system (because the probability of a failure of the system is equal to the product of the probability of failure of every non-redundant device in the system), putting two routers in series at this point has decreased the reliability of the network.

The reason for using two routers becomes apparent, however, when you look at the configs:

R7:


interface Loopback0

 ip address 10.254.254.7 255.255.255.255

!

interface FastEthernet0/0

 ip address 7.7.7.7 255.255.255.240

 duplex auto

 speed auto

!

interface FastEthernet1/0

 ip address 209.112.170.7 255.255.255.0

 duplex auto

 speed auto

!

router bgp 65518

 bgp router-id 10.254.254.7

 bgp log-neighbor-changes

 neighbor 7.7.7.1 remote-as 65512

 neighbor 7.7.7.2 remote-as 65513

 neighbor 7.7.7.3 remote-as 65514

 neighbor 7.7.7.4 remote-as 65515

 neighbor 7.7.7.5 remote-as 65516

 neighbor 7.7.7.6 remote-as 65517

 neighbor 209.112.170.8 remote-as 65518

 !

 address-family ipv4

 neighbor 7.7.7.1 activate

 neighbor 7.7.7.2 activate

 neighbor 7.7.7.3 activate

 neighbor 7.7.7.4 activate

 neighbor 7.7.7.5 activate

 neighbor 7.7.7.6 activate

 neighbor 209.112.170.8 activate

 no auto-summary

 no synchronization

 network 7.7.7.0 mask 255.255.255.240

 network 209.112.170.0

 exit-address-family

!

R8:


interface Loopback0

 ip address 10.254.254.8 255.255.255.255

!

interface FastEthernet0/0

 ip address 209.112.170.8 255.255.255.0

 duplex auto

 speed auto

!

interface FastEthernet1/0

 ip address 194.0.0.8 255.255.255.0

 duplex auto

 speed auto

!

interface FastEthernet2/0

 ip address 193.0.0.8 255.255.255.0

 duplex auto

 speed auto

!

router ospf 42

 router-id 10.254.254.8

 log-adjacency-changes

 passive-interface Loopback0

 network 193.0.0.0 0.0.0.255 area 0.0.0.0

 network 194.0.0.0 0.0.0.255 area 0.0.0.0

 default-information originate always

!

router bgp 65518

 bgp router-id 10.254.254.8

 bgp log-neighbor-changes

 neighbor 209.112.170.7 remote-as 65518

 !

 address-family ipv4

 neighbor 209.112.170.7 activate

 no auto-summary

 no synchronization

 network 193.0.0.0

 network 194.0.0.0

 network 209.112.170.0

 exit-address-family

!

ip route 0.0.0.0 0.0.0.0 209.112.170.7

!

I don't want to redistribute BGP into OSPF, since that would make the OSPF routing tables too large (okay, not in this example, but if you are peering with actual service providers...). However, I can't just point static routes at the peers either, since that would entirely defeat the purpose of using dynamic routing protocols. Consequently, on R8, I am redistributing OSPF into BGP, then pointing a single default route to R7 and redistributing that default route to R9 and R10 with the "default-information originate" directive on R8. Then, R7 and R8 are BGP peering so that R7 picks up all of the routes in use by R8, R9 and R10 (this is a use of BGP, which is typically an Exterior Gateway Protocol, as an IGP). Because R7 is BGP peering with R1 through R6, it knows how to reach each of the subnets advertised by its peers, and consequently, all of our routers can pass traffic back and forth to each other.

Saturday, January 7, 2017

Advanced Cisco Networking: Policy-Based Routing (PBR)

Suppose you have a multi-homed network where you want to direct certain traffic out one interface, but other traffic out another. For example, maybe you want your VoIP traffic to use a moderately low bandwidth circuit, but with extremely strict QoS policies to provide low latency and jitter, while your bulk data traffic takes a higher bandwidth circuit with no QoS protection. Or, perhaps you have a small-bandwidth circuit for management traffic (one network I managed had an "overhead" T1 on an OC-3 microwave shot and we used the overhead T1 for out-of-band management). In any case, Policy-Based Routing (PBR) is a way for you to designate specific routes for certain traffic, based upon any of a number of characteristics -- basically, if you can match it with an access-list, you can use it to make PBR decisions.

Once again, we'll start with a network diagram:

I've stacked the deck pretty heavily in favor of the route R1-R3-R5 in this network: this route has Gig-E interfaces, while R1-R2-R4-R5 is only using FastEthernet interfaces, and there are fewer hops via R1-R3-R5 than R1-R2-R4-R5. As you can see in the screenshot below, this network design does, in fact, favor using R1-R3-R5 as the preferred route between the two hosts connected to R1 and the CentOS server connected to R5:

Now, let's set up policy-based routing so that system management traffic (Telnet, SSH and SNMP), as well as any traffic from the Sysmon CentOS server are routed through the lower-bandwidth -- but lower latency -- route across R2 and R4:

R1:


R1(config)#ip access-list extended matchSYSMON

R1(config-ext-nacl)#permit tcp any any eq 22

R1(config-ext-nacl)#permit tcp any any eq 23

R1(config-ext-nacl)#permit tcp any any eq 161

R1(config-ext-nacl)#permit ip host 192.168.1.4 any 

R1(config-ext-nacl)#deny ip any any

R1(config-ext-nacl)#route-map SYSMON permit 10 

R1(config-route-map)#match ip address matchSYSMON

R1(config-route-map)#set ip next-hop 10.1.2.2

R1(config-route-map)#int fa0/0

R1(config-if)#ip policy route-map SYSMON

R1(config-if)#exit

Now, let's try the traceroutes again:

Looks like it did before. However, from Sysmon, we see that we are taking a different route, just as expected:

Since the Knoppix host is simply using the default route, OSPF is using the higher-bandwidth, lower hop-count route. However, the router has identified the traffic originating on the Sysmon server as matching the routing policy that we added to R1, and therefore is steering this traffic through R2 and R4, just as we intended.

If you'll recall, our design goal in this scenario was to ensure that management traffic had low-latency queueing across the network. Suppose our service provider on the R1-R2-R4-R5 path had agreed to honor our QoS markings, but the provider on the R1-R3-R5 path re-marked everything with a lower priority. We can use the route-map we have created for the routing policy to also adjust our QoS markings for traffic going through R2 and R4:


R1(config)#route-map SYSMON permit 10

R1(config-route-map)#match ip address matchSYSMON

R1(config-route-map)#set ip next-hop 10.1.2.2

R1(config-route-map)#set ip precedence flash 

R1(config-route-map)#exit

R1(config)#do sho run | section route-map SYSMON permit 10

route-map SYSMON permit 10

 match ip address matchSYSMON

 set ip precedence flash

 set ip next-hop 10.1.2.2

R1(config)#

Cool! Suppose we wanted to do some traffic engineering across an MPLS network:


R1(config)#route-map SYSMON permit 10

R1(config-route-map)#match ip address matchSYSMON

R1(config-route-map)#set ip ?

...

  vrf         VRF name

R1(config-route-map)#

That's really cool! As you can see, policy-based routing is a very powerful tool, allowing you to do a lot of traffic manipulation to optimize your network and traffic flows.

At this point, those of you who are paying attention ;) will be thinking to yourself, "That's great, but what happens if we lose the next-hop router specified in our routing policy?" That is a great question, and with the configuration shown here, your traffic will be dropped on the floor. That's hardly optimal, but as I'm sure you've suspected, there is a solution to this problem...which we'll cover in a later lesson.

Monday, December 19, 2016

Advanced Cisco Routing: BGP Route Reflectors

Advanced Cisco Routing: BGP Route Reflectors Suppose your network uses BGP as your Interior Gateway Protocol (IGP). Because iBGP will not share routes learned across one interface through a second interface (i.e., if R1 learns a route from R2, it will not share that route with R3, R4 or R5), your network must be a full mesh, like so:

While this is very robust, it is neither scalable nor efficient. Given a network of n nodes, then you must create n(n - 1) physical connections, with an IP address on each side of the connection, with a "neighbor ... remote-as..." and "neighbor ... activate" statement in the BGP config, and a "network ... mask ..." statement in the BGP config. When you are talking about just a handful of routers, that's not too terribly bad, but as your network grows, that starts to become rather cumbersome. For example, here are the interface configs and BGP config for R1 in the full-mesh network shown above:


interface Loopback0

 ip address 10.254.254.1 255.255.255.255

!

interface Loopback10

 ip address 192.168.1.1 255.255.255.0

!

interface FastEthernet1/0

 ip address 10.1.2.1 255.255.255.252

!

interface FastEthernet1/1

 ip address 10.1.3.1 255.255.255.252

!

interface FastEthernet2/0

 ip address 10.1.4.2 255.255.255.252

!

interface FastEthernet2/1

 ip address 10.1.5.2 255.255.255.252

!

router bgp 65510

 bgp router-id 10.254.254.1

 bgp log-neighbor-changes

 neighbor 10.1.2.2 remote-as 65510

 neighbor 10.1.3.2 remote-as 65510

 neighbor 10.1.4.1 remote-as 65510

 neighbor 10.1.5.1 remote-as 65510

 !

 address-family ipv4

  neighbor 10.1.2.2 activate

  neighbor 10.1.3.2 activate

  neighbor 10.1.4.1 activate

  neighbor 10.1.5.1 activate

  no auto-summary

  no synchronization

  network 10.1.2.0 mask 255.255.255.252

  network 10.1.3.0 mask 255.255.255.252

  network 10.1.4.0 mask 255.255.255.252

  network 10.1.5.0 mask 255.255.255.252

  network 10.254.254.1 mask 255.255.255.255

  network 192.168.1.0

 exit-address-family

!

Ugh...that's a lot of configuration, and a lot of chances to make a mistake...and that's only on a network with 5 routers! The SMALL ISP that I used to work for had 25 to 30 routers on our Internet service network. Imagine what a full-mesh config on one of those routers would look like!

To solve this problem, the designers of the BGP protocol created the concept of "route reflectors." Route Reflectors do exactly what it sounds like: they "reflect" routes learned through one interface out other interfaces. As a result, it is no longer necessary to create a physical connection between every node in your network, nor is it necessary for every node in the network to be an iBGP peer with every other node in the network. This allows you to have a much simpler network topology:

R1 doesn't change at all -- we still have all four network interfaces up, and R1 is peering with every one of the other routers. However, R3 is the opposite extreme: the ONLY router to which R3 is connected is R1, and consequently, there is now only 1 peering statement in the BGP config. As you can see, we no longer have the full network topology stored in our routing tables:


R3#sho ip route

Gateway of last resort is not set



     10.0.0.0/8 is variably subnetted, 6 subnets, 2 masks

C       10.1.3.0/30 is directly connected, FastEthernet2/0

C       10.254.254.3/32 is directly connected, Loopback0

B       10.1.2.0/30 [200/0] via 10.1.3.1, 00:48:25

B       10.254.254.1/32 [200/0] via 10.1.3.1, 00:36:43

B       10.1.5.0/30 [200/0] via 10.1.3.1, 00:48:25

B       10.1.4.0/30 [200/0] via 10.1.3.1, 00:48:25

B    192.168.1.0/24 [200/0] via 10.1.3.1, 00:48:25

C    192.168.3.0/24 is directly connected, Loopback10

R3#

We can resolve this by configuring R1 to be the route reflector for the other four routers:

R1:


R1(config)#router bgp 65510

R1(config-router)# neighbor 10.1.2.2 route-reflector-client

R1(config-router)# neighbor 10.1.3.2 route-reflector-client

R1(config-router)# neighbor 10.1.4.1 route-reflector-client

R1(config-router)# neighbor 10.1.5.1 route-reflector-client

R1(config-router)# bgp cluster-id 1

At this point, all of the other routers should have all the same routes that R1 has (only R3 shown):


R3#sho ip route

Gateway of last resort is not set



B    192.168.4.0/24 [200/0] via 10.1.4.1, 00:02:15

B    192.168.5.0/24 [200/0] via 10.1.5.1, 00:02:15

     10.0.0.0/8 is variably subnetted, 11 subnets, 2 masks

B       10.254.254.2/32 [200/0] via 10.1.2.2, 00:02:15

C       10.1.3.0/30 is directly connected, FastEthernet2/0

C       10.254.254.3/32 is directly connected, Loopback0

B       10.1.2.0/30 [200/0] via 10.1.3.1, 00:02:20

B       10.254.254.1/32 [200/0] via 10.1.3.1, 00:02:20

B       10.2.4.0/30 [200/0] via 10.1.2.2, 00:02:15

B       10.2.5.0/30 [200/0] via 10.1.2.2, 00:02:15

B       10.254.254.4/32 [200/0] via 10.1.4.1, 00:02:15

B       10.1.5.0/30 [200/0] via 10.1.3.1, 00:02:20

B       10.254.254.5/32 [200/0] via 10.1.5.1, 00:02:15

B       10.1.4.0/30 [200/0] via 10.1.3.1, 00:02:21

B    192.168.1.0/24 [200/0] via 10.1.3.1, 00:02:21

B    192.168.2.0/24 [200/0] via 10.1.2.2, 00:02:16

C    192.168.3.0/24 is directly connected, Loopback10v
R3#

You can see that we have routes now...but do they work? Let's find out:


R3#ping 192.168.1.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.1.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 12/20/28 ms

R3#ping 192.168.2.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.2.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 20/26/40 ms

R3#ping 192.168.3.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.3.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 1/1/4 ms

R3#ping 192.168.4.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.4.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 20/25/40 ms

R3#ping 192.168.5.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.5.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 36/36/40 ms

R3#

Yep, looks like it. Good job!

At this point, you may be thinking to yourself, "That's great...but if R1 goes off-line, most of your network goes off-line, too," and you'd be exactly right. Fortunately, it is possible to use more than one route reflector on your network. Let's make a few changes...

R1:


R1(config)#router bgp 65510

R1(config-router)#no network 10.1.4.0 mask 255.255.255.252

R1(config-router)#no network 10.1.5.0 mask 255.255.255.252

R1(config-router)#no neighbor 10.1.4.1 remote-as 65510

R1(config-router)#no neighbor 10.1.5.1 remote-as 65510

R1(config-router)#int fa2/0

R1(config-if)#shut

R1(config-if)#no ip addr

R1(config-if)#int fa2/1

R1(config-if)#shut

R1(config-if)#no ip addr

R2:


R2(config)#router bgp 65510

R2(config-router)#neighbor 10.1.2.1 route-reflector-client

R2(config-router)#neighbor 10.2.4.2 route-reflector-client

R2(config-router)#neighbor 10.2.5.1 route-reflector-client

R2(config-router)#bgp cluster-id 1

R4:


R4(config)#router bgp 65510

R4(config-router)#no  neighbor 10.1.4.2 remote-as 65510

R4(config-router)#no   network 10.1.4.0 mask 255.255.255.252

R4(config-router)#int fa1/1

R4(config-if)#shut

R4(config-if)#no ip addr

R5:


R5(config)#router bgp 65510

R5(config-router)#no  neighbor 10.1.5.2 remote-as 65510

R5(config-router)#no   network 10.1.5.0 mask 255.255.255.252

R5(config-router)#int fa1/0

R5(config-if)#shut

R5(config-if)#no ip addr

Keep in mind that it wasn't necessary to modify the configs on R1, R4 and R5 if we were only adding redundancy; I removed the links from R1 to R4 and R5 simply to show that BGP was still providing routes to these hosts via R2, but if you only wanted to add redundant routes to R2, then all you would have needed to do was add the "neighbor ... route-reflector-client" and "bgp cluster-id 1" statements to R2's BGP configuration. Anyway, let's make sure that we still have the routes we expect (only R5 shown):


R5#ping 192.168.1.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.1.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 24/29/36 ms

R5#ping 192.168.2.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.2.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 8/16/36 ms

R5#ping 192.168.3.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.3.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 8/45/76 ms

R5#ping 192.168.4.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.4.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 24/33/40 ms

R5#ping 192.168.5.1



Type escape sequence to abort.

Sending 5, 100-byte ICMP Echos to 192.168.5.1, timeout is 2 seconds:

!!!!!

Success rate is 100 percent (5/5), round-trip min/avg/max = 1/2/4 ms

R5#

Looks good! With that, we'll wrap up this lesson, but in a later lesson, we'll discuss BGP confederations and peer groups.

Friday, December 16, 2016

Advanced Cisco Routing: A Full MPLS Network

A little over two years ago, I wrote a blog post about MPLS. In that lab, we built a very small, very simple MPLS network, where R1, R2 and R3 served as both our MPLS core and our "Provider Edge" routers. In the real world, you typically won't see this, as the requirements for a core and edge router are very different: the core is usually built on high-end chassis' with lots of memory and high-speed interfaces, whereas the edge routers are usually much smaller, much less expensive devices. Today, we will revisit the MPLS lab, breaking out the core ("P" -- "Provider"), edge ("PE" -- "Provider Edge") and customer ("CE" -- "Customer Edge") routers, and showing what is different amongst all three categories of routers.

Let's start with the core. Since I am mocking this lab up in GNS3 on a laptop with only 4GB of RAM, the core is going to be very simple: just two routers (P1 and P2), with a single Gig-E connection between them:

As I mentioned in the previous MPLS lab, we must be running CEF in order to run MPLS, so before anything else, make sure you've enabled CEF on the two core routers. Then, we'll put IP addresses on Gig3/0 on both P1 and P2, and configure a Loopback IP address, as well:


P1(config)#ip cef

P1(config)#int lo0

P1(config-if)#ip addr 10.254.254.1 255.255.255.255

P1(config-if)#no shut

P1(config-if)#int gig3/0

P1(config-if)#ip addr 10.0.0.1 255.255.255.252

P1(config-if)#no shut

P1(config-if)#

From this, I'm sure you can figure out how to configure P2 (basically, find any IP address that ends in ".1" and replace it with ".2"), so I won't belabor the point with a full config for P2 here.

Next, we will need to enable MPLS on Gig3/0 on both routers, and turn up OSPF so that our core and provider edge routers can route to each other:


P1(config-if)#int gig3/0

P1(config-if)#mpls ip

P1(config-if)#router ospf 42

P1(config-router)#router-id 10.254.254.1

P1(config-router)#network 10.0.0.0 0.0.0.3 area 0.0.0.0

P1(config-router)#redist conn sub

P1(config-router)#exit

P1(config)#

Once you've made the equivalent changes on P2, you should see the following output on both routers:


*Dec 16 11:40:01.311: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.2 on GigabitEthernet3/0 from LOADING to FULL, Loading Done

P1(config)#

*Dec 16 11:40:10.767: %LDP-5-NBRCHG: LDP Neighbor 10.254.254.2:0 (1) is UP

P1(config)#

With that, your P (core) routers are essentially done. You will need to turn up interfaces to connect to your PE (edge) routers -- don't forget the "mpls ip" command on those interfaces! -- and you'll need to establish routing between the P and PE routers, but that should be old hat by now.

Let's move on to the PE routers. We will connect PE1 to P1, and PE2 to P2, like so...:

...using the following configs:
PE1:


PE1(config)#ip cef

PE1(config)#router ospf 42

PE1(config-router)#router-id 10.254.254.3
    
PE1(config-router)#int lo0

PE1(config-if)#ip addr 10.254.254.3 255.255.255.255

PE1(config-if)#no shut

PE1(config-if)#ip ospf 42 area 0.0.0.0

PE1(config-if)#int gig2/0

PE1(config-if)#mpls ip 

PE1(config-if)#ip addr 10.1.1.2 255.255.255.252

PE1(config-if)#no shut

PE1(config-if)#ip ospf 42 area 0.0.0.0

...and...:

PE2:


PE2(config)#ip cef

PE2(config)#router ospf 42

PE2(config-router)#router-id 10.254.254.4
            
PE2(config-router)#int lo0

PE2(config-if)#ip addr 10.254.254.4 255.255.255.255

PE2(config-if)#ip ospf 42 area 0.0.0.0

PE2(config-if)#no shut

PE2(config-if)#int gig2/0

PE2(config-if)#mpls ip

PE2(config-if)#ip addr 10.2.1.2 255.255.255.252

PE2(config-if)#ip ospf 42 area 0.0.0.0

PE2(config-if)#no shut

Once you've gotten this far, you should see output similar to this as the various adjacencies come up:


*Dec 16 11:58:31.063: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.2 on GigabitEthernet2/0 from LOADING to FULL, Loading Done

*Dec 16 11:58:41.499: %LDP-5-NBRCHG: LDP Neighbor 10.254.254.2:0 (1) is UP

Let's check our routing tables and LDP database to make sure everything is working as expected:


PE1#sho ip route

Codes: C - connected, S - static, R - RIP, M - mobile, B - BGP

       D - EIGRP, EX - EIGRP external, O - OSPF, IA - OSPF inter area 

       N1 - OSPF NSSA external type 1, N2 - OSPF NSSA external type 2

       E1 - OSPF external type 1, E2 - OSPF external type 2

       i - IS-IS, su - IS-IS summary, L1 - IS-IS level-1, L2 - IS-IS level-2

       ia - IS-IS inter area, * - candidate default, U - per-user static route

       o - ODR, P - periodic downloaded static route



Gateway of last resort is not set



     10.0.0.0/8 is variably subnetted, 7 subnets, 2 masks

O E2    10.254.254.2/32 [110/20] via 10.1.1.1, 00:10:26, GigabitEthernet2/0

C       10.254.254.3/32 is directly connected, Loopback0

O       10.2.1.0/30 [110/3] via 10.1.1.1, 00:10:26, GigabitEthernet2/0

C       10.1.1.0/30 is directly connected, GigabitEthernet2/0

O       10.0.0.0/30 [110/2] via 10.1.1.1, 00:10:26, GigabitEthernet2/0

O E2    10.254.254.1/32 [110/20] via 10.1.1.1, 00:10:26, GigabitEthernet2/0

O       10.254.254.4/32 [110/4] via 10.1.1.1, 00:05:29, GigabitEthernet2/0

PE1#sho mpls ldp neigh

    Peer LDP Ident: 10.254.254.1:0; Local LDP Ident 10.254.254.3:0

    TCP connection: 10.254.254.1.646 - 10.254.254.3.53411

    State: Oper; Msgs sent/rcvd: 22/21; Downstream

    Up time: 00:10:33

    LDP discovery sources:

      GigabitEthernet2/0, Src IP addr: 10.1.1.1

        Addresses bound to peer LDP Ident:

          10.0.0.1        10.254.254.1    10.1.1.1        

PE1#sho mpls ldp bindings

  lib entry: 10.0.0.0/30, rev 8

    local binding:  label: 17

    remote binding: lsr: 10.254.254.1:0, label: imp-null

  lib entry: 10.1.1.0/30, rev 4

    local binding:  label: imp-null

    remote binding: lsr: 10.254.254.1:0, label: imp-null

  lib entry: 10.2.1.0/30, rev 6

    local binding:  label: 16

    remote binding: lsr: 10.254.254.1:0, label: 17

  lib entry: 10.254.254.1/32, rev 12

    local binding:  label: 19

    remote binding: lsr: 10.254.254.1:0, label: imp-null

  lib entry: 10.254.254.2/32, rev 10

    local binding:  label: 18

    remote binding: lsr: 10.254.254.1:0, label: 16

  lib entry: 10.254.254.3/32, rev 2

    local binding:  label: imp-null

    remote binding: lsr: 10.254.254.1:0, label: 18

  lib entry: 10.254.254.4/32, rev 14

    local binding:  label: 20

    remote binding: lsr: 10.254.254.1:0, label: 19

PE1#

With this, you now have a fully-functional "service provider" MPLS network. Your core is up, your PE routers are up, they are all sharing routes, and they have created LDP bindings between the routers. Sweet! All we need now are some customers to connect to our network so that the provider edge routers can start earning their keep ;)

This is where things start to get fun. Suppose the CIO for Perpetual Motion, Inc., an alternative energy provider, approaches you for connectivity across your network. You will turn up an interface for Perpetual Motion on both PE1 and PE2, and create a VRF to isolate Perpetual Motion's network instance from both your own network, as well as from any future customers' networks. Your network now looks like this...:

...with the following config changes on PE1 and PE2:
PE1:

 
PE1(config)#ip vrf PERPETUAL

PE1(config-vrf)#rd 65000:20

PE1(config-vrf)#route-target both 65000:20

PE1(config-vrf)#int fa0/0

PE1(config-if)#no ip addr

PE1(config-if)#no shut

PE1(config-if)#int fa0/0.20

PE1(config-subif)#encap dot1q 20

PE1(config-subif)#ip vrf forwarding PERPETUAL

PE1(config-subif)#ip addr 100.64.20.1 255.255.255.252

PE1(config-subif)#no shut

PE2:


PE2(config)#ip vrf PERPETUAL

PE2(config-vrf)#rd 65000:20

PE2(config-vrf)#route-target both 65000:20

PE2(config-vrf)#int fa0/0

PE2(config-if)#no ip addr

PE2(config-if)#no shut

PE2(config-if)#int fa0/0.20

PE2(config-subif)#encap dot1q 20

PE2(config-subif)#ip vrf forwarding PERPETUAL

PE2(config-subif)#ip addr 100.64.20.5 255.255.255.252

PE2(config-subif)#no shut

It isn't necessary to turn up a dot-1q encapsulated sub-interface here. We just as easily could turn up a new physical interface for every customer...until we ran out of physical interfaces. Since this is a lab in GNS3, it's not very likely that we would, in fact, run out of physical interfaces (unless you are far more ambitious than I, in which case, you do you!). However, this is pretty much how we provided service to customers at one of my former places of employment, given that SW1 and SW2 could be either actual Ethernet switches or some other kind of Metro-Ethernet network extender (Actelis, Accedian, AdTran, Cisco ME-3400, etc.) or combination thereof. Once the customer configures their routers, we should have point-to-point connectivity between CE1 and PE1, and between CE2 and PE2:

CE1:


CE1#sho run

interface Loopback0

 ip address 192.168.254.1 255.255.255.255

 ip ospf 1138 area 0.0.0.0

!

interface FastEthernet0/0

 ip address 192.168.1.1 255.255.255.0

 ip ospf 1138 area 0.0.0.0

!

interface FastEthernet1/1

 ip address 100.64.20.2 255.255.255.252

 ip ospf network point-to-point

 ip ospf 1138 area 0.0.0.0

!

router ospf 1138

 router-id 192.168.254.1

 log-adjacency-changes

 passive-interface FastEthernet0/0

 passive-interface Loopback0

!

^c
CE1#ping 100.64.20.1

Sending 5, 100-byte ICMP Echos to 100.64.20.1, timeout is 2 seconds:

.!!!!

Success rate is 80 percent (4/5), round-trip min/avg/max = 20/24/32 ms

CE1#

All that is left now is to set up routing between CE1 and CE2. On PE1 and PE2, we will set up an instance of OSPF to accept routes from CE1 and CE2, respectively:


PE1(config-subif)#router ospf 20 vrf PERPETUAL

PE1(config-router)#router-id 100.64.20.1

PE1(config-router)#network 100.64.20.0 0.0.0.3 area 0.0.0.0

PE1(config-subif)#

*Dec 16 14:09:43.579: %OSPF-5-ADJCHG: Process 20, Nbr 192.168.254.1 on FastEthernet0/0.20 from LOADING to FULL, Loading Done

PE1(config-subif)#


CE1(config-if)#router ospf 1138

CE1(config-router)#router-id 100.64.20.2

CE1(config-router)#network 100.64.20.0 0.0.0.3 area 0.0.0.0 

CE1(config-router)#int lo0

CE1(config-if)#ip ospf 1138 area 0.0.0.0

CE1(config-if)#int fa0/0

CE1(config-if)#ip ospf 1138 area 0.0.0.0

Now, does it work?


PE1#sho ip route vrf PERPETUAL

...

Gateway of last resort is not set



     100.0.0.0/30 is subnetted, 1 subnets

C       100.64.20.0 is directly connected, FastEthernet0/0.20

     192.168.254.0/32 is subnetted, 1 subnets

O       192.168.254.1 [110/2] via 100.64.20.2, 00:01:40, FastEthernet0/0.20

O    192.168.1.0/24 [110/2] via 100.64.20.2, 00:01:30, FastEthernet0/0.20

PE1#

Looks good! We've got the loopback and Fa0/0 IP addresses in our routing table, so as you can see, all we need to do to set up a customer routing instance on our PE routers is to append "vrf <VRF NAME> to the end of the "router ospf..." statements.

The last step is to set up a multiprotocol BGP process between PE1 and PE2 so that they can share the customer routes between them, then configure redistribution to the OSPF process in the customer VRF. If that sounds complicated, don't worry; it's really not terribly difficult:

PE1:


PE1(config)#router bgp 65000

PE1(config-router)#no synch

PE1(config-router)#neighbor 10.254.254.4 remote-as 65000

PE1(config-router)#neighbor 10.254.254.4 update-source Loopback0

PE1(config-router)#address-family vpnv4

PE1(config-router-af)#neighbor 10.254.254.4 activate

PE1(config-router-af)#neighbor 10.254.254.4 send-community extended

PE1(config-router-af)#exit

PE1(config-router)#address-family ipv4 vrf PERPETUAL

PE1(config-router-af)#redist ospf 20 vrf PERPETUAL

PE1(config-router-af)#no synch

PE1(config-router-af)#exit

PE1(config-router)#exit

PE1(config)#router ospf 20 vrf PERPETUAL

PE1(config-router)#redist bgp 65000 subnets

PE2:


PE2(config)#router bgp 65000

PE2(config-router)#no sync

PE2(config-router)#neighbor 10.254.254.3 remote-as 65000

PE2(config-router)#neighbor 10.254.254.3 update-source Loopback0

PE2(config-router)#address-family vpnv4

PE2(config-router-af)#neighbor 10.254.254.3 activate

PE2(config-router-af)#neighbor 10.254.254.3 send-community extended

PE2(config-router-af)#exit

PE2(config-router)#address-family ipv4 vrf PERPETUAL

PE2(config-router-af)#redist ospf 20 vrf PERPETUAL

PE2(config-router-af)#no sync

PE2(config-router-af)#exit

PE2(config-router)#exit

PE2(config)#router ospf 20 vrf PERPETUAL

PE2(config-router)#redist bgp 65000 sub

PE2(config-router)#exit

Let's check our CE routers and see if they are propagating routes correctly:


CE1#sho ip route

Gateway of last resort is not set



     100.0.0.0/30 is subnetted, 2 subnets

C       100.64.20.0 is directly connected, FastEthernet1/1

O IA    100.64.20.4 [110/2] via 100.64.20.1, 00:02:43, FastEthernet1/1

     192.168.254.0/32 is subnetted, 2 subnets

O IA    192.168.254.2 [110/3] via 100.64.20.1, 00:02:43, FastEthernet1/1

C       192.168.254.1 is directly connected, Loopback0

C    192.168.1.0/24 is directly connected, FastEthernet0/0

O IA 192.168.2.0/24 [110/3] via 100.64.20.1, 00:02:43, FastEthernet1/1

CE1#

CE2:


CE2#sho ip route

Gateway of last resort is not set



     100.0.0.0/30 is subnetted, 2 subnets

O IA    100.64.20.0 [110/2] via 100.64.20.5, 00:02:27, FastEthernet1/1

C       100.64.20.4 is directly connected, FastEthernet1/1

     192.168.254.0/32 is subnetted, 2 subnets

C       192.168.254.2 is directly connected, Loopback0

O IA    192.168.254.1 [110/3] via 100.64.20.5, 00:02:27, FastEthernet1/1

O IA 192.168.1.0/24 [110/3] via 100.64.20.5, 00:02:27, FastEthernet1/1

C    192.168.2.0/24 is directly connected, FastEthernet0/0

CE2#

Yep, on CE1, I can see the Loopback and Fa0/0 IP addresses from CE2, and vice versa. It looks like MPLS is working properly, and like our routing processes are sharing routes in the proper VRF's.

By configuring the P, then PE and CE routers one at a time, it should be fairly obvious how each class of router differs from the others (at least, from a configuration standpoint). The CE routers are the simplest of all, in that they are completely agnostic about the underlying architecture of the service provider network. All they need to do is set up routing, either with a dynamic routing protocol like OSPF or via static routes, with the provider; no special configuration is required on the CE routers at all. Next, in order of complexity, are the P routers. The only additional configuration they require is the "mpls ip" statement in any interface that will be part of the MPLS core. Most of the magic happens in the PE routers, which is reflected in the relative complexity of the PE routers' configs. This is where we create the VRFs, set the route distinguisher and route targets, configure the VRF-aware routing protocols, and set up BGP to redistribute the routes across the core.

Advanced Cisco Routing: DMVPN -- Point-to-Multipoint VPN Tunneling

A few years ago, I used to work for a service provider that operated in rural Alaska. By lower-48 standards, our network wasn’t terribly large — or at least, the logical topology wasn’t terribly large; the physical topology covered a rather large geographical region. Our major hub was a huge, bustling metropolis of about 5,000 (!) people.

This site also was where we located our hub router for the network. We had an extension site in Anchorage (naturally, since that was where most of our employees lived and worked), the hub site, a PoP at the hub site hanging off the hub router, and then multiple PoPs scattered across our service area, also linked off of the hub router. Because our own management network was built across our own service-provider network, we set up VPN tunnels from the hub router to each and every one of our sub-tended sites to provide secure management access to our network. Conceptually, it was a very simple model (and honestly, it might have been the only model our equipment would support at the time), but if you think that configuring a separate VPN tunnel for each site could be a bit of a chore, you are exactly right.

As I’m sure you’ve guessed by now, there is A Better Way to achieve these goals, a way that makes configuring and managing multiple sites sub-tended off of a single hub much less time consuming. Allow me to introduce you to DMVPN’s (Dynamic Multipoint VPN’s). As always, we’ll start with our network diagram:

R1 through R4 will be our management network, with R1 being the hub and R2 — R4 being the spokes. R5, R6 and R7 are the service provider network. In the real network that I managed, we had static, default routes on R1 through R4, and ran OSPF on our provider network. In this lab, we will run OSPF internally on both networks, and peer the management and provider networks with BGP, since that is a more common scenario for most people (being both provider and customer is fairly unusual). Also, running OSPF over a DMVPN topology introduces a few wrinkles that are worth covering, but I’m getting ahead of myself ;)

For addressing, I’ll be using 100.64.x.x addresses in place of public IP address ranges, and 192.168.x.0/24 for the inside interfaces on my management network. I’ll use 172.16.x.x IP space for the tunnel addressing. On the "Internet" routers, I’ll use 100.64.254.x/32 for the Loopback IP addresses, while 10.254.254.x/32 will be the Loopback IP addresses on my management routers.

Still with me? Good! Let’s start by setting up basic connectivity to each router, starting with the Internet routers (since there is nothing new on them):

R5:


interface Loopback0

 ip address 100.64.254.5 255.255.255.255

 no shut

!

interface FastEthernet0/0

 ip address 100.64.0.1 255.255.255.252

 no shut

!

interface FastEthernet0/1

 ip address 100.64.0.5 255.255.255.252

 no shut

!

interface FastEthernet1/0

 ip address 100.64.0.9 255.255.255.252

 no shut

!

router ospf 1138

 router-id 100.64.254.5

 log-adjacency-changes

 passive-interface Loopback0

 network 100.64.0.0 0.0.0.3 area 0.0.0.0

 network 100.64.0.4 0.0.0.3 area 0.0.0.0

 network 100.64.0.8 0.0.0.3 area 0.0.0.0

 network 100.64.254.5 0.0.0.0 area 0.0.0.0

!

R6 and R7 are similar, and since there is nothing new here, I’ll skip those configs.

We’ll go ahead and configure the IP addressing on the FastEthernet and Loopback interfaces of R1, R2, R3 and R4 next. Again, nothing new, and nothing exciting, so I won’t belabor the config here, but make sure R1, R2, R3 and R4 can ping their respective gateways before proceeding. Once point-to-point connectivity between the management network and the service provider network is working, we’ll set up BGP peering between R1 and R5, R2 and R6, R3 and R5, and finally, R4 and R7:

R1:


router bgp 65511

 bgp router-id 100.64.0.2

 network 100.64.0.0 mask 255.255.255.252

 neighbor 100.64.0.1 remote-as 65512

 neighbor 100.64.0.1 activate

R5:


router bgp 65512

 bgp router-id 100.64.0.1

 network 100.64.0.0 mask 255.255.255.252

 neighbor 100.64.0.2 remote-as 65511

 neighbor 100.64.0.2 activate

 redist ospf 1138 metric 120

!

router ospf 1138
 redist bgp 65512 sub metric 120

!

As you can see, the configurations are almost identical, aside from swapping the AS’ in the "router bgp..." and "neighbor..." statements, and swapping the IP addresses in the "bgp router-id..." and "neighbor..." statements. Also, on R5, we are redistributing the routes learned via BGP into OSPF. We are also redistributing OSPF routes into the BGP process. R6 and R7 will be configured similarly to R5, and R2, R3 and R4 will be configured similarly to R1. Again, nothing new so far.

But now, things will start to get interesting. Let’s set up the GRE tunnel on R1:


interface Tunnel0

 ip address 172.16.0.1 255.255.255.0

 ip nhrp map multicast dynamic

 ip nhrp network-id 1

 tunnel source 100.64.0.2

 tunnel mode gre multipoint

 no shut

!

Just like a normal GRE tunnel, we start with "interface Tunnel <blah>", and assign an IP address to the tunnel interface. Unlike a normal tunnel interface, we are assigning a /24. You can use whatever size subnet you want, but since it is a multipoint tunnel, it should probably be larger than a /30. The "tunnel source..." statement should look familiar also (if not, see the GRE Tunnel lab for a refresher).

However, there are a few differences between a DMVPN tunnel config and a standard, point-to-point tunnel config. One of the first things you’ll likely notice is that we have not specified any of the opposite endpoints. Instead, we used the command "tunnel mode gre multipoint" to explicitly state that we are creating a point-to-multipoint (hub-and-spoke) network. That’s the "dynamic" portion of the "Dynamic Multipoint VPN. Basically, the hub accepts tunnel requests from multiple spoke routers, and automatically establishes the tunnels on demand.

You'll also notice that, even though there are three spoke routers, the hub only has one tunnel interface. That's the "Multipoint" portion of the acronym ;) This raises a very interesting question. In a point-to-point circuit, it is trivial to determine the IP address of the next hop (if you are on a /30 or /31 network, there are only two usable IP addresses, and you are using one of them, right?). However, in a multipoint network, your tunnel interfaces are in a larger subnet. In our example, we are using a /24, meaning the other end of the tunnel could be any one of 253 possible IP addresses! How does the hub router know which IP address corresponds to which tunnel? If you look at the next two lines of the tunnel config, you’ll see the two "ip nhrp..." statements. NHRP ("Next Hop Resolution Protocol," see also CCIE or Null! for a good discussion on the topic) is the protocol that we use to determine the IP address of the other side of the multipoint tunnel. In much the same way that ARP maps IP addresses to Ethernet addresses, NHRP allows our routers to dynamically map IP addressing to the multipoint tunnels. In the "ip nhrp map multicast dynamic" statement, we are telling NHRP to dynamically create these mappings for our multipoint tunnels. However, you might have multiple tunnels on any given router, so by specifying different network ID's with the "ip nhrp network-id ..." statement, you can create multiple hub-and-spoke networks without them conflicting with one another. That’s it for the hub router. That wasn't too bad, was it?

We’ll use R2 as an example of the spoke router configuration; R3 and R4 will be very similar:

R2:


interface Tunnel0

 ip address 172.16.0.2 255.255.255.0

 ip nhrp map 172.16.0.1 100.64.0.2

 ip nhrp map multicast 100.64.0.2

 ip nhrp network-id 1

 ip nhrp nhs 172.16.0.1

 tunnel source 100.64.0.14

 tunnel mode gre multipoint

 no shut

!

Like the hub router, the spoke router contains the "ip address...," "tunnel source..." and "tunnel mode gre multipoint commands." It also contains a handful of "ip nhrp..." statements, but they are slightly more complex. First, the spoke router must know how to reach the hub router in order to send the tunnel request, so we start by telling the tunnel to create a connection to the IP address of the hub router’s outside interface (the tunnel source on the hub router). In other words, to reach 172.16.0.1 (the tunnel IP address on R1) use 100.64.0.2 (Fa1/0 on R1). Next, "ip nhrp map multicast 100.64.0.2" sets 100.64.0.2 (Fa1/0 on R1) as the destination for multicast or broadcast packets sent across the non-broadcast, multi-access, or NBMA, (ie., the DMVPN) network. If multicast or broadcast packets are sent across the NBMA network, R1 is responsible for forwarding them to other hosts participating in the network, so we are telling the tunnel interface to forward those packets to R1. The last new command on the spoke router is the "ip nhrp nhs 172.16.0.1" statement. With this line, we are telling the spoke router to use the "next-hop server" to forward traffic across the NBMA network.

Substitute the appropriate values for the IP address and tunnel source on R3 and R4, and you should have working tunnels between R1 and each of the spoke routers. To verify this, use the "sho dmvpn" command:


R2#sho dmvpn

Legend: Attrb --> S - Static, D - Dynamic, I - Incomplete

    N - NATed, L - Local, X - No Socket

    # Ent --> Number of NHRP entries with same NBMA peer

    NHS Status: E --> Expecting Replies, R --> Responding

    UpDn Time --> Up or Down Time for a Tunnel

==========================================================================



Interface: Tunnel0, IPv4 NHRP Details 



IPv4 NHS: 172.16.0.1 RE

Type:Spoke, Total NBMA Peers (v4/v6): 1



# Ent  Peer NBMA Addr Peer Tunnel Add State  UpDn Tm Attrb    Target Network

----- --------------- --------------- ----- -------- ----- -----------------

    1     100.64.0.2      172.16.0.1    UP 01:21:34    S      172.16.0.1/32





R2#

As you can see from the snippet of output above, R2 has now established a tunnel connection to R1 (tunnel state is "Up" and next to "IPV4 NHS, we have the IP address of int tunnel0 on R1, followed by the flags "RE," verifying that the tunnel is responding and expecting replies). After duplicating the tunnel config on R3 and R4, you should see similar output on those routers, although each router will only show the connection to R1. This is a point-to-multipoint network, meaning that R2 cannot talk directly to R3 without going through R1 (sort of...actually R1 can broker connections between the spokes, but honestly, I’m not comfortable enough with the topic to go there yet). Assuming that you have copied the modified version of R2’s tunnel config to R3 and R4, you should have a completed point-to-multipoint VPN network now (w00t!). However, if you try to ping from the inside interface of R2, R3 or R4 to the inside interface of R1, you will most likely not be thrilled with the result:


R4#ping 192.168.1.1 source 192.168.4.1

<...snip...>

..... 

Success rate is 0 percent (0/5)

R4#

Any ideas why? Of course! We haven’t set up any routing between the inside networks. When we configured BGP peering between the management network and the service provider network, we only advertised the outside interfaces of R1, R2, R3 and R4, since our service provider should not be aware of the inner workings of our network (unless we are using MPLS). In order to actually send traffic across the VPN tunnels, we need to enable a routing protocol over the tunnels. Easy enough, right? It should look something like this...:

R1:


router ospf 42

 router-id 10.254.254.1

 passive-interface default

 no passive-interface Tunnel0

 network 10.254.254.1 0.0.0.0 area 0.0.0.0

 network 172.16.0.0 0.0.0.255 area 0.0.0.0

 network 192.168.1.0 0.0.0.255 area 0.0.0.0

Again, after making the appropriate substitutions for the router-id and the advertised networks, we’ll make the same changes on the spoke routers, and...what is going on here?


R1(config)#

*Dec 13 15:44:15.947: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.3 on Tunnel0 from LOADING to FULL, Loading Done

*Dec 13 15:44:16.391: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.3 on Tunnel0 from FULL to DOWN, Neighbor Down: Adjacency forced to reset

R1(config)#

*Dec 13 15:44:20.483: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.2 on Tunnel0 from INIT to DOWN, Neighbor Down: Adjacency forced to reset

*Dec 13 15:44:20.535: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.3 on Tunnel0 from EXSTART to DOWN, Neighbor Down: Adjacency forced to reset

*Dec 13 15:44:20.699: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.4 on Tunnel0 from LOADING to FULL, Loading Done

R1(config)#

*Dec 13 15:44:22.735: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.4 on Tunnel0 from FULL to DOWN, Neighbor Down: Adjacency forced to reset

R1(config)#

*Dec 13 15:44:25.307: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.3 on Tunnel0 from LOADING to FULL, Loading Done

*Dec 13 15:44:25.555: %OSPF-5-ADJCHG: Process 42, Nbr 10.254.254.3 on Tunnel0 from FULL to DOWN, Neighbor Down: Adjacency forced to reset

R1(config)#

Why is OSPF flapping?!?!

This is where I start to get in a little bit over my head. If I understand correctly, the issue is that OSPF is aware of the type of network configured across each OSPF-aware link. In this case, we have configured a NBMA network via multipoint GRE tunnels, but OSPF considers GRE tunnels to be a point-to-point network. While we had OSPF only running between R1 and R2, this was fine, but as soon as OSPF sees two neighbors across a single "point-to-point" link, it gets confused (understandably so!) and drops the neighbor relationship. To resolve this problem, at least under certain circumstances, make the following change to R1, R2, R3 and R4:


int tunnel 0

 ip ospf network point-to-multipoint

Now that OSPF understands that int Tunnel0 is actually part of a multipoint network, it will allow all three spoke routers to participate in OSPF across the tunnel. Run "sho ip route" and "sho ip ospf neighbor" to verify that everything is working as expected (it should be), and you should be good to go!

Thursday, October 27, 2016

IPv6 Intro: BGP, OSPF and IPv6...or Maybe Just EIGRP and IPv6, smh

Being a network engineer is kind of like being Sisyphus. Just when you think you you're starting to get to the top of your game, someone moves the target on you. In fact, a writer by the name of Spencer Johnson (M.D.) wrote a book on that subject not quite 20 years ago, and even though I've never read the book, I'd guess that it's at least as relevant today as it was then. Case in point, even though I've used IPv4, OSPF, and EIGRP professionally for years, I don't have a lot of professional experience with IPv6 or BGP. To address that problem, I set up the following network in GNS3 for playing with IPv4 and IPv6 with multiple routing protocols on emulated Cisco 7200 routers:

We have R1, R2 and R3 as routers within an autonomous system, R4 a random (IPv4-only) Internet router and R5 as another (IPv6-only, this time) random Internet router. My intent was to set up BGP peering between R1, R4 and R5, and to have R1, R2 and R3 share routes via OSPF. Sounds easy enough, right?

Hahaha...no.

In a previous lab, we set up OSPFv3 (OSPF for IPv6) on Cisco 3640 routers, so I used those instruction to (try to) set up OSPFv3 on the 7200 routers:


R1(config)#int gig0/0

R1(config-if)#ipv6 ospf 42 area 0.0.0.0

                   ^

% Invalid input detected at '^' marker.



R1(config-if)#ipv6 o?

% Unrecognized command

R1(config-if)#ipv6 ?

IPv6 interface subcommands:

  address             Configure IPv6 address on interface

  authentication      authentication subcommands

...

  multicast           multicast

  nat                 Enable IPv6 NAT on interface

  nd                  IPv6 interface Neighbor Discovery subcommands

  next-hop-self       Configures IP-EIGRP next-hop-self

  policy              Enable IPv6 policy routing

  redirects           Enable sending of ICMP Redirect messages

  rip                 Configure RIP routing protocol

...

Okay...is OSPFv3 not supported on this router? As it turns out, I think it actually is, but I'll save that for another lab.

Edit: no, it's not. I mean, it is, but it isn't. The "hooks" are there to configure OSPFv3 using "ipv6 ospf <process ID>" in global configuration, but you have to have an Advanced IP Services image to run it. The SP Services image I am running isn't licensed for it, because after all, what service provider would run OSPF on their network (answer: every one I've ever worked at), grrr...

For now, I decided to try to set up EIGRP for IPv6 since, 1) it *is* supported on the 7200, and 2) it did not seem to be supported on the 3640's:


R1(config)#int gig0/0

R1(config-if)#ipv6 router eigrp 10

R1(config-rtr)#int gig1/0

R1(config-if)#ipv6 router eigrp 10

R1(config-rtr)#exit

R1(config)#exit

R1#sho run int gig0/0

Building configuration...



Current configuration : 201 bytes

!

interface GigabitEthernet0/0

 ip address 66.223.227.5 255.255.255.252

 duplex full

 speed 1000

 media-type gbic

 negotiation auto

 ipv6 address 2001:C0:FFEE:2::1/126

 ipv6 enable

 ipv6 eigrp 10

end

R1#sho ipv6 route

IPv6 Routing Table - Default - 8 entries

Codes: C - Connected, L - Local, S - Static, U - Per-user Static route

       M - MIPv6, R - RIP, I1 - ISIS L1, I2 - ISIS L2

       IA - ISIS interarea, IS - ISIS summary, D - EIGRP, EX - EIGRP external

C   2001:C0:FFEE::/126 [0/0]

     via GigabitEthernet1/0, directly connected

L   2001:C0:FFEE::1/128 [0/0]

     via GigabitEthernet1/0, receive

C   2001:C0:FFEE:2::/126 [0/0]

     via GigabitEthernet0/0, directly connected

L   2001:C0:FFEE:2::1/128 [0/0]

     via GigabitEthernet0/0, receive

LC  2001:C0:FFEE:254::1/128 [0/0]

     via Loopback0, receive

C   2016:FA:1::/64 [0/0]

     via FastEthernet6/0.20, directly connected

L   2016:FA:1::1/128 [0/0]

     via FastEthernet6/0.20, receive

L   FF00::/8 [0/0]

     via Null0, receive

R1#

Weird...why are none of my EIGRP routes showing up? I could ping across the interfaces and my IPv4 routing protocols were working as expected, but I could not get EIGRP in IPv6 to form neighbor adjacencies. What gives?

I started troubleshooting EIGRP using essentially the same toolkit I would use for IPv4...:


R1#sho ipv6 eigrp 10 neigh

IPv6-EIGRP neighbors for process 10

% EIGRP 10 is in SHUTDOWN

R1#sho ipv6 eigrp 10 int  

IPv6-EIGRP interfaces for process 10

% EIGRP 10 is in SHUTDOWN

R1#

"EIGRP...is in SHUTDOWN?" I'm not familiar with that error message. WWGS ("What Would Google Say")? I quickly found a couple of tutorials on-line which showed that setting up EIGRP in IPv6 on a 7200 is a little different than setting OSPFv3 on a 3640 (go figure). Whereas OSPFv3 on a 3640 is entirely configured on the interface, EIGRP for IPv6 is a mix of interface-level commands and global config commands:


R1#conf t

Enter configuration commands, one per line.  End with CNTL/Z.

R1(config)#ipv6 router eigrp 10

R1(config-rtr)#router-id 10.254.254.1

R1(config-rtr)#redistribute connected 

R1(config-rtr)#passive-int default

R1(config-rtr)#no passive-int gig0/0

R1(config-rtr)#no passive-int gig1/0

R1(config-rtr)#no shut

02:35:27: %DUAL-5-NBRCHANGE: IPv6-EIGRP(0) 10: Neighbor FE80::C802:7FF:FE00:70 (GigabitEthernet1/0) is up: new adjacency

R1(config-rtr)#

02:35:38: %DUAL-5-NBRCHANGE: IPv6-EIGRP(0) 10: Neighbor FE80::C801:6FF:FEF0:70 (GigabitEthernet0/0) is up: new adjacency

R1(config-rtr)#exit

R1(config)#exit

After making eseentially the same changes on R2 and R3 (the interface names were different, but...), I saw my routes as expected:


R1#sho ipv6 route

IPv6 Routing Table - Default - 12 entries

Codes: C - Connected, L - Local, S - Static, U - Per-user Static route

       M - MIPv6, R - RIP, I1 - ISIS L1, I2 - ISIS L2

       IA - ISIS interarea, IS - ISIS summary, D - EIGRP, EX - EIGRP external

C   2001:C0:FFEE::/126 [0/0]

     via GigabitEthernet1/0, directly connected

L   2001:C0:FFEE::1/128 [0/0]

     via GigabitEthernet1/0, receive

C   2001:C0:FFEE:2::/126 [0/0]

     via GigabitEthernet0/0, directly connected

L   2001:C0:FFEE:2::1/128 [0/0]

     via GigabitEthernet0/0, receive

D   2001:C0:FFEE:3::/64 [90/28416]

     via FE80::C802:7FF:FE00:70, GigabitEthernet1/0

LC  2001:C0:FFEE:254::1/128 [0/0]

     via Loopback0, receive

D   2001:C0:FFEE:254::2/128 [90/130816]

     via FE80::C801:6FF:FEF0:70, GigabitEthernet0/0

D   2001:C0:FFEE:254::3/128 [90/130816]

     via FE80::C802:7FF:FE00:70, GigabitEthernet1/0

D   2001:C0:FFEE:2222::/64 [90/28416]

     via FE80::C801:6FF:FEF0:70, GigabitEthernet0/0

C   2016:FA:1::/64 [0/0]

     via FastEthernet6/0.20, directly connected

L   2016:FA:1::1/128 [0/0]

     via FastEthernet6/0.20, receive

L   FF00::/8 [0/0]

     via Null0, receive

R1#

Well that was more cumbersome than it should have been, but <shrug>. At least we've got EIGRP working now. BGP via IPv4 is nothing new, so I won't waste a lot of time discussing the BGP configuration for R1-R4. However, the IPv6 configuration between R1 and R5 had me swearing at Cisco:


R1(config)#router bgp 65511

R1(config-router)# neighbor 2016:FA:1::5 remote-as 65515

R1(config-router)# address-family ipv6

R1(config-router-af)#  network 2001:C0:FFEE:254:0:0:0:1/128

R1(config-router-af)#  network 2001:C0:FFEE:254:0:0:0:2/128

R1(config-router-af)#  network 2001:C0:FFEE:254:0:0:0:3/128

R1(config-router-af)#  network 2001:C0:FFEE:2:0:0:0:0/126

R1(config-router-af)#  network 2001:C0:FFEE:0:0:0:0:0/126

R1(config-router-af)#  network 2001:C0:FFEE:2222:0:0:0:0/64

R1(config-router-af)#  network 2001:C0:FFEE:3:0:0:0:0/64

R1(config-router-af)#  neighbor 2016:FA:1::5 activate

% BGP context not been initialized properly.

R1(config-router-af)# exit

R1(config-router)#exit

R1(config)#exit

R1#sho bgp ipv6 unicast neighbors



R1#sho run | begin router bgp

router bgp 65511

 bgp router-id 10.254.254.1

 bgp log-neighbor-changes

 neighbor 2016:FA:1::5 remote-as 65515

 neighbor 209.193.4.4 remote-as 65514

 !

 address-family ipv4

  neighbor 209.193.4.4 activate

  no auto-summary

  no synchronization

  network 10.254.254.1 mask 255.255.255.255

  network 10.254.254.2 mask 255.255.255.255

  network 10.254.254.3 mask 255.255.255.255

  network 66.223.224.0 mask 255.255.255.224

  network 66.223.224.32 mask 255.255.255.224

  network 66.223.227.0 mask 255.255.255.252

  network 66.223.227.4 mask 255.255.255.252

  network 209.193.4.0

 exit-address-family

!

ip forward-protocol nd

...

Wait, where's my "address-family ipv6" entries, and what's with that "BGP context has not been initialized properly" error message? I went back to the Great Oracle of Google, where I found this little tidbit of information:


Q. Error message: "% BGP context not been initialized properly." when Configuring neighbor under address-family IPv6



A. The issue is with the feature set. If the feature set is SP services, the following services are not supported.



IPv6 Routing: Multiprotocol BGP Extensions for IPv6
IPv6 Routing: Multiprotocol BGP Link-local Address Peering


To use these features,change the feature set to Advanced Enterprise Services.

Okay, let's check the code version on my routers:


R1#sho ver

Cisco IOS Software, 7200 Software (C7200-SPSERVICESK9-M), Version 12.4(24)T4, RELEASE SOFTWARE (fc2)

Well, <expletive deleted>! Since I don't have an Advanced IP Services image laying around, that pretty much kills the BGP portion of this lab for now.

I went ahead and removed the BGP portion and played with EIGRP across the network, but I'm slightly miffed by the fact that I couldn't do any testing with BGP or OSPF under IPv6, since IPv6 is now a part of certification testing. With adoption of IPv6 "in the wild" still lagging, it would be nice to be able to mock such networks up in a lab without spending a fortune in hardware and software licensing.

Sunday, October 23, 2016

Lesson 16: VRRP

In the last lesson that I wrote while working on my CCNA certification, I introduced the concept of router redundancy via a Cisco proprietary protocol known as HSRP, or "Hot Standby Router Protocol." However, HSRP is not the only way to create a redundant data connection for your office. In this lab, we'll look at a second, similar protocol known as VRRP, or "Virtual Router Redundancy Protocol."

Disclaimer:
The configuration document I used to play with VRRP in this lab didn't work exactly as advertised on the routers I was emulating. In fairness, Cisco 3640 routers are decidedly, ummmm, "old-school" (read that: obsolete), so it's entirely possible that the syntax has changed on more modern platforms that are running more recent versions of IOS. However, what I present here should be close enough to get you started. Here (pdf) is the link to the Cisco document with the slightly different syntax.

As usual, we'll start with the network diagram:

We'll set up lo0 and fa1/0 on R1 and R2 as normal, R4 exists only to act as a DHCP server, and R3 serves as a destination network provider. We'll establish OSPF between R1, R2 and R3, using network statements for 100.64.1.0/30 and 100.64.2.0/30 and using "redistribute connected subnets." On our client, "Knoppix Clone 1," we'll set the default gateway to 100.64.0.1/29. So far, nothing unexpected, right?

Just to recap, the problem we want to solve is, what happens when fa0/0 goes down on our default gateway? If R2 did not exist in this network, then R1 is our single point of failure. If we lose R1, then the clients on our LAN can no longer reach the servers hanging off of R3. To address this, we set up two routers in parallel so that we have a redundant path to R3. However, there is no way to tell a client PC (or router or...) to use multiple default gateways. HSRP and VRRP were designed to address this problem. In both scenarios, you configure a single default gateway on your client network, then use either HSRP or VRRP to shuffle that default gateway address between multiple routers. To set it up, you...:

Enter configuration mode;
Switch to the interface facing your client LAN;
Add an IP address within the subnet of your client LAN;
Configure a meaningful description of the VRRP group;
Configure the client's default gateway address in the VRRP group;
Set the VRRP priority for the router (a higher value takes priority over a lower value);
Set the VRRP advertisement and preempt delay timers.

Here's how the configuration looks on R1...:


interface FastEthernet0/0

 ip address 100.64.0.2 255.255.255.248

 vrrp 10 description VRRP Group

 vrrp 10 ip 100.64.0.1

 vrrp 10 preempt delay minimum 3

 vrrp 10 priority 254

end

...and on R2:


interface FastEthernet0/0

 ip address 100.64.0.3 255.255.255.248

 vrrp 10 description VRRP Group

 vrrp 10 ip 100.64.0.1

 vrrp 10 preempt delay minimum 3

 vrrp 10 priority 128

end

NOTE:
I also added the following line to the config...:


R1(config-if)#vrrp 10 timers advertise 1

...to set VRRP to send an "advertisement" every second. However, this is the default behaviour for VRRP, and therefore, it didn't show up in the config until I changed it for testing. Anyway, does it work?

Let's traceroute from R4 to R3:


R4#traceroute 10.254.254.3



Type escape sequence to abort.

Tracing the route to 10.254.254.3



  1 100.64.0.2 4 msec 4 msec 4 msec

  2 100.64.1.2 12 msec 8 msec 8 msec

R4#

Now, if I shut down fa0/0 on R1, I should see a short interruption in service, followed by R2 picking up the traffic:


R4#ping 10.254.254.3 repeat 100



Type escape sequence to abort.

Sending 100, 100-byte ICMP Echos to 10.254.254.3, timeout is 2 seconds:

!!!!!!!!!!!!!..!.!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Success rate is 97 percent (97/100), round-trip min/avg/max = 4/11/36 ms

R4#traceroute 10.254.254.3  
   


Type escape sequence to abort.

Tracing the route to 10.254.254.3



  1 100.64.0.3 8 msec 4 msec 8 msec

  2 100.64.2.2 8 msec 12 msec 8 msec

R4#

Notice how the first hop originally was 100.64.0.2, but now it's 100.64.0.3? VRRP has failed over the virtual 100.64.0.1 IP address from R1 to R2, which is reflected in the traceroute output.

With a little effort, we can see what's happening at the Ethernet level, too, and it's even more interesting. We'll start by verifying our configuration. R1 is currently the VRRP master (i.e., it's hosting the IP address 100.64.0.1), and R2 is the backup:


R1#sho vrrp brief

Interface          Grp Pri Time  Own Pre State   Master addr     Group addr

Fa0/0              10  254 3007       Y  Master  100.64.0.2      100.64.0.1     

R1#

----------------------------------------------------------------------------



R2#sho vrrp brief

Interface          Grp Pri Time  Own Pre State   Master addr     Group addr

Fa0/0              10  128 3500       Y  Backup  100.64.0.2      100.64.0.1     

R2#

First, we'll clear the arp table on Knoppix Client 1:

Now, we'll ping 10.254.254.3 (lo0 on R3) from Knoppix Clone 1:

Next, we shut down fa0/0 on R1, and verify that R2 is now the VRRP master:


R1#sho vrrp brief

Interface          Grp Pri Time  Own Pre State   Master addr     Group addr

Fa0/0              10  254 3007       Y  Init    0.0.0.0         100.64.0.1    
 
R1#



----------------------------------------------------------------------------



R2#sho vrrp brief

Interface          Grp Pri Time  Own Pre State   Master addr     Group addr

Fa0/0              10  128 3500       Y  Master  100.64.0.3      100.64.0.1     

R2#

...and ping R3 again:

Hmmm...since the VRRP virtual MAC address moves with the router, that doesn't give us much insight into what was actually happening here. Fortunately, I was running tcpdump to capture the Ethernet frames while running this test. After exporting the PCAP file to Wireshark, we can get a little better understanding of what happened here.

Note:
To keep the Wireshark screen captures relevant, I filtered out some of the chatter. We configured VRRP to send advertisements every second, for example, so I filtered out the VRRP protocol data. These routers were also running CDP, so I filtered that as well.

At the very beginning of the capture, we can see Knoppix Client 1 ("CadmusCo_d3:7c:8f") send an arp request for 100.64.0.1, and we can see R1 (cc:00:2a:af:00:00) send an arp reply, stating that the VRRP virtual MAC address "00:00:5e:00:01:0a" is associated with 100.64.0.1:

Then, we ping R3 through R1. As you can see, we sent the ICMP request to the VRRP virtual MAC...:

...but received the reply from the MAC of fa0/0 on R1 (cc:00:2a:af:00:00):

At this point, we shut down fa0/0 on R1, and allowed R2 to take ownership of 100.64.0.1. Since VRRP also transports the virtual MAC, our next ping will still be sent to 00:00:5e:00:01:0a...:

...but this time, our reply has come from the MAC address of fa0/0 on R2 (cc:01:2a:af:00:00):

Because the MAC address doesn't change, we don't have to wait until the arp cache on connected devices times out for traffic to use the new path. This can be a serious problem, in some cases. For example, if you are using a Cisco ASA to connect to a (non-VRRP) "highly-available" system, the default arp cache timeout period is FOUR HOURS, which means it can take up to four hours for your "highly-available" (cough) system to recover from a failover! This isn't just an academic, theoretical point, either. I am currently working a trouble ticket in my day job where this is exactly what's happening. Unfortunately, just shortening the arp cache timeout period can drive up CPU load and memory requirements on busy devices, so there is a balancing act to be found between automatic fail-over times and system resource utilization. VRRP neatly solves that problem by sidestepping the whole issue.