OSPF Costing Models

spin · May 11, 2016, 12:06pm

Continuing the discussion from Unable to route to 172.18.1.12 via KM:

It seems the time for a consistent OSPF costing checker/implementer is near. I am playing with the idea of doing a graph (in both the graphical and mathematical sense) of our OSPF backbone and pulling all costing inconsistencies.

These could include:

Inconsistencies in costing on the different sides of the same link
Inconsistencies in terms of costing ethernet and wireless.
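
A minimal sketch of the first check, assuming the per-interface costs have already been collected into (router, neighbour, cost) tuples. The tuple layout and the name `find_asymmetric_links` are made up for illustration and do not come from any existing tool.

```python
from typing import Dict, List, Tuple

# Hypothetical input: one (local_router, remote_router, cost) entry per
# interface, so each physical link shows up twice, once from each side.
Link = Tuple[str, str, int]


def find_asymmetric_links(links: List[Link]) -> List[Tuple[str, str, int, int]]:
    """Return (a, b, cost_a_to_b, cost_b_to_a) for links whose two sides disagree."""
    costs: Dict[Tuple[str, str], int] = {(a, b): c for a, b, c in links}
    mismatches = []
    for (a, b), cost_ab in costs.items():
        cost_ba = costs.get((b, a))
        # Report each pair once (a < b) and only when both sides are known.
        if a < b and cost_ba is not None and cost_ab != cost_ba:
            mismatches.append((a, b, cost_ab, cost_ba))
    return sorted(mismatches)


links = [
    ("nodeA", "nodeB", 10), ("nodeB", "nodeA", 10),  # consistent
    ("nodeA", "nodeC", 10), ("nodeC", "nodeA", 1),   # inconsistent
]
print(find_asymmetric_links(links))  # [('nodeA', 'nodeC', 10, 1)]
```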

I’m also thinking of then starting to suggest costs to admins where the actual costs deviate from some simple model.

Simple model as a start:

Ethernet 1
Wireless 10

Intermediate Model

Ethernet Gbit 1
Ethernet Mbit 2
Wifi 10 (maybe varying dual/single)

Complex Model

Ethernet varying by speed
Wifi varying by speed as well as “average” CCQ (different on each side of the link)

I’d like a friendly discussion on the topic with firmish suggestions on a simple initial model and speculative discussion on future models for estimating costs.

I imagine this could start as suggestions perhaps via wms emails of costs, but could become automatic changes via script. Thoughts on this?

Jypels · May 11, 2016, 1:24pm

Spin, I agree that we need to look into varying OSPF costings. For example, I have an AC link doing 702Mbps/702Mbps with 100%/100% CCQ and an average ping of under 1ms. Costing on the wireless is set to 2 in both directions as the link can handle the traffic. Maybe look at a script that auto-adjusts the costings based on sync speed and CCQ. Something like: if the CCQ is above 90% and sync is 120Mbps, make costing 8. If sync is 180Mbps with CCQ above 90%, make costing 6. If sync is 300Mbps with CCQ above 90%, make costing 4. If sync is over 300Mbps with CCQ above 90%, make costing 2…
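
A rough sketch of those tiers, assuming each quoted sync speed is the lower edge of its band (the post does not say this explicitly). Cases the post leaves open, such as CCQ at or below 90% or sync under 120Mbps, return None rather than a guessed cost. The function name is made up for illustration.

```python
from typing import Optional


def tiered_cost(sync_mbps: float, ccq_percent: float) -> Optional[int]:
    """Suggested cost per the tiers in the post, or None where the post gives no value."""
    if ccq_percent <= 90:
        return None  # the post only covers CCQ above 90%
    if sync_mbps > 300:
        return 2
    if sync_mbps >= 300:
        return 4
    if sync_mbps >= 180:  # assumption: 180 is the lower edge of this band
        return 6
    if sync_mbps >= 120:  # assumption: 120 is the lower edge of this band
        return 8
    return None  # slower links are not covered by the post


print(tiered_cost(702, 100))  # 2, the AC link from the post
```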

spin · May 11, 2016, 1:37pm

Perhaps I can suggest this:

Complex Model (down the line)

Set a reference bandwidth of 1Gbps to a cost of 1 (call this A).
Calculate the effective bandwidth of the interface based on its speed, be it 1Gbps, 100Mbps (or 10Mbps). For wireless use the sync speed but reduce it for poor CCQ. (Call this B.)

Cost then becomes A/B.

Examples:

Wireless link as per @Jypels' example would be 1000 / (702 * 100%) = 1.42 rounded up to 2.
Wireless link syncing at 20Mbps with 90% CCQ would be 1000 / (20 * 0.9) = 55.56 rounded up to 56.
Ethernet link at 100Mbps would be 1000/100=10.
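
The three examples above can be reproduced with a few lines of Python. This is only a sketch of the A/B idea: the function name and the choice to pass CCQ as a fraction (1.0 for Ethernet) are assumptions.

```python
import math

REFERENCE_MBPS = 1000  # A: 1Gbps maps to a cost of 1


def ospf_cost(speed_mbps: float, ccq: float = 1.0) -> int:
    """Cost = A / B, where B = speed * CCQ (CCQ as a fraction; 1.0 for Ethernet)."""
    if speed_mbps <= 0 or not 0 < ccq <= 1:
        raise ValueError("speed must be positive and CCQ in (0, 1]")
    effective_mbps = speed_mbps * ccq  # B
    return math.ceil(REFERENCE_MBPS / effective_mbps)  # always round up


print(ospf_cost(702, 1.0))  # 2   (AC wireless link)
print(ospf_cost(20, 0.9))   # 56  (slow wireless link)
print(ospf_cost(100))       # 10  (100Mbps Ethernet)
```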

A script could check these and email admins whose interfaces fall outside a range of these costs, or it could actually adjust them to be within range of these estimates? We could make the range wide initially, say +/- 50% of the estimated cost. I.e. the admin has discretion to adjust the costs in the three examples as follows:

1 - 3
28 - 84
5 - 15
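
A sketch of the range check itself, assuming the band is simply the estimate +/- 50%, widened outward to whole numbers and never below 1. Both function names are hypothetical.

```python
import math
from typing import Tuple


def allowed_range(estimate: int, tolerance: float = 0.5) -> Tuple[int, int]:
    """Inclusive (low, high) band around an estimated cost; OSPF costs start at 1."""
    low = max(1, math.floor(estimate * (1 - tolerance)))
    high = math.ceil(estimate * (1 + tolerance))
    return low, high


def needs_review(configured: int, estimate: int, tolerance: float = 0.5) -> bool:
    """True when the configured cost falls outside the allowed band."""
    low, high = allowed_range(estimate, tolerance)
    return not low <= configured <= high


print(allowed_range(2))      # (1, 3)
print(allowed_range(10))     # (5, 15)
print(needs_review(20, 10))  # True, so this interface would be flagged
```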

See this article for similar examples.

Azure · May 12, 2016, 7:59am

Graphire had worked out a nice, friendly and balanced calculation last year. Wonder if he still has it.
It took signal, CCQ and one or 2 other bits. Found it to work well

spin · May 12, 2016, 10:15am

@Graphire can you share here?

Graphire · May 12, 2016, 1:48pm

Hi guys

I used this formula

Link speed CCQ cost
1000 / linkspeed / ccq % = cost

example
270Mbps link speed, 90% CCQ = 4

Here is an Excel spreadsheet with the calculation if you want to use it

http://172.18.107.226/downloads/ospf/ospf%20calc.xlsx

spin · May 12, 2016, 1:59pm

Great minds think alike.

My proposal is 1000 / (speed * ccq) = 1000/speed/ccq which is the same as yours
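
A quick check that the two forms agree, using the 270Mbps / 90% CCQ example from the spreadsheet post. One detail still differs between the posts: that example gives 4, which matches rounding to nearest, while rounding up as in the earlier examples would give 5. Which rounding a script should use is not settled here.

```python
import math

speed_mbps, ccq = 270, 0.9

nested = 1000 / (speed_mbps * ccq)  # 1000 / (speed * ccq)
chained = 1000 / speed_mbps / ccq   # 1000 / speed / ccq

print(math.isclose(nested, chained))  # True: same formula
print(round(nested))                  # 4: round to nearest
print(math.ceil(nested))              # 5: always round up
```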

Jypels · May 12, 2016, 4:00pm

One small problem though. How would you adjust the costing on a Ubiquiti link running OSPF on the core instead of the wireless RB?

spin · May 12, 2016, 4:20pm

Not sure. Given that we are starting with just pulling the data, then recommending and only then auto changing I guess it’s not an immediate concern.

We do need to think about a Ubiquiti equivalent to WMS though.

spin · May 12, 2016, 5:19pm

Re-reading what you said, I presume the wireless RBs are in bridge mode for some reason? I don’t think that’s a great setup with this in mind.

I guess bridging wireless interfaces should not be allowed as it would mess with costing. You won’t be able to set the metric correctly for that. You’d need to set it at some weird average.

Jypels · May 12, 2016, 7:44pm

The wireless interface is bridged, yes, but it runs its own OSPF on the core of each node. You assign a /29 that both Ubiquiti devices use, as well as each core.

So for example you would have:

Node A core: 172.18.1.1
Node A Ubi: 172.18.1.2
Node B Ubi: 172.18.1.3
Node B core: 172.18.1.4

Each core has that assigned to a separate Ethernet port

spin · May 12, 2016, 8:09pm

Ah, so the cores are MikroTik?

Jypels · May 12, 2016, 8:15pm

Yes, the cores are MikroTik, but with a couple of Ubiquiti EdgePoints and EdgeRouters currently being deployed on the wug (as cores). We might want to look into scripts for them, if possible. @Tinuva any input on this? @Diabolix would know if the EdgePoint supports scripts, as he has set up the first EdgePoint on the wug, with the second one going live on Sunday.

spin · May 12, 2016, 8:16pm

They don’t need to have scripts. They just need ssh access

Tinuva · May 13, 2016, 7:38am

WMS will need to be updated to detect UBNT equipment though, if used as routing or wireless bridged

Possible, not easy though.

King · May 14, 2016, 5:29am

Back in the day I also wanted to visit this but the formula was to include distance as I thought that the longer links should only do traffic when necessary. You can see OSPF Costing if you want to check the opinions at the time.

I like the complex method and the formula that you’re using. The bridged UBNT devices will pose a challenge but an admin can manually calculate those when they’re setting up the links.

It will have to be based on reporting/automated emails and then manual applying of the changes. If it was automated the entire network would go down and who knows how the RBs will cope with the sudden rebuild of the routing table.

spin · May 16, 2016, 8:01am

I’m thinking CCQ is measuring link quality. So if the link is bad due to distance or frequency issues or anything else then it should be OK. There might be an argument in sparing long distance links a little if they tend to be over-utilised but this would increase hop count for that traffic which would increase latency.

We’d need to stop bridging on some of these so that we can clearly set the cost on each link separately. Or calculate some average cost for the bridge.

Yeah it probably can’t be done all at once. As I see it, it could be suggestions at first but it could be automated relatively easily. The script will just need to be set to update costings over time rather than in a short period of time to avoid this problem.
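
One way to spread the updates out, sketched under the assumption that pending changes are held as (router, interface, new_cost) tuples and that a scheduler applies one small batch per run. The tuple layout, the batch size and the function name are all hypothetical, and how the changes are actually pushed to the routers is left out.

```python
from typing import Iterator, List, Tuple

Change = Tuple[str, str, int]  # (router, interface, new_cost), hypothetical layout


def batches(changes: List[Change], per_run: int) -> Iterator[List[Change]]:
    """Yield the pending changes a few at a time, one batch per scheduled run."""
    if per_run < 1:
        raise ValueError("per_run must be at least 1")
    for start in range(0, len(changes), per_run):
        yield changes[start:start + per_run]


pending = [
    ("nodeA", "wlan1", 2),
    ("nodeA", "ether2", 10),
    ("nodeB", "wlan1", 2),
    ("nodeC", "wlan1", 56),
    ("nodeC", "ether1", 1),
]
for run, batch in enumerate(batches(pending, per_run=2), start=1):
    print(run, batch)  # three runs: 2, 2 and 1 changes
```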

spin · May 19, 2016, 10:55am

I’m currently putting together an OSPF monitoring script/thingy. Basically it pulls all information from the Link State Advertisement DB and puts together a model of our OSPF network. This is early stages at the moment.
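
A very small sketch of what such a model could look like, assuming the LSA DB has already been parsed into (advertising_router, neighbour, cost) rows. The row format and function names are assumptions, not the actual script. It builds a directed graph (costs are set per side) and runs Dijkstra, the shortest-path algorithm OSPF uses, to get the total cost from one router to every other. As a simplification it does not check that both sides of a link advertise each other.

```python
import heapq
from typing import Dict, List, Tuple

Lsa = Tuple[str, str, int]  # hypothetical parsed row: (router, neighbour, cost)
Graph = Dict[str, Dict[str, int]]


def build_graph(lsas: List[Lsa]) -> Graph:
    """Directed adjacency map: graph[router][neighbour] = cost on router's side."""
    graph: Graph = {}
    for router, neighbour, cost in lsas:
        graph.setdefault(router, {})[neighbour] = cost
        graph.setdefault(neighbour, {})  # make sure every router is a node
    return graph


def path_costs(graph: Graph, source: str) -> Dict[str, int]:
    """Dijkstra: lowest total cost from source to every reachable router."""
    best = {source: 0}
    queue = [(0, source)]
    while queue:
        cost, node = heapq.heappop(queue)
        if cost > best.get(node, float("inf")):
            continue  # stale queue entry
        for neighbour, link_cost in graph.get(node, {}).items():
            new_cost = cost + link_cost
            if new_cost < best.get(neighbour, float("inf")):
                best[neighbour] = new_cost
                heapq.heappush(queue, (new_cost, neighbour))
    return best


lsas = [
    ("A", "B", 10), ("B", "A", 10),
    ("B", "C", 1), ("C", "B", 1),
    ("A", "C", 56), ("C", "A", 56),
]
print(path_costs(build_graph(lsas), "A"))  # {'A': 0, 'B': 10, 'C': 11}
```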

Also found some older threads on the same topic in the mean time, which I’m linking here for reference:

spin · May 21, 2016, 8:27am

2 posts were split to a new topic: OSPF Visualisation