<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>You shot the invisible swordsman.</title>
	<atom:link href="http://youshottheinvisibleswordsman.co.uk/feed/" rel="self" type="application/rss+xml" />
	<link>http://youshottheinvisibleswordsman.co.uk</link>
	<description>A slightly technical blog by Stephen D. Strowes.</description>
	<lastBuildDate>Thu, 05 Jan 2012 21:19:29 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>That was the year that was &#8230; 2011.</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2012/01/05/that-was-the-year-that-was-2011/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=that-was-the-year-that-was-2011</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2012/01/05/that-was-the-year-that-was-2011/#comments</comments>
		<pubDate>Thu, 05 Jan 2012 21:19:29 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[life]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=657</guid>
		<description><![CDATA[If 2011 was anything, it was the year I submitted my PhD dissertation, and my dissertation was my focus essentially throughout. Long days and hard work involving coding, verifying outputs, making last-minute awkward bug fixes, reading text, re-reading text, and lots of coffee. Ultimately, a year featuring two publications (one presented, one to present in [...]]]></description>
			<content:encoded><![CDATA[<p>If 2011 was anything, it was the year I submitted my PhD dissertation, and my dissertation was my focus essentially throughout. Long days and hard work involving coding, verifying outputs, making last-minute awkward bug fixes, reading text, re-reading text, and  <a href="http://youshottheinvisibleswordsman.co.uk/2011/11/04/word-counts-and-caffeine-logs/">lots of coffee</a>. Ultimately, a year featuring two publications (one presented, one to present in early 2012), and a dissertation suitable for submission.</p>
<p>I did manage some travel, mostly work-related:</p>
<ul>
<li>Lerwick, Shetland, for Up Helly Aa. Indeed, my first vacation in some time, and the last I&#8217;d have prior to submitting.</li>
<li>Shanghai.</li>
<li>Prague.</li>
<li>San Francisco.</li>
<li>Abingdon, <a href="http://www.informatics.sussex.ac.uk/research/projects/ngn/cgi-bin/ngn05.cgi?pg=wshops/msn11/proc">of course</a>.</li>
<li>Dublin.</li>
</ul>
<p>On photography, not so much this year. A minor disaster: I accidentally dropped my shallow depth of field lens (<a href="http://www.dpreview.com/lensreviews/canon_50_1p8_ii_c16/">Canon EF 50mm F1.8 II</a>) only to watch it fall into pieces on contact with the floor. Fortunately, the lens itself is fine! But after reassembly, it&#8217;s clear the motor unit has been knackered. Still, some of my favourite shots from the last year or so:</p>
<p><a title="Viking, fire. by sdstrowes, on Flickr" href="http://www.flickr.com/photos/sdstrowes/5398775558/"><img src="http://farm6.staticflickr.com/5091/5398775558_d04923a791_s.jpg" alt="Viking, fire." width="75" height="75" /></a> <a href="http://www.flickr.com/photos/sdstrowes/6466249151/" title="Hub. by sdstrowes, on Flickr"><img src="http://farm8.staticflickr.com/7146/6466249151_dbba9ca1e3_s.jpg" width="75" height="75" alt="Hub."></a> <a href="http://www.flickr.com/photos/sdstrowes/6115962479/" title="Research Club, Hetherington House by sdstrowes, on Flickr"><img src="http://farm7.staticflickr.com/6195/6115962479_931a881391_s.jpg" width="75" height="75" alt="Research Club, Hetherington House"></a></p>
<p><a href="http://www.flickr.com/photos/sdstrowes/6092686903/" title="Freeway Entrance (East) by sdstrowes, on Flickr"><img src="http://farm7.staticflickr.com/6074/6092686903_69030f06c0_s.jpg" width="75" height="75" alt="Freeway Entrance (East)"></a> <a href="http://www.flickr.com/photos/sdstrowes/6085317241/" title="15th Street by sdstrowes, on Flickr"><img src="http://farm7.staticflickr.com/6086/6085317241_9f318b9985_s.jpg" width="75" height="75" alt="15th Street"></a> <a href="http://www.flickr.com/photos/sdstrowes/5965731662/" title="BrewDog Glasgow by sdstrowes, on Flickr"><img src="http://farm7.staticflickr.com/6022/5965731662_c91fa338c7_s.jpg" width="75" height="75" alt="BrewDog Glasgow"></a></p>
<p><a href="http://www.flickr.com/photos/sdstrowes/5662023637/" title="Lighthouse by sdstrowes, on Flickr"><img src="http://farm6.staticflickr.com/5101/5662023637_3dbbb06a63_s.jpg" width="75" height="75" alt="Lighthouse"></a></p>
<p>Finally some bloggy introspection: I modified my blog this year. I simplified the CSS for the wordpress theme to reduce it to one left-hand column with links, and one right-hand column for text. Only one blog post is shown at a time (many of my posts are long-form), and comments are only enabled on new posts for a short time before being closed off completely. This tidies this blog considerably, though I&#8217;d prefer something less heavyweight than wordpress to run the show. One day, I&#8217;ll swap out the back-end.</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2012/01/05/that-was-the-year-that-was-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IPv4 Address Space Utilisation</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2011/12/06/ipv4-address-space-utilisation/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ipv4-address-space-utilisation</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2011/12/06/ipv4-address-space-utilisation/#comments</comments>
		<pubDate>Tue, 06 Dec 2011 15:01:13 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[internet]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=643</guid>
		<description><![CDATA[I read earlier another warning from the EC that businesses should be looking to deploy IPv6 as soon as possible. We know, of course, that this is important: IANA ran out of /8 blocks almost a year ago, and APNIC&#8217;s allocation ran dry soon after. The others will follow suit soon. But how much of [...]]]></description>
			<content:encoded><![CDATA[<p>I read earlier <a href="http://www.ispreview.co.uk/story/2011/12/05/europe-warns-isps-and-businesses-to-act-now-and-adopt-ipv6-asap.html">another warning from the EC</a> that businesses should be looking to deploy IPv6 as soon as possible. We know, of course, that this is important: IANA ran out of /8 blocks almost a year ago, and APNIC&#8217;s allocation ran dry soon after. The others will <a href="http://ipv4.potaroo.net/">follow suit soon</a>. But how much of that is being used?</p>
<p>We are nearing 400,000 prefixes in the IPv4 BGP default-free zone. 398,135 are visible to <a href="http://archive.routeviews.org/bgpdata/">route-views2</a> in its latest table dump at time of writing. But the allocated address space is not indicative of actual utilisation, nor is the number of prefixes being held in full tables. When it comes to utilisation, it&#8217;s the amount of address space advertised by those prefixes that is more interesting.</p>
<p>The <a href="http://www.iana.org/assignments/ipv4-address-space">IPv4 address space</a> is primarily used for unicast addressing. IPv4 offers 32 bits per address, allowing 4,294,967,296 (2<sup>32</sup>) <em>potential</em> addresses. Some blocks of this space are marked for particular uses and cannot be routed globally. These blocks are:</p>
<ul>
<li>0.0.0.0/8 is for self-identification.</li>
<li>10.0.0.0/8 is for private networks.</li>
<li>127.0.0.0/8 is for loopback addresses.</li>
<li>169.254.0.0/16 is for link local addresses.</li>
<li>172.16.0.0/12 is for private networks.</li>
<li>192.168.0.0/16 is for private networks.</li>
<li>192.0.0.0/24 is for &#8220;IANA IPv4 Special Purpose Registry&#8221;.</li>
<li>192.88.99.0/24 is for 6-to-4 relay anycast addresses.</li>
<li>192.0.2.0/24 is reserved for examples in documentation (&#8220;TEST-NET-1&#8243;).</li>
<li>198.51.100.0/24 is reserved for examples in documentation (&#8220;TEST-NET-2&#8243;).</li>
<li>203.0.113.0/24 is reserved for examples in documentation (&#8220;TEST-NET-3&#8243;).</li>
<li>224.0.0.0/4 is for multicast.</li>
<li>240.0.0.0/4 is listed as for &#8220;Future Use&#8221;.</li>
</ul>
<p>That accounts for 588,383,488 addresses that are not globally routeable IPv4 space, leaving 3,706,583,808 from the total.</p>
<p>These are covered in more detail in <a href="http://tools.ietf.org/html/rfc5735">RFC 5735</a>. Get that &#8220;Future Use&#8221; block out of your mind right now; those sixteen /8s sure would be useful for a short time, but they&#8217;re effectively out of bounds. We <a href="http://tools.ietf.org/html/rfc6319#section-5.4">don&#8217;t know how much hardware explicitly drops packets purporting to have come from, or drops packets destined for, this range</a>.</p>
<p>To reach 100% utilitisation, these ~3.7 billion addresses would all have to be lit up, but we&#8217;re not quite there yet. Counting only the shortest prefixes advertised today that fully cover the advertised address space, there are 2,497,743,276 publicly routeable IPv4 addresses out there. (To be clear, since multiple small blocks from a parent block can be advertised over BGP, I count only the shortest prefixes and ignore the overlapping space beneath. Also, I&#8217;m simplifying and not discounting special addresses such as broadcast addresses.)</p>
<p>That is, today we&#8217;re at about 67.4% utilisation. <em>The Internet is two-thirds full</em>. The last time I checked this, perhaps 2 years ago, we were closer to 50%. I should draw a graph.</p>
<p>Of course, I should make clear that I am not suggesting we should be aiming for full utilisation. Equally, &#8220;utilisation&#8221; here doesn&#8217;t indicate actual use; much of the space advertised is possibly unused, suggesting more efficient packing could take place. But it&#8217;s certainly utilised in the sense that it cannot be used by anybody else.</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2011/12/06/ipv4-address-space-utilisation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>University of Glasgow LaTeX dissertation style</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2011/11/09/university-of-glasgow-latex-dissertation-style/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=university-of-glasgow-latex-dissertation-style</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2011/11/09/university-of-glasgow-latex-dissertation-style/#comments</comments>
		<pubDate>Wed, 09 Nov 2011 20:10:46 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[life]]></category>
		<category><![CDATA[linux]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=639</guid>
		<description><![CDATA[When I started to write my dissertation, there was no LaTeX style available that was compliant with the University guidelines. (For shame!) So, I made one: https://github.com/sdstrowes/Glasgow-Thesis-Template Share, fix, and feed back please!]]></description>
			<content:encoded><![CDATA[<p>When I started to write my dissertation, there was no LaTeX style available that was compliant with the University guidelines. (For shame!)</p>
<p>So, I made one: <a href="https://github.com/sdstrowes/Glasgow-Thesis-Template">https://github.com/sdstrowes/Glasgow-Thesis-Template</a></p>
<p>Share, fix, and feed back please!</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2011/11/09/university-of-glasgow-latex-dissertation-style/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Word counts and caffeine logs</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2011/11/04/word-counts-and-caffeine-logs/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=word-counts-and-caffeine-logs</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2011/11/04/word-counts-and-caffeine-logs/#comments</comments>
		<pubDate>Fri, 04 Nov 2011 13:05:45 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[life]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=599</guid>
		<description><![CDATA[I submitted my PhD dissertation at the start of this week. Dissertation writing is a long, slow process. Who knew? Broad stats, for interest: Number of SVN revisions: 682. The dissertation builds out to 163 sheets of A4. Of those: 11 are front matter, including cover, acknowledgements, dedication, abstract, and tables of contents, figures and [...]]]></description>
			<content:encoded><![CDATA[<p>I submitted my PhD dissertation at the start of this week. Dissertation writing is a long, slow process. Who knew?</p>
<p>Broad stats, for interest:</p>
<ul>
<li>Number of SVN revisions: 682.</li>
<li>The dissertation builds out to 163 sheets of A4. Of those:
<ul>
<li>11 are front matter, including cover, acknowledgements, dedication, abstract, and tables of contents, figures and tables.</li>
<li>119 pages are body chapters.</li>
<li>21 pages are appendices.</li>
<li>12 pages are references.</li>
</ul>
</li>
<li>I included 130 references.</li>
<li>Across the body and the appendices, I wrote 40,342 words.</li>
</ul>
<p>Although word and page counts are not at all barometers of the quality of a PhD dissertation, these counts put me right in the middle of the University&#8217;s guidelines.</p>
<p>My word count grew initially in bursts. In the final days, it was hovering around the 41,000 word mark:</p>
<p><img class="alignnone size-full wp-image-628" title="dissertation-wordcount" src="http://youshottheinvisibleswordsman.co.uk/wp/wp-content/uploads/2011/11/dissertation-wordcount.png" alt="" width="544" height="300" /></p>
<p>The latter chapters were being expanded while earlier chapters were being tweaked down to size, and so the number of words written per day is actually higher than the total words, but I don&#8217;t have an easy way to pull out that data. The long flat section between July and August was when work was diverted to work on an INFOCOM 2012 submission.</p>
<p>But the most important meta metric is surely caffeine, which I started logging at the same time I rigged my scripts to log my word count. Broken very simply down into tea and coffee, my cumulative intake is as follows:</p>
<p><img class="alignnone size-full wp-image-630" title="mug-data-cumulative" src="http://youshottheinvisibleswordsman.co.uk/wp/wp-content/uploads/2011/11/mug-data-cumulative.png" alt="" width="544" height="300" /></p>
<p>That is, 407 mugs logged (230 of which were coffee, 177 of which were tea) across 139 days. Those are the ones I remembered to log, of course. I actually thought my tea count would have been much less than it was, but no matter.</p>
<p>Broken down by week, I think it&#8217;s safe to wave my hands in the air and state that there&#8217;s a distinct trend toward moar caffeine in the final weeks:</p>
<p><img class="alignnone size-full wp-image-623" title="mug-data" src="http://youshottheinvisibleswordsman.co.uk/wp/wp-content/uploads/2011/11/mug-data.png" alt="" width="544" height="300" /></p>
<p>So, there we go. I was burning through a lot of coffee in the final stages while clearing up a pile of annoying consistency glitches. I haven&#8217;t read back any of my dissertation since submission, but I think I am happy with the content therein, and the thrust of the argument. I think I have done some good work, and I am pretty happy with it.</p>
<p>Naturally, I have a list of pet issues and errors, without even reading the text. However, turning caffeine into words has now ended. The next phase is the viva, where less caffeine is probably better.</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2011/11/04/word-counts-and-caffeine-logs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Message ordering in disconnected Remote Actors</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2011/04/10/message-ordering-in-disconnected-remote-actors/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=message-ordering-in-disconnected-remote-actors</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2011/04/10/message-ordering-in-disconnected-remote-actors/#comments</comments>
		<pubDate>Sun, 10 Apr 2011 22:33:11 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[scala]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=593</guid>
		<description><![CDATA[I spotted a curious little Remote Actors behavioural bug recently, mostly by chance. Consider two remote actors, a source sending messages to a sink. If the source is alive prior to the sink, then the source can still send (asynchronous) messages. When the sink is started, the source will finally be able to connect, and [...]]]></description>
			<content:encoded><![CDATA[<p>I spotted a curious little Remote Actors behavioural bug recently, mostly by chance.</p>
<p>Consider two remote actors, a source sending messages to a sink. If the source is alive prior to the sink, then the source can still send (asynchronous) messages. When the sink is started, the source will finally be able to connect, and messages will be passed. Personally, I think it would be entirely reasonable for messages prior to the sink&#8217;s startup to be dropped. Scala&#8217;s Remote Actors queues the messages instead.</p>
<p>If messages are to be queued, however, I would fully expect those messages to arrive at the sink in FIFO order. What actually happens is that they arrive in LIFO order, i.e., entirely in reverse. This goes against my assumptions as a developer, given the behaviour of Scala message passing in all other circumstances.</p>
<p>The reason the messages arrive in reverse order is quite simple: the buffer is a Scala list, constructed by attaching each new item as the head of the list. When the sink comes alive, the TcpService running deep within the source is able to successfully <em>connect()</em>, and the message that prompted the <em>connect()</em> is sent. Then, the buffer is drained, by iteratively sending the head of the list until it is empty. Efficient, but surely backwards.</p>
<p>The fix is simple: when a <em>connect()</em> succeeds, reverse and then drain the buffer, and then send the message that prompted the successful connect. Thus, messages are buffered and arrive in FIFO order.</p>
<p>It&#8217;s a trivial little behavioural bug, but it could catch somebody out. The fix is simple; I <a href="http://lampsvn.epfl.ch/trac/scala/ticket/4454">submitted a bug report and a patch</a>.</p>
<p>This post is most closely related to my other posts on Remote Actors: <a href="http://youshottheinvisibleswordsman.co.uk/2009/04/01/remoteactor-in-scala/">[1]</a> <a href="http://youshottheinvisibleswordsman.co.uk/2010/06/18/scala-2-8-remote-actors-in-scala-2-7-7-final/">[2]</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2011/04/10/message-ordering-in-disconnected-remote-actors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Browne Review research spending</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2011/03/03/browne-review-research-spending/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=browne-review-research-spending</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2011/03/03/browne-review-research-spending/#comments</comments>
		<pubDate>Thu, 03 Mar 2011 10:17:09 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[thoughts]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=585</guid>
		<description><![CDATA[It was revealed some time ago that Browne review had a research budget of £120,000, of which £68,000 was spent, and most of that on an opinion survey. The results of the opinion survey were also unpublished. Well, that irked me. So I lodged an FOI request to receive the survey and its results, and I [...]]]></description>
			<content:encoded><![CDATA[<p>It was revealed some time ago that Browne review had a research budget of £120,000, of which <a href="http://www.timeshighereducation.co.uk/story.asp?storycode=414764">£68,000 was spent</a>, and most of that on an opinion survey. The results of the opinion survey were also unpublished. Well, that irked me. So I lodged an FOI request to receive the survey and its results, and I received those back without hassle. Fair play.</p>
<p>The opinion survey is protected under copyright law, so apparently it would be deeply uncool for me to start sharing it around on a public website. But I can sure talk about what&#8217;s in there.</p>
<p>The opinion survey asked 80 school pupils, 40 parents, 40 early-year University students, and 18 part-time students for their opinion on University funding. Let me quote the report up-front, so that we&#8217;re all clear on something:</p>
<blockquote><p>The findings in this research are based on a qualitative sample which is large enough to provide in-depth indicative findings. However, the findings are not statistically robust and should not be treated as such.</p></blockquote>
<p>The interviews took place at various locations, and sought to include people from different backgrounds, but we still have the fact that this survey was the only real research that took place to feed into a report that influenced the wrong Government&#8217;s policy. (The Browne review was initiated by the previous government.) It&#8217;s a survey almost 50% made up of latter-year school pupils, which has been interpreted incorrectly by the Government. Face decidedly in palm.</p>
<p>The basic jist of the report is that questions were asked regarding what levels of fees, if they were to be raised, would be acceptable. The survey also highlighted what I think many of us in the sector implicitly knew: most people are unaware of how much funding the Government provides the tertiary teaching sector per-head.</p>
<p>So the survey has a feel around to find out what people think, and concludes that some people are comfortable with fees set to £6,000 per annum, assuming that £6,000 is the cap. The new £9,000 cap is not mentioned once in the report. Nor is the concept of the withdrawal of the state from tertiary learning in England and Wales, which is the real reason the fees are now rising.</p>
<p>A couple of key quotes from the survey document:</p>
<blockquote><p>Most full time students and parents believed that the cost of providing higher education should be shared amongst government, graduates and parents.</p></blockquote>
<p>And also:</p>
<blockquote><p>Most participants believed that the government contribution to the costs of tuition and support for students is important for a number of reasons: there was the perception that the government benefits from higher taxes from graduates throughout their lifetime and a more skilled workforce able to generate wealth and deliver better public services. Some students and parents believed that government contributions to higher education are important in making it accessible to students from a wide range of different backgrounds, and avoiding a system based on the ability to pay. When completing a ‘pie chart’ exercise, marking relative proportions that the government vs. graduates should pay towards higher education, most full time students and parents believed that the government should pay at least half the cost of higher education. [[Footnote: It is important to note that this exercise did not focus on actual amounts of money, rather on the principle of the burden of funding.]] This is because the personal benefits of higher education were seen by many to match the benefits to society.</p></blockquote>
<p>So the coalition Government found found this opinion survey that says students and parents might be just happy enough to pay £6,000 tuition per year, but ignored all the parts like this:</p>
<blockquote><p>most full time students and parents believed that the government should pay at least half the cost of higher education.</p></blockquote>
<p>And so it seems they worked backward from the amount of money people might be willing to pay to rather ham-fistedly reduce tuition spending, rather than use any additional income to boost tuition spending. If the purpose of the survey was to determine how education spending could be boosted with additional income from graduates, then the <em>results</em> from the survey has been taken and used; the <em>purpose</em> of the survey, on the other hand, seems to have been ignored.</p>
<p>Quite simply, this is a survey that makes the assertion that students will find increased fees acceptable, given deferred payment, but makes no assertions regarding a corresponding reduction in state support. It raises concerns over an emerging &#8220;two-tier&#8221; system if variable fees are allowed, but does not resolve them. It is fully an opinion survey to determine just how much pain the public might be willing to take, but primarily asks school pupils for the answers. In brief, it is a survey which states its results are not statistically significant given the sample size, which discusses only one half of the course of action taken by the coalition Government.</p>
<p>Update, 2011-03-27: The document released as part of my request is now <a href="http://www.whatdotheyknow.com/request/browne_review_opinion_survey">available through whatdotheyknow.com</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2011/03/03/browne-review-research-spending/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>IPv4, 5, 6&#8230;</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2011/01/29/ipv4-5-6/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ipv4-5-6</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2011/01/29/ipv4-5-6/#comments</comments>
		<pubDate>Sat, 29 Jan 2011 19:04:35 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[internet]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=577</guid>
		<description><![CDATA[The exhaustion of the central pool of IPv4 addresses is imminent. Exhaustion has led to a resurgence of the simple question &#8216;what happened ip IPv5?&#8217; and, less commonly, &#8216;what happened to IPv{0,1,2,3}?&#8217;. IANA naturally governs the allocation of IP version numbers, and indeed have a list of all ten of them. The list is as follows: IPv0: [...]]]></description>
			<content:encoded><![CDATA[<p>The exhaustion of the central pool of IPv4 addresses <a href="http://www.potaroo.net/tools/ipv4/index.html">is imminent</a>. Exhaustion has led to a resurgence of the simple question &#8216;what happened ip IPv5?&#8217; and, less commonly, &#8216;what happened to IPv{0,1,2,3}?&#8217;. <a href="http://www.iana.org/">IANA</a> naturally governs the allocation of IP version numbers, and indeed have a <a href="http://www.iana.org/assignments/version-numbers/">list of all ten of them</a>. The list is as follows:</p>
<p>IPv0: Reserved.<br />
IPv1: Reserved.<br />
IPv2: Unassigned.<br />
IPv3: Unassigned.<br />
IPv4: defined in <a href="http://tools.ietf.org/html/rfc791">RFC 791</a>.<br />
IPv5: ST-II, defined in <a href="http://tools.ietf.org/html/rfc1190">RFC 1190</a>.<br />
IPv6: Formerly SIP, then SIPP, defined in <a href="http://tools.ietf.org/html/rfc1752">RFC 1752</a>.<br />
IPv7: TP/IX, defined in <a href="http://tools.ietf.org/html/rfc1475">RFC 1475</a>.<br />
IPv8: PIP, defined in <a href="http://tools.ietf.org/html/rfc1621">RFC 1621</a>.<br />
IPv9: TUBA, defined in <a href="http://tools.ietf.org/html/rfc1347">RFC 1347</a>.</p>
<p>Certainly the last 4 in this list were designed in response to the IPng <a href="http://tools.ietf.org/html/rfc1550">white paper solicitation</a>. <a href="http://postel.org/postel.html#about">Jon Postel</a>, for the sake of fairness, applied a version number for each of the candidates, and ultimately IPv6 was chosen.</p>
<p>Regarding IPv{2,3}, the (very informative) <a href="http://mailman.postel.org/mailman/listinfo/internet-history">internet history mailing list</a> provides an answer: <a href="http://mailman.postel.org/pipermail/internet-history/2009-November/000977.html">This mailing list post from Vint</a> states that TCP and IP were split sometime around TCPv3, presumably leading to IP packets with version number 3. The suggestion is that there was no IPv2 as we would recognise it.</p>
<p>IPv{0,1} were ever presumably also never used. And just as development of TCP/IP doesn&#8217;t map well onto early internet protocol versions, the &#8216;missing&#8217; versions also don&#8217;t map onto <a href="http://en.wikipedia.org/wiki/Network_Control_Program">NCP</a>, which TCP/IP replaced in early 1983; the packet headers, and indeed the network design and semantics as specified in BBN Report 1822, are very different to what we have today.</p>
<p>Anyway, we squeezed 28 years and 1 month out of IPv4. We&#8217;re all well into our IPv6 transition, right?</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2011/01/29/ipv4-5-6/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>That was the year that was &#8230; 2010.</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2011/01/05/that-was-the-year-that-was-2010/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=that-was-the-year-that-was-2010</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2011/01/05/that-was-the-year-that-was-2010/#comments</comments>
		<pubDate>Wed, 05 Jan 2011 01:22:25 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[life]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=543</guid>
		<description><![CDATA[2010 is definitely behind us, and so here is the customary retrospective post where I reflect on my year. I enjoyed my 2010. It&#8217;s largely been a busy, keep-your-head-down kind of year, but that&#8217;s okay. It means I have plenty to reflect on. So, the majority of my reflection focusses on work. I&#8217;ve done a [...]]]></description>
			<content:encoded><![CDATA[<p>2010 is definitely behind us, and so here is the customary retrospective post where I reflect on my year. I enjoyed my 2010. It&#8217;s largely been a busy, keep-your-head-down kind of year, but that&#8217;s okay. It means I have plenty to reflect on.</p>
<p>So, the majority of my reflection focusses on  work. I&#8217;ve done a lot of work this year, including authoring four papers (primary on two, secondary on two). One of those papers was <a href="http://sdstrowes.co.uk/publications/2010-hgw-study.pdf">published at IMC 2010</a>. One was rejected, despite decent reviews, but I did submit it to a highly competitive venue. Two are awaiting a decision some time soon. I find writing fun, in the sense that it features the important process of discovering the things you  don&#8217;t understand  in quite enough detail. This process forces any author to do more work, to clarify their argument. The outcome, I think, is good. And, in my case, the outcome is also two dissertation chapters.</p>
<p>My workflow, I am realising, seems to be heavily centralised, but centralised in various places. By that, I mean, I&#8217;m increasingly reliant on parts of &#8220;the cloud&#8221;. Although my office  machine is the focal point for my work, Google has seen some more of my data this year, too. But I&#8217;ve expanded into all sorts of other services.</p>
<p>Although I have parts of my electronic life backed up  using various mechanisms where appropriate (svn, rsync, etc), these are all a bit  disparate. To add to the ragbag, I started using Dropbox as a convenient way of syncing my library of PDF copies of papers  between my machines. (I did toy with using <a href="http://fak3r.com/2009/09/14/howto-build-your-own-open-source-dropbox-clone/">lsync</a> to achieve the same, but ultimately left this for another day: I&#8217;d like  my own dedicated host sitting in my flat to coordinate this sort of data, but I don&#8217;t have one I&#8217;m  happy with right now.) Regarding other services, I started using Delicious again until all the <a href="http://techcrunch.com/2010/12/16/is-yahoo-shutting-down-del-icio-us/">ambiguity surrounding its future</a>, after which I promptly jumped to <a href="http://pinboard.in/u:sdstrowes">pinboard.in</a>, which I like a lot. Another service I&#8217;m now happily using regularly is <a href="http://instapaper.com/">Instapaper</a>, useful for punting links to for reading later, rather than holding far too many browser tabs open. I&#8217;ve also been using <a href="http://todoist.com/">Todoist</a> for some longer-term todo-list management. All these services feel lightweight, and offer plugins for Chrome, meaning I don&#8217;t need  to go out of my way to use them. Joy.</p>
<p>In a non-cloud progression, I started learning little bits of <a href="http://www.r-project.org/">R</a> this year.  R is a statistics package which makes my life simpler by implementing  all sorts of statistical tests that I no longer need to craft by hand:  think percentiles and standard deviations, and other calculations of that ilk, and how tedious many of them are to craft in AWK. I let R handle all that now, and my work is probably less error-prone because of it.</p>
<p>Regarding non-work cloudification, Spotify now looms large on my listening  habits. I think I started using the service more than a year ago, but I  have one machine dedicated to Spotify that is happily hooked up to my  sound system and streams much music when I please. Is this good? Well,  it is until you decide to do research on Spotify peering traffic using your own  account, and you have days where you can&#8217;t interrupt experiments. This  did make me realise just how much I use Spotify: When I didn&#8217;t have  access to the service, there were many things I did not own that I could  no longer listen to. Equally, there are many CDs I own that I had not  heard in many months: physical media still seems more interesting to  browse. Will I stop buying CDs? Unlikely, especially given the frequency  at which Spotify&#8217;s contracts with the record labels seem to expire:  It&#8217;s not uncommon to find tracks missing from carefully crafted  playlists when the track is no longer available through Spotify. A  shame, but otherwise I&#8217;m pretty happy welcoming Spotify into my home. The other web service I&#8217;ve come to use frequently is <a href="http://www.dailymile.com/">Dailymile</a>, which I use every single time I go running. It&#8217;s fun to see just how my running varies over the year&#8230;</p>
<p>In terms of gadgets: I acquired my <a href="http://maemo.nokia.com/n900/">N900</a>, and my <a href="http://www.amazon.co.uk/dp/B002Y27P46">Kindle</a>. These sit very nicely within my ecosystem. In particular, the N900 is basically a Debian-based Linux box. Since I&#8217;ve been using Debian systems for many, many years, I&#8217;m entirely used to how the N900 works and, as a Linux box, it does a lot of things transparently in my environment. Likewise, it&#8217;ll do a lot of things not so transparently, but it pretty much works how I&#8217;d like it to. And my Kindle, I&#8217;ve only just acquired, but Amazon seem to have been pretty smart in how they manage syncing, and how they manage profile management.</p>
<p>Regarding travel, I haven&#8217;t done so much this year. I went to Australia again, with the tiniest of stops in London. I visited Cambridge a few times (arguably too many, given the size of the city). Abingdon, once again, for the annual <a href="http://www.informatics.sussex.ac.uk/research/projects/ngn/cgi-bin/ngn05.cgi?pg=wshops/msn10/proc">Cosener&#8217;s pilgrimage</a>. And apparently a tour of Scottish cities: Inverness, Aberdeen, Stirling, Edinburgh.</p>
<p>I did also take something of a break at the end of summer to walk the <a href="http://en.wikipedia.org/wiki/West_Highland_Way">West Highland Way</a>, a 96 mile (154km) walk from Milngavie to Fort William, though I did do this in two stages with a couple of days break in between (the first 48 miles to Crianlarich achieved in two days, then a two day break for aching feet, then I completed the route myself over another three days).</p>
<p>And that other hobby, photography, which I played around with whenever I had time. My favourite shots of this year:</p>
<p><a href="http://flic.kr/p/8YFKuG"><img src="http://farm6.static.flickr.com/5286/5235976836_4c9e37b467_t.jpg" alt="" /></a> <a href="http://flic.kr/p/8qYTrJ"><img src="http://farm5.static.flickr.com/4137/4877192608_6b6ab32a48_t.jpg" alt="" /></a><a href="http://flic.kr/p/7YnPmf"> <img src="http://farm4.static.flickr.com/3335/4576121018_a1708f3e8b_t.jpg" alt="" /></a><a href="http://flic.kr/p/7YnPmf"> </a></p>
<p><a href="http://flic.kr/p/8UZV72"><img src="http://farm5.static.flickr.com/4113/5194255193_da489e736c_t.jpg" alt="" /></a> <a href="http://flic.kr/p/8KJmZd"><img src="http://farm5.static.flickr.com/4132/5089372006_97042c8773_t.jpg" alt="" /></a> <a href="http://flic.kr/p/7YnPmf"> </a><a href="http://flic.kr/p/7uExnj"><img src="http://farm5.static.flickr.com/4064/4262522272_16dfa74a41_t.jpg" alt="" /></a></p>
<p><a href="http://flic.kr/p/8w32zg"><img src="http://farm5.static.flickr.com/4122/4934387773_d7fac0173e_t.jpg" alt="" /></a> <a href="http://flic.kr/p/8w3e7P"><img src="http://farm5.static.flickr.com/4073/4934426607_105cb8a25c_t.jpg" alt="" /></a> <a href="http://flic.kr/p/7QwE5Z"><img src="http://farm3.static.flickr.com/2727/4487313897_99586a754d_t.jpg" alt="" /></a></p>
<p>And so, that&#8217;s that. Goodbye 2010; hello 2011.</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2011/01/05/that-was-the-year-that-was-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The process of the Ph.D.</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2010/11/23/the-process-of-the-ph-d/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-process-of-the-ph-d</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2010/11/23/the-process-of-the-ph-d/#comments</comments>
		<pubDate>Tue, 23 Nov 2010 23:56:08 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[life]]></category>
		<category><![CDATA[thoughts]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=514</guid>
		<description><![CDATA[The course of a Ph.D. is a strange one. You hopefully get into it because you enjoy learning and you want to do research. Possibly you are capable of one of those and want to do the other. Whatever. Either way, it&#8217;s a very personal experience, driven primarily by yourself and your interests. Your research [...]]]></description>
			<content:encoded><![CDATA[<p>The course of a Ph.D. is a strange one. You hopefully get into it because you enjoy learning and you want to do research. Possibly you are <em>capable</em> of one of those and <em>want</em> to do the other. Whatever. Either way, it&#8217;s a very personal experience, driven primarily by yourself and your interests.</p>
<p>Your research is anchored to one primary requirement: to contribute something to the scientific community. Obviously the primary drive at the start is to gobsmack the world with your sheer brilliance, to make your mark on science, to discover some new fundamental truth, and graduate in record time. Everybody&#8217;s entitled to that phase; after all, you never know.</p>
<p>Meanwhile, back here on Earth, we know this isn&#8217;t how most Ph.D.s go. &#8220;A contribution&#8221; can be surprisingly small, often depressingly so. But over time you come to realise that the requirement of contribution is anchored to two important desires, much more important than &#8220;fame&#8221;: first, the desire to contribute in a way that you hope will be acceptable to your review panel; second, the desire to contribute something that <em>you find interesting</em>. This is, after all, presumably why you got into things in the first place. The former is most important if you care for an easy life; the latter is important if you care for personal fulfilment.</p>
<p>And here is why: you&#8217;re going to be working on this for roughly four years of your life. A Ph.D. is one of life&#8217;s few opportunities to work relatively freely from interference. Also, if you get to the end of your degree and you still enjoy what you were working on, or have a bag full of ideas for &#8220;when you have time,&#8221; then congratulations. <em>That&#8217;s</em> the primary indicator of whether you were born for your field, or if your Ph.D. is as far as you want to take your academic career. If you get to the end and you aren&#8217;t horribly bitter and twisted, itching to get out, then the academic community might decide to keep you around.</p>
<p>So, interest is a primary driver. This interest drives you to learn, and by continuously learning you ultimately also learn where the boundary of human knowledge lies. From the perspective of the review panel, demonstrating crystal clear perception of how far human knowledge goes in your particular field is of the utmost importance.</p>
<p>But there&#8217;s a hidden process in getting to this point, and I find it quite interesting.</p>
<p>When you enter from an undergraduate perspective, you think you might know everything about anything, but that&#8217;s generally far from the truth. You know a lot, granted, or you wouldn&#8217;t have gotten funding. But what many undergraduates have is a broad-strokes picture of things. This is a blissful, innocent phase, where if you want to know more <em>you go read about it</em>.</p>
<p>In this context, everything in the universe feels knowable, because you haven&#8217;t yet reached the boundary of human knowledge. The Ph.D. <em>must</em> break you out of this mentality. You will not pass if you continue to think this way. If you do not find the boundary, then you will never be broken.</p>
<p>On your way toward your Ph.D. (which you&#8217;ll surely get because you&#8217;re pretty smart, right?), you read. You read a lot. You read hundreds of papers every year. Then you read some more. Reading papers is interesting because, apart from a rare few, many cover tiny incremental steps representing one more chunk of the puzzle. Many you can discard immediately. You learn how to rip out the key points of anything you read quickly, to determine in a few minutes whether this piece of paper in front of you that somebody poured months of their life into answers any of your questions. During this time, you refine your questions, and repeat. Refinement followed by reading followed by refinement followed by reading. This phase involves constantly banging your head off the boundary of human knowledge, sometimes without realising it. Still used to answers being available, this is a confusing phase. &#8220;<em>Surely somebody did this before? Seems obvious to me.</em>&#8221;</p>
<p>During this phase, you hopefully find a crack in the boundary and start working on something without being told to by your supervision team. This is something you decide to fill in yourself, and realise that possibly you could write about it too. Something you can do so that we know just a little bit more than we did yesterday.</p>
<p>But ultimately, the process of the Ph.D. is to break you down and rebuild you into a being who does not <em>expect</em> that certain knowledge is already known, does not <em>feel</em> that somebody must have already found such answers, to arrive at a point where you have an intuitive sense of what <em>has</em> and <em>has not</em> already been achieved. The Ph.D. teaches you how to parse what little information <em>is</em> known in such a way that you can determine whether you need to discover the unknowable. Or prove the as-yet unknown. Or verify the assumptions of others. You earn, through hard labour, sharp criticality of other people&#8217;s work. And hopefully of your own, too.</p>
<p>So you start the Ph.D. with much to learn. You finish the Ph.D. with an acute awareness of what is unknown. This, to me, is the single most important point of the whole process. Your Ph.D. becomes your license to go fill in those blanks.</p>
<p>As an aside, scepticism helps the break/rebuild process. The sceptic does not need to be broken down quite so far. Every time the sceptic hits the wall, shrug, &#8220;<em>Figures.</em>&#8221; The sceptic is quite comfortable in academia, because the sceptic works on the glass-half-empty principle. The sceptic has the in-built ability to detect when he&#8217;s read a half-truth. He understands how to interpret words in the least positive, but still truthful, manner, and can determine from that what the author of the text is <em>not</em> saying. This is an important skill not only when scanning the literature for prior work in an area, but also when reviewing papers. Ultimately, in the breaking process, the sceptic reads to the boundary of knowledge, and is capable of realising that <em>nobody knows what the sceptic wants to know</em>.</p>
<p>Shrug. <em>Figures</em>.</p>
<p>But the sceptic is equally pleased when he <em>does</em> find what he&#8217;s looking for. The sceptic is content to forever scrape boundary of knowledge, lifted by occasional moments of joy when he finds the information he seeks. The sceptic&#8217;s playground is this knowledge boundary.</p>
<p>I am something of a sceptic. I feel that this has made me amenable to much of the Ph.D. process.</p>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2010/11/23/the-process-of-the-ph-d/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ACM Internet Measurement Conference (IMC) 2010: Day 3</title>
		<link>http://youshottheinvisibleswordsman.co.uk/2010/11/07/acm-internet-measurement-conference-imc-2010-day-3/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=acm-internet-measurement-conference-imc-2010-day-3</link>
		<comments>http://youshottheinvisibleswordsman.co.uk/2010/11/07/acm-internet-measurement-conference-imc-2010-day-3/#comments</comments>
		<pubDate>Sun, 07 Nov 2010 15:41:16 +0000</pubDate>
		<dc:creator>Stephen</dc:creator>
				<category><![CDATA[internet]]></category>
		<category><![CDATA[technology]]></category>

		<guid isPermaLink="false">http://youshottheinvisibleswordsman.co.uk/?p=502</guid>
		<description><![CDATA[Preamble The notes that follow are a mixture of what each speaker said, or bullets listed on slides (verbatim, with minor US-&#62;UK deltas because I was touch-typing), or thoughts of my own. If you were at IMC and you spot errors (likely), do feel free to point them out, and I’ll fix them. The amount [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Preamble</strong></p>
<p><em>The notes that follow are a mixture of what each speaker said, or bullets listed on slides (verbatim, with minor US-&gt;UK deltas because I was touch-typing), or thoughts of my own. If you were at IMC and you spot errors (likely), do feel free to point them out, and I’ll fix them.</em></p>
<p><em>The amount of text written for each is proportional to certain variables: How tired I was during the talk, how interested I was in the talk, how much I knew about the subject material, and how good the speaker was. Note also that these are pretty raw. Questions are omitted if the question was not clear; many responses from the speaker have been shortened for brevity.</em></p>
<p><strong>Session 11: Methodology III</strong></p>
<p><strong>Long paper: </strong>Revisiting the Case for a Minimalist Approach for Network Flow Monitoring</p>
<p>Speaker: Vyas Sekar</p>
<ul>
<li>Decouple collection and computation
<ul>
<li>Generic collection primitive</li>
<li>Generic data structure</li>
</ul>
</li>
<li>Primitive 2: Sample and hold; If flow is already logged, update</li>
<li>Sample packet with prob p</li>
<li>If new flow -&gt; create counter</li>
<li>Use cSamp [NSDI'08] to configure flow sampling capabilities</li>
<li>Hash-based coordination for non-overlapping sets of flows</li>
<li>Network-wide optimisation -&gt; operators golas e.g., per-path guarantee</li>
<li>Putting the pieces together: “minimalist” approach</li>
<li>Flow sampling + sample &amp; hold</li>
<li>Relative accuracy difference = accuracy (minimalist) &#8211; accuracy (app specific) / accuracy (app specific)</li>
<li>Hardware reqs are similar:
<ul>
<li>both need per-packet array/key-value updates</li>
<li>More than packet sampling, but with router capabilities</li>
</ul>
</li>
<li>Memory for counteres
<ul>
<li>botleneck is SRAM (flow headers can be offloaded to DRAM)</li>
<li>We conservatively assume 4x more per-counter cost</li>
</ul>
</li>
<li>Processing costs
<ul>
<li>Online cost lower for minimalist (don&#8217;t need per-app-instance)</li>
<li>OFfice cost is higher for minimalist (but can be reduced)</li>
</ul>
</li>
<li>Reporting bandwidth: Higher for minimalist, but &lt; 1% of network capacity</li>
</ul>
<ul>
<li>Q: Costs</li>
<li>A: Hardware costs. Full sampling is of similar complexity to current packet sampling. Another issues is SRAM size.</li>
</ul>
<p><strong>Long paper: </strong>Exact Temporal Characterization of 10 Gbps Optical Wide-Area Network</p>
<p>Speaker: Daniel A. Freedman</p>
<ul>
<li>Understand novel behaviour of high-performance lightly loaded WAN links</li>
<li>Apprciate distortive impact of endpoint network adapters</li>
<li>Present instrumentation (named bifocals)</li>
<li>Network adapter distorts signatures/timestamps. Instead, let&#8217;s tap optical fibre directly.</li>
<li>Record analogue waveform of entire physical layer symbolstream in real time; extract discrete ethernet frames in later off-line processing</li>
<li>Packet chains are exponentially rare</li>
<li>What causes WAN burstiness?
<ul>
<li>Still an open question</li>
<li>But&#8230; Relatively insenstive to background traffic. Not due to a single router</li>
</ul>
</li>
<li>Instrumentation: Precise and reproducable timing measurements, accessible and transparent</li>
<li>This is the first “ground truth”measurement of packet timings of network strreams in flight</li>
<li>Comparaitive refernce for less accurate tools (determine when/where applicable)</li>
</ul>
<ul>
<li>Q: If we had optical switches, would this behaviour still be there?</li>
<li>A: That&#8217;s two questions. Is it the conversion? I think no. (“ODE”?). Is it the design of the switches? I guess so. Doesn&#8217;t know enough about optical switches though. If the switches mimic the same architecture, then probably they&#8217;ll behave the same.</li>
<li>Q: You get a number of time peaks at different points. Did you find any time peaks that would correspnd to timings within the router device? Known clock frequencies, etc? Could start talking about where in the router these artifacts originate?</li>
<li>A: Haven&#8217;t seen precisely what you&#8217;re asking about.</li>
</ul>
<p><strong>Long paper: </strong>TraceNET: An Internet Topology Data Collector</p>
<p>Speaker: Kamil Sarac</p>
<ul>
<li>tracenet is an extension to traceroute in internet topology collection studies</li>
<li>At each hop, traceroute to nodes in subnet hosting that hop&#8217;s ip addr</li>
<li>Network exploration
<ul>
<li>Trace collection</li>
<li>subnet exploration: start at a /30 subnet S which inclueds the IP address IP_v of the pivot interface v; Check if /30 mate of IP_v is likely to be in S; Grow S to a /X subnet (X = 29, 28&#8230;) which includes the IP address of v as long as IP addresses in subnet S satisfy our hueristics</li>
</ul>
</li>
<li>Hueristics: 9 heuristics to improive accuracy of our inferences</li>
<li>Does IP address i belong to assumed subnet S? Yes with high probability, if it satisfies hueristics.</li>
<li>Probing overhead: Exact number of probes during a tracenet session depends on length of trace, configuration of the explored subnet, confiruation of the fringe subnets</li>
<li>Path fluctuations</li>
<li>LImitations: Depends on heuristics based on best practices in layer 3 network configuration underestimation due to subnets with sparsely utilised IP addresses requires router responsiveness</li>
</ul>
<ul>
<li>Q: How sure are you that the 9 heuristics cover all the cases?</li>
<li>A: We did our best to think of all cases! Blah blah blah.</li>
<li>Q: Did you get multiple responses from broadcast addresses?</li>
<li>A: We did not do it.</li>
</ul>
<p><strong>Session 12: Online Social Networks</strong></p>
<p><strong>Long paper: </strong>Understanding Latent Interactions in Online Social Networks</p>
<p>Speaker: Jing Jiang</p>
<ul>
<li>Crawling process complicated by two things:
<ul>
<li>crawlers must revisit usrs often to build complete log; new visitors cause older visitors to fall off the list.</li>
<li>crawlers cannot visit each user to frequently; renren imposes multiple rate limits</li>
</ul>
</li>
<li>Visible interactions show much reciprocity</li>
</ul>
<p><strong>Short paper: </strong>Measuring the Mixing Time of Social Graphs</p>
<p>Speaker: Abedelaziz Mohaisen</p>
<ul>
<li>The Sybil Attack, John Douceur</li>
<li>Social graphs are not fast mixing</li>
<li>Some theoretical arguments in sybil defences are inaccurate</li>
<li>the applicability of social network-based sybil defenes is infeasible for some social graphs</li>
<li>there is a negative correlation between trust (hypothesized) and the underlying algorithmic property in these graphs&#8212;i.e., the mixing time. (Thanks to Aziz for correcting my hasty notes!)</li>
</ul>
<p><strong>Long paper: </strong>Estimating and Sampling Graphs with Multidimensional Random Walks</p>
<p>Speaker: Bruno Ribeiro</p>
<ul>
<li>Measurement distortions</li>
</ul>
<p><strong>Short paper: </strong>The Impact of YouTube Recommendation System on Video Views</p>
<p>Speaker: Lixin Gao</p>
<p><strong>Session 13: Path Properties and Dynamics</strong></p>
<p><strong>Long paper: </strong>Selecting Representative IP Addresses for Internet Topology Studies</p>
<p>Speaker: Xun Fan</p>
<ul>
<li>Topology study needs targets. Where do you dervice your destination addresses for your traceroutes?</li>
<li>census (ping sweeps) tell us who responds. Use census to predict best representatives.</li>
<li>contribs:
<ul>
<li>new automated hitlist generation method, updated everyh 2-3 months</li>
<li>evaluation of the hitlist</li>
<li>learn about the internet from the hitlist</li>
</ul>
</li>
<li>Completeness: Select one representative for all allocated /24&#8242;s Responsiveness: representative should be most likely to respond. Stability: representatives don&#8217;t change arbitrarily. Important for longitudinal study.</li>
<li>Representative predicition from census data: score IP addresses based on appearances in census data. Choose highest scoring rep per /24.</li>
<li>Dataset available: <a href="http://www.isi.edu/ant/">http://www.isi.edu/ant/</a>.</li>
</ul>
<ul>
<li>Q: Your aim is not to map the core. But have you considered in fact doing this?</li>
<li>A: ???</li>
<li>Q: What is the best coverage both in terms of ASes and prefixes that you can get? How much can I probe?</li>
<li>A: 1/12th of the allocated address space</li>
</ul>
<p><strong>Short paper: </strong>Representative Speed Tests for Free: Estimating Achievable Download Speed from Passive Measurements</p>
<p>Speaker: Jeffrey Pang</p>
<ul>
<li>How fast is the network? (From the perspective of the customers)</li>
<li>Benefits of passive measurement: No additional load on network,</li>
<li>millions of vantage points, captures actual “user experience”</li>
<li>Challenge: estimating network speed from flow records in the wild.</li>
<li>Solution: Only look at flows from non-rate-liimted applications and content-providers.</li>
<li>Goal: Detect rate-limited flows. Detecting if individual flows are rate-limited is not possible since we only have record flow records, not full packet streams.</li>
<li>Goal: Detect rate-limited flows-types</li>
<li>Contribution: A method to generate a throughput index. A set of creiteria that selects on y passive measurements that accuraely reflect achievable network speed</li>
</ul>
<ul>
<li>Q: Why didn&#8217;t you consider normal TCP behaviour?</li>
<li>A: Because at the flow level we do not have that information, do not know RTTs, etc.</li>
</ul>
<p><strong>Long paper: </strong>YouTube Traffic Dynamics and Its Interplay with a Tier-1 ISP: An ISP Perspective</p>
<p>Speaker: Vijay Kumar Adhikari</p>
<ul>
<li>Q: If you are doing localisation, you would see large sites near to customers?</li>
<li>A: Some faff on cost of building, economics, hand-waving. ??</li>
</ul>
<p><strong>Short paper: </strong>On the Characteristics and Reasons of Long-lived Internet Flows</p>
<p>Speaker: Lin Quan</p>
<ul>
<li>Long flows -&gt; high probability these are computer-to-computer traffic</li>
<li>Contrib: Multi-time-scale analysis
<ul>
<li>New mechanism that csales to weeks of traffic</li>
<li>efficient processing and queries</li>
</ul>
</li>
<li>show existence of long-lived flows</li>
<li>Find their characteristics and causes</li>
<li>Causes:
<ul>
<li>port based analysis</li>
</ul>
</li>
<li><a href="http://www.isi.edu/ant/">http://www.isi.edu/ant/</a></li>
</ul>
<ul>
<li>Q: What&#8217;s the smallest timescale you can say anything about the bustiness?</li>
<li>A: 10 mins</li>
<li>Q: Okay, 10 minutes is not enough to say anything about the burstiness</li>
<li>Q: How about using 2-tuple rather than 5-tuple; src-ip:dst-ip?</li>
<li>A: Um.</li>
</ul>
<p><strong>Session 14: Anomaly and Event Detection</strong></p>
<p><strong>Long paper: </strong>BasisDetect : A Model-based Network Event Detection Framework</p>
<p>Speaker: Paul Barford</p>
<ul>
<li>Anomoly detection: DOS attacks, outages, flash crowds, etc</li>
<li>Goal: Fast and accurate identification</li>
<li>Challenges: Data selection, filter selection,, perspective, thresholds, etc</li>
<li>Little actual deployment and use</li>
<li>Can we exploit structure in both normal and anomalous traffic to identify anomalies?</li>
<li>Basic detect methodology. Three components:
<ul>
<li>anomoly dictionary construction</li>
<li>anomoly decomposition of link data</li>
<li>network-wide data fusion</li>
</ul>
</li>
<li>In terms of single-link real world dataset: over 48% frewer false alarms</li>
<li>in terms of network-wide synthetic dataset: between 54-90% fewer false alarms</li>
</ul>
<ul>
<li>Q: If I understand, you&#8217;re looking for anomolies in the future that look like the old anomolies. Relies on signatures, won&#8217;t be able to see something you&#8217;ve never seen before?</li>
<li>A: Yes. False negatives are way better than false positives. We&#8217;re focussing on the latter, not enough on the former.</li>
<li>Q: Did you focus at all on false negatives?</li>
<li>A: A little.</li>
<li>Q: What about building your dictionary on normal traffic, and assuming anything else is anomolous?</li>
<li>A: ??</li>
</ul>
<p><strong>Short paper: </strong>Temporally Oblivious Anomaly Detection on Large Networks Using Functional Peers</p>
<p>Speaker: Kevin M. Carter</p>
<ul>
<li>Assume baseline behaviour, aim of anomaly detection is to flag the unexpected</li>
<li>An anomaly is not necessarily malicious</li>
<li>Given a large network, utilise functional peers to identify anomalous hosts. Baseline over space rather than time.</li>
<li>Contribs:
<ul>
<li>remove temporality entirely from detection</li>
<li>scales to very-large networks, don&#8217;t have to maintain state for all hosts</li>
<li>anomalies can be automatically categorised into descriptive sets. Actionable intelligence.</li>
</ul>
</li>
<li>This is NOT an IDS.</li>
</ul>
<ul>
<li>No questions fielded!</li>
</ul>
<p><strong>Long paper: </strong>What Happened in my Network? Mining Network Events from Router Syslogs</p>
<p>Speaker: Dan Pei</p>
<ul>
<li>Contrib: SyslogDigest: proacitvely mine network events from router systlogs</li>
<li>take low-level minimally sturcture syslog messages -&gt; prioitised high-level network events.</li>
<li>Grouping relevant messages together
<ul>
<li>Temporal relationship</li>
<li>spatial relationship</li>
<li>associations of message types</li>
</ul>
</li>
<li>Presentation:
<ul>
<li>ranking</li>
<li>naming</li>
</ul>
</li>
<li>Message template learning: Templates trte the signatures with the combination of frequent words</li>
<li>Decompose messages into works separated by whitespace</li>
<li>Construct a tree structure to express the template hierarchy based on the input messages</li>
<li>[...]</li>
<li>Rule-based grouping
<ul>
<li>“happening together”. Temporaly close within a sliding window</li>
<li>“together frequently”. Association rule mining.</li>
</ul>
</li>
<li>Offers 2-3 orders of magnitude reduction from raw messages to events</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://youshottheinvisibleswordsman.co.uk/2010/11/07/acm-internet-measurement-conference-imc-2010-day-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->
