I recently built this toy command line interface (CLI) app for fun. You can see a higher resolution video of it in action by clicking the image below.
The video above shows a snapshot of the CLI in action during the middle of a speedrun of the Nintendo 64 game The Legend of Zelda: Ocarina of Time, hosted on Twitch.tv. The filtering kicks in during a burst of WutFace spam, which you can see in the lower left of the video.
For the uninitiated, Twitch.tv is a streaming video website that allows anyone to share live video with viewers over the internet. The most popular use case, by a wide margin, is streaming players playing video games, although it is also sometimes used for sharing live music, where the streamer might be playing an instrument (or instruments, if it’s a band).
Several thousand people can be watching the same video stream at the same time — it’s not uncommon to see streams with 100,000 simultaneous viewers. Since viewers of the same stream are all put into the same chat room (no chat room splitting happens automatically), the chat messages can become incredibly difficult to parse.
To (sort of) solve this problem, I wrote a simple CLI Twitch/IRC chat client that supports the ability to filter out duplicate messages within a sliding time window in realtime.
Although this was built specifically for Twitch, it works with any chat that supports IRC as a protocol.
The fully functional IRC client was written in Golang, a choice made purely out of my desire to learn the language. If I were to build something beyond toy usage, I’d probably write it as a Chrome extension, since native Twitch chat lives in the browser, and that’s where users are accustomed to chatting.
The client contains the following widgets:
1) Message Rate Monitor (rate of messages over a sliding window)
2) Duplicate Message Aggregator (list of messages sorted by number of occurrences over a sliding window)
The trickiest implementation detail was the duplicate message aggregator. It involved maintaining a sorted list of (message, count) pairs in realtime as messages arrived, then decrementing counts and removing messages from the list as they aged out of the window, all while preventing concurrent updates from corrupting the counts.
Despite the complexity, Golang made this process quite easy. Concurrent updates were handled by serializing all updates through a queueing channel, and goroutines with time.Sleep(duration) were used to decrement counts.
Further Thoughts on Golang
Golang was a bit of a pain at times — the lack of generics in the language shows. I had to duplicate methods like math.Max for int types, since the builtin math.Max only works with float64. Also, the extra overhead of thinking about when to pass a pointer to a struct vs just the struct got in the way of thinking about the higher level functionality of the application.
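For instance, an int version of math.Max is only a few lines, but it had to be written by hand. (Go has since gained generics and a builtin max, so this duplication is no longer necessary in modern Go.)

```go
package main

import (
	"fmt"
	"math"
)

// maxInt duplicates math.Max for ints; the builtin math.Max only accepts
// (and returns) float64, so calling it with ints means lossy conversions.
func maxInt(a, b int) int {
	if a > b {
		return a
	}
	return b
}

func main() {
	fmt.Println(maxInt(3, 7))       // 7
	fmt.Println(math.Max(3.0, 7.0)) // 7 (the float64 version)
}
```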
On the flip side, I spent almost no time learning Golang and was able to build something functional almost immediately. This hasn’t been the case with other languages I’ve learned, such as TypeScript or Scala. The expediency experienced with Golang comes from two things — firstly, the amazing tooling that has gone into the language ecosystem, and secondly, the sheer simplicity of the language and lack of language features.
I wouldn’t use Golang as my first language choice for prototyping a product, but for system level programming, network intensive applications, or performance critical backends, Golang seems like the perfect choice for development. I’ll stick to Python, Scala, or TypeScript for prototyping products quickly, and use Golang when the extra performance is necessary.
Using the Client
The source code and usage instructions for the project described above can be found at my github page here. All that is needed is a Golang installation and a Twitch account.
This chart shows how well or poorly certain algorithm complexities scale when n grows,
in the context of CPU runtime. Exponentials and factorial algorithms clearly
do not scale to values of n of any significant size.
Function | n=8 (2^3) | n=64 (2^6) | n=1024 (2^10) | n=2^32 (~ 4 billion)
--- | --- | --- | --- | ---
log(n) | 3 ns | 6 ns | 10 ns | 32 ns
n | 8 ns | 64 ns | 1 µs | 4.2 seconds
n*log(n) | 24 ns | 384 ns | 10 µs | 137 seconds
n^2 | 64 ns | 4 µs | 1 ms | 584 years
n^2*log(n) | 192 ns | 24 µs | 10 ms | forever
n^3 | 512 ns | 262 µs | 1 second | forever
2^n | 256 ns | 584 years | forever | forever
n! | 40 µs | forever | forever | forever
ns = Nanoseconds, about the time it takes for a CPU instruction cycle to run
µs = 1,000 ns
ms = 1,000 µs
forever = way, way longer than the age of the universe (13.798 billion years)
This chart shows the round trip latency for certain computing actions.
Action | Time | Comparisons
--- | --- | ---
L1 cache reference | 0.5 ns |
Branch mispredict | 5 ns |
L2 cache reference | 7 ns | 14x L1 cache
Mutex lock/unlock | 25 ns |
Main memory reference | 100 ns | 20x L2 cache, 200x L1 cache
Compress 1K bytes with Zippy | 3,000 ns |
Send 1K bytes over 1 Gbps network | 10 µs |
Read 4K randomly from SSD* | 150 µs |
Read 1 MB sequentially from memory | 250 µs |
Round trip within same datacenter | 500 µs |
Read 1 MB sequentially from SSD* | 1 ms | 4X memory
Disk seek | 10 ms | 20x datacenter roundtrip
Read 1 MB sequentially from disk | 20 ms | 80x memory, 20X SSD
Send packet CA->Netherlands->CA | 150 ms |
+ By Jeff Dean: http://research.google.com/people/jeff/
+ Originally by Peter Norvig: http://norvig.com/21-days.html#answers
+ Retrieved from Jonas Bonér: https://gist.github.com/jboner/2841832
Memory and Storage Numbers
Big-O Storage size
This chart shows how well or poorly certain algorithm complexities scale when n grows,
in the context of memory and storage.
Function | n=8 (2^3) | n=64 (2^6) | n=1024 (2^10) | n=2^32 (~ 4 billion)
--- | --- | --- | --- | ---
log(n) | 3 B | 6 B | 10 B | 32 B
n | 8 B | 64 B | 1 KB | 4 GB
n*log(n) | 24 B | 384 B | 10 KB | 128 GB
n^2 | 64 B | 4 KB | 1 MB | 16 exabytes
n^2*log(n) | 192 B | 24 KB | 10 MB | ...
n^3 | 512 B | 256 KB | 1 GB | ...
2^n | 256 B | 16 exabytes | ... | ...
n! | 40 KB | ... | ... | ...
B = 1 Byte, or 8 Bits
KB = kilobyte (1024 B)
MB = megabyte (1024 KB)
GB = gigabyte (1024 MB)
TB = terabyte (1024 GB)
PB = petabyte (1024 TB)
exabyte = 1024 petabytes, or 1 million computers with 1 terabyte HDs
... = there isn't enough room on earth to store the required computers.
Powers of Two Table
This chart shows how much memory is required to store 2^n bytes of data. For example, it takes
4 GB of memory to hold 2^32 bytes, and 16 GB of memory to hold
an array of 2^32 32-bit integers. This is useful for knowing when a hash table is appropriate for a problem.
Power of 2 | Exact Value | Storage size
--- | --- | ---
7 | 128 | 128 B
8 | 256 | 256 B
10 | 1024 | 1 KB
16 | 65,536 | 64 KB
20 | 1,048,576 | 1 MB
30 | 1,073,741,824 | 1 GB
32 | 4,294,967,296 | 4 GB
40 | 1,099,511,627,776 | 1 TB
64 | 1.844...E19 | 16 exabytes
+ Modified from _Cracking the Coding Interview_ By Gayle L. McDowell (p. 47)
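The two worked examples above (2^32 bytes vs. an array of 2^32 32-bit integers) can be sanity-checked in a few lines. This is a throwaway sketch; the `gigabytes` helper exists only for this example:

```go
package main

import "fmt"

// gigabytes returns how many whole gigabytes it takes to store `elems`
// elements of `bytesPerElem` bytes each (1 GB = 2^30 bytes).
func gigabytes(elems, bytesPerElem uint64) uint64 {
	return elems * bytesPerElem >> 30
}

func main() {
	n := uint64(1) << 32               // 2^32 elements
	fmt.Println(gigabytes(n, 1), "GB") // 2^32 bytes -> 4 GB
	fmt.Println(gigabytes(n, 4), "GB") // 2^32 32-bit ints -> 16 GB
}
```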
OverClocked Remix (OCRemix) is a website dedicated to serving high quality remixes of music from video games. The RSS feed is occasionally updated with the latest ten new songs that are posted on the site.
This post is about a server daemon project I wrote a year ago — it polls the RSS feed of OCRemix periodically, and converts new results to a Twitter feed. The results can be seen at @newOCRemixes on Twitter. It’s been running on Heroku 24/7 for the past year without any problems. Source code is here.
The rest of this post goes into the design and development of this project, so if that doesn’t sound interesting to you, now is a good time to hop off this train.
Motivation and design
I’ve been listening to music off of OCRemix ever since 2002. The way I used
to find new songs was to repeatedly visit their website, look for new songs,
and then click through three or four links for every new song that was posted.
I got tired of checking the website only to find there were no new songs —
or that there were ten new songs, and I’d have to click 30 times to hear all of them.
So I set out to figure out how to make it much, much easier to hear new songs and download
them if I liked them.
Existing OCRemix Feeds
OCRemix has its own Twitter feed, which they use
to post new remixes, but it isn’t limited to just new songs:
Twitter happens to have a nice feature called cards. Cards detect certain types of URLs and display information within the tweet that would otherwise only be accessible by visiting the URL. One of the
supported card types is Youtube links, which embed a Youtube video into a Tweet
whenever a Youtube link is detected.
Sometimes the links to new songs include a Youtube link to the remix for easy listening…
and sometimes they don’t:
They also have their own Youtube channel
where new songs are posted, but similarly to their Twitter feed, it is not exclusive to new songs.
My OCRemix Feed
Since I was already using Twitter as a news feed, I chose that as my target platform for
receiving new song notifications. The following is a list of some of the things I wanted
out of my Twitter feed system.
Only new songs should be posted — no reposts.
Only songs should be posted — nothing else but new OCRemix songs.
Every single song must contain a link to the Youtube version of the song.
Show the video game the song comes from, the remixers, and the original composers,
if they fit in 140 characters along with the Youtube link.
This system should not require any input from me (automated).
If something is broken, I should be notified of the breakage.
Initially, I wanted to include direct download links in each Tweet to simplify downloading the mp3 for a new song. However, because ocremix.org is a non-profit that relies on advertising revenue from its main website to pay for bandwidth, I chose to link to their site for downloads out of politeness.
This is what the final version of my implementation looks like:
Assuming this information fits into the 140 character limit, each one of these song tweets is composed of the following:
songId - Integer, official remix number (2827 at the time of writing).
title - Song title, given to the song by the remixers.
remixers - The remix artists that created the remix.
composers - The original artists that created the original song.
youtubeUrl - Link to the remix on Youtube.
writeupUrl - Official OCRemix link with remix information and a download link.
The actual project was implemented using Scala. The main libraries used were
Dispatch for OAuth and talking with Twitter’s API, and Akka for the periodic
polling of the RSS feed.
Fitting Info into a Tweet
Trying to detect whether or not a Tweet will fit in 140 characters is tricky, especially
if there are URLs in the Tweet. Twitter automatically shortens URLs for you — the caveat
is that the length of the shortened URL, not the length of the original URL,
is what counts toward the 140 character limit.
So how do you figure out what the shortened URL size will be?
Twitter happens to have a configuration API that reports the current length
of shortened URLs (https links are one character longer than http ones). Since this number can
change, it can’t be hardcoded into the system — but it’s also not likely
to change often.
To take this varying URL length into consideration, the previously known shortened URL length is cached in memory for quick usage. I poll the configuration API
every 24 hours to check for and update the URL length cache if necessary.
This polling is done using Akka’s scheduler:
```scala
val twitterConfigUpdaterSchedule = MySystem().scheduler.schedule(
  initialDelay = 0 seconds,
  frequency = 1 day,
  receiver = configUpdater,
  // If I was writing this again, I probably wouldn't use a "doit" message
  // for initiating an update.
  message = "doit"
)
```
Sometimes there’s just too much information to include in 140 characters. Going back
to my design goal of making it easy to hear new songs, the Youtube link is the
most important thing to include in the Tweet.
Here’s a shortened version of the code that figures out what to put into a tweet:
```scala
// The youtube URL is in every possible one of these.
lazy val v0 = Tweet(songId, title, remixers, composers, youtubeUrl, writeupUrl)
lazy val v1 = Tweet(songId, title, remixers, youtubeUrl, writeupUrl)
lazy val v2 = Tweet(songId, title, youtubeUrl, writeupUrl)
lazy val v3 = Tweet(songId, title, youtubeUrl)
lazy val v4 = Tweet(songId, youtubeUrl)

// Start with the longest possible, then slowly go to the shortest.
if (v0.isTweetable) v0
else if (v1.isTweetable) v1
else if (v2.isTweetable) v2
else if (v3.isTweetable) v3
else v4
```
Unless Twitter plans on upping the shortened URL length to ~135 characters,
or the ids start jumping by several orders of magnitude per new remix,
the smallest tweet content (songId, youtubeUrl) is good enough.
There’s a reason songId is included in all of the possibilities as well, more on that later.
Receiving Error Notifications
In order to make sure I receive a notification whenever something breaks, every single
method that could cause an error returns an Either[String, _], where
the Right case stores the successful result, and the Left case stores a string explaining what went wrong. Whenever a method completes
with the Left case, a direct message with information about the failure
is sent to my personal Twitter handle.
Here’s what error messages look like when they reach my Twitter inbox:
Some examples of methods that might fail, and that I want to be
alerted about if they do:
```scala
// Sends a private message to someone on Twitter
def directMessage(user: TwitterHandle, message: String): Future[Either[String, String]]

// Tweets a new post
def statusesUpdate(tweet: Tweetable): Future[Either[String, String]]

// Retrieves the last few Tweets from a user's timeline
def userTimeline(userId: String, screenName: String, count: Int): Future[Either[String, String]]

// Retrieves configuration information about Twitter
def helpConfiguration: Future[Either[String, TwitterConfiguration]]
```
(In retrospect, just using Future[String] would have been enough, since a Future
stores the exception information in case of failure… but keep in mind I wrote this code as a novice, just as I started learning Scala.)
Polling and Parsing the RSS Feed
There’s nothing particularly fancy about this part — it mostly consists of
periodically polling OCRemix’s RSS and
parsing out new songs to post on Twitter.
Here’s the code that schedules an RSS polling check every half hour:
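It follows the same pattern as the config updater shown earlier — sketched here, with the rssChecker receiver name being an assumption on my part:

```scala
// Sketch following the config updater pattern; `rssChecker` is an assumed
// name for the actor that fetches the RSS feed and diffs it for new songs.
val rssPollSchedule = MySystem().scheduler.schedule(
  initialDelay = 0 seconds,
  frequency = 30 minutes,
  receiver = rssChecker,
  message = "doit"
)
```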
In order to figure out whether or not a song is new, I send a request to Twitter’s API to retrieve the @newOCRemixes timeline, check the last Tweet’s songId, and make sure any new song I post
has a songId greater than the one on the timeline.
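Sketched out, with hypothetical fetchRssSongs and latestTweetedSongId helpers standing in for the RSS parsing and the timeline lookup:

```scala
// Sketch; `fetchRssSongs` and `latestTweetedSongId` are hypothetical helpers.
val newSongs = fetchRssSongs().filter(_.songId > latestTweetedSongId())
// Post oldest first, so the timeline's last Tweet always has the highest songId.
newSongs.sortBy(_.songId).foreach(postTweet)
```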
Things to Improve
All of this mostly works. A few problems remain that would have taken more time
to solve than I wanted to spend on this project.
One of these problems is that the RSS feed
only stores the 10 latest songs — if more than 10 songs are ever posted
within a 30 minute interval, any songs before the last 10 will never get posted.
This has only happened once since I launched this project, and there’s not
much that can be done about it, short of asking the OCRemix owners not to post
so many songs at once, asking them to up the item count on their RSS feed… or polling the website
itself and parsing the song URLs from raw HTML.
Another problem (one that hasn’t actually caused much trouble so far) is that the error
reporting depends on Twitter’s API working. If there’s a problem with
the directMessage call that reports errors to me, I won’t be notified. I could have solved this by setting up a secondary notification channel such as email, but errors happen so rarely that it wasn’t worth the trouble.
The third and last problem that I’ve run into is actually not one I can do much about. The Twitter card feature for Youtube videos sometimes won’t work — a few Tweets just won’t embed a Youtube video into the post, even if there’s a valid Youtube URL in the message.
Having said that, I’m pretty happy with the results — it’s been working without a hitch in the year since it was deployed.
Pretty cool, right? Unfortunately, there haven’t been many updates to the official version of this super awesome project lately,
so I decided to fork the project and start improving it myself. Since I use Scala quite often, I’ve got some pretty strong motivation to work on it.
Here’s a quick summary of what I’ve added to Scalariform so far:
```scala
// Parameter names, types, and defaults are aligned into three separate columns
def showInput[A](
    parent:      Component     = null,
    message:     Any,
    title:       String        = uiString("OptionPane.inputDialogTitle"),
    messageType: Message.Value = Message.Question,
    icon:        Icon          = EmptyIcon,
    entries:     Seq[A]        = Nil,
    initial:     A): Option[A]

// Two newlines will result in separate alignment groups
case class Cake(
    icingFlavor: Flavor = Vanilla,
    cakeFlavor:  Flavor = Chocolate,

    candles:  Int     = 1,
    layers:   Int     = 3,
    iceCream: Boolean = false)

// Same feature working with method calls
o.manyArguments(
  abc      = 0,
  abcOne   = 1,
  abcTwo,
  abcThree = 3,
  abcFour  = 4,
  abcFive  = 3)
```
And here’s how to use my version:
Scalariform Formatting with My Changes
```scala
// Add this to .../project/plugins.sbt
resolvers += "Sonatype OSS Snapshots" at
  "https://oss.sonatype.org/content/repositories/snapshots"

addSbtPlugin("com.danieltrinh" % "sbt-scalariform" % "1.3.0-SNAPSHOT")
```