Converting posts from Blojsom/WordPress to Moveabletype

I had to write this because for weeks now I’ve been trying to convert blog entries between different blogging services.

The first one was converting Blojsom to WordPress, but this wasn’t too bad because both work around the RSS 2.0 format. Getting the comments, tags, metadata out and into the rss 2.0 feed was a real pain and I’m convinced I dropped a load of trackbacks and pingbacks in the process. This is another reason why I started using Disqus for all my comments.

The harder task was moving blog entries from WordPress to MoveableType. Yes I expect most of you are wondering why I would move from a far superior system to something which most people left in the dirt ages ago. Well unfortunately we still use MT on the bbc.co.uk/blogs platform and that means as backstage moves to the official blog platform, some mug had to find a way to down convert to MT 4.1 which only imports/exports in this crazy text only format.

So after lots of looking around, I finally found a XSL which I modified to do the job from a stripped down WordPress WXR file (RSS2.0 with lots of WP namespaces stuff). Its important that you strip down the WXR file as it might not be valid XML, so no XSL transform is going to work. I also took a bit of time to write a XSL to remove most of the namespaces elements or convert them into a more valid RSS 2.0 element. You can do the same with lots of finds and replaces, so I won’t post the simple XSL.

Hopefully this will save others a lot of time in the future, if your faced with the same problem.

Hello 2010, welcome to the new blog

2010

So I finally decided to switch my blog to wordpress and on top of that I was able to install storytlr open source.

I’m still in the process of doing all the redirects and general cleaning up but its coming together quite nicely. Once its all stable I’m hoping to spend some time sorting out the styles and themes.

Getting the entries out of Blojsom was easy as pie but then converting them into a format which WordPress wouldn’t barf on was a big problem. In the end I wrote a throw away XSL to do it, because it WordPress didn’t like namespaced elements or generally anything over the standard RSS 2.0 elements. I did manage to push over the Categories and Tags but had to split them apart in WordPress later.

My whole thing is hosted with GoDaddy on their new European Servers and will be quite slow while it caches all requests.

I love Blojsom but I never upgraded to version 3.x which required a database to work. With the need for a database, it meant the ground between WordPress and blojsom was a less so. Then add cheap hosting, amazing plugins, themes and community. And its pretty much a no brainer. I also found that less and less blog editors are supporting Blojsom (some kind of metaweblog xmlrpc category issue). So now I’m able to use Bilbo which is a KDE editor with support for pretty much everything WordPress allows. I’m also able to use Google Gears which is useful when offline.

This was also a chance to get a little more serious about my blogging and footprint online. Hence I’m really hoping to stretch what storytlr can do for me and some of the projects I have for it.

In the meantime, let me know if you see anything very weird which I may have missed….

So what happened to my blog?

The blog went down back in June and only went back up a few days ago. So what happened? Well I'll spare you the long story.

Hub.org did a upgrade on the Operating System and other bits and pieces, and broke the frankly weird setup I have for my blog. Now normally I would say they should fix it but the truth is that I've been hosting my blog on there lovely servers for years for free. Well I say free but I did do some work on the hub.org website a while back. So we have a gentlemans agreement that I don't bug them and they let me have a nice virtual private server, which is needed to run the Blojsom blogging server. So I needed to fix why Resin would start then stop. It ended up being a problem with the change of configuration in later versions of Resin.

The reason it took so long was because I tried a few things along the way to fix it and move my blog elsewhere. My previous setup was Apache running on port 80 in front of Resin on port 8080 (always worth checking port 8080 to see if my blog is actually down or not). Previously I was running Pound in front of Tomcat, now I'm using Pound again in front of Resin. Apache isn't really needed because Resin is fast enough to serve up static files along side dynamic processes.

When things went wrong, I did decide to finally move my blog from Blojsom to WordPress. To get stuff out of blojsom I needed to get the server up and running again. I won't explain how but theres ways to get every single entry out of blojsom in a xml/atom/rdf feed. Hell you can even filter between 2 dates and get those entries out in any format you like. Once blojsom was up and running on port 8080 I was able to get all my entries. I also discovered that Blojsom creator had created a export plugin which is now standard and very useful. The problem is WordPress, wordpress fails to import anything I get from Blojsom. I double checked the blojsom files and there all well formed. I've even tried importing into Blogger.com but none of them work, I think its mainly the size of the import file (almost 10meg). Ultimately I do want to move to WordPress because although Blojsom is great, its not quite as actively being developed. Also moving to WordPress will mean I can switch to a cheaper hosting plan and get full support. Hub are great but I think its worth paying for my blog to be up as much as possible. Maybe I'd host my blog somewhere else and I'd use the hub account for my experiments with Cocoon. Although with Google App engine supporting Java servelets now, even that might be a little mute.

For now, the blog will stay where it is but I may be doing some stuff else where in the background. Expect lots of entries from now on.

Comments [Comments]
Trackbacks [0]

8 Random Facts About Me (tagging meme)

Somethinkodd Diabolo

James Cridland tagged me in the blogging meme by the same name. Yes its been a while but I've been pretty busy with lots of other stuff. Unlike James I know what a meme is and I like the idea of memetics, which I first heard via Susan Blackmore. Anyway I'm sure James actually does know what a meme is and its on with the rules.

Here are the rules:

  1. Each player starts with eight random facts/habits about themselves.
  2. People who are tagged need to write a post on their own blog (about their eight things) and post these rules.
  3. At the end of your blog, you need to choose eight people to get tagged and list their names.
  4. Don’t forget to leave them a comment telling them they’re tagged, and to read your blog.
  1. My first computer was a Commodore 16
    Not a Commodore 64 or 128, 16 as in 16k of total memory. My father bought the C16 for me when I was about 7 and I loved it. I use to play games then started to get into basic programming. I still remember the weird rounded tape cassette recorder thing and the blocky as hell graphics.
  2. I have worked in some odd places including… Equinox Discotheque in Leicester Square
    I was between jobs and I only lasted 3 weeks before quiting. I learned how to pour drinks correctly and how to make a lot of cocktails. I was only paid 3.70 a hour because you were expected to make a ton of money in tips. This only worked well if you were a attractive woman. The best tips I got in one night was 20 pounds while my female equivilents were getting upwards of 100 pounds a night!
  3. I use to have hypersomnolence and now sleep Apnea
    This is a sleep disorder where I pause snoring and breathing during the night. This makes sleeping difficult and a couple of the side-effects are loud sleeping, morning headaches (like being hangover), trouble concentrating, decreased sex drive forgetfulness, nocturia, etc, etc. Now you can see why working later in the day works better for me. I have also had surgery to try and correct it but its had little to no effect. Most people think its down to weight but actually I've had this since I was really young and thin. I even know people who are very fit who have this same problem, you can imagine its a problem for the partner who thinks your dying while you sleep.
  4. I'm a Diabolist
    Whats a diabolo? Well it actually means the devil with two sticks but really its a Chinese Yo-Yo. It looks like a egg cup with a string and sticks. Anyway, I learned how to do the Yo-Yo during the coke-cola yo-yo days and got fed up because everyone had them and everyone knew how to do most of the limited tricks. So I picked up on this thing called a wow ball. It turns out the wow ball was a cheap version of a diabolo and before long I was doing most of the same tricks with a full size diabolo. When I was younger, I use to enter competitions at holiday camps and have won a couple of times including a few holidays. I'm going to start putting up videos of me doing the diabolo on to videojug or more likely sclipo, so look out for it. Although thediabolo.com might have beaten me to it (I love that light path idea).
  5. I can skateboard and rollerblade
    Yes believe it or not. I can skateboard and I use to skateboard to work and college everyday. Roughly 5+ miles a day. Could I do any tricks? Well basic jumps and thats about it. By the way I can also rollerblade ok.
  6. I think I can ski but have never tried it on snow
    Yes its odd eh? I can actually ski but I've only ski'ed on dry slopes and grass. Grass skiing is pretty cool but those boots are damm heavy.
  7. I don't know popular music
    I don't know my led zepplin from my rolling stones. Seriously I don't know much about rock or pop music. When I was young I use to listen to dance music and hiphop. I was deep in to my dance music too, I stopped listening to radio when GWR changed there policy to no rap, less chat (thanks Miles for the correction). This meant the end of Sandy Martin's disco tracking and no more dance music on local radio. Radio 1 was interesting to me but I only listen to pete tong when I was older. I was never into the MTV actually I turned my back on it and The Box because it had no rave music. I did for a while check out that show after the Fresh Prince on BBC 2, the one with Normski (good on the BBC for taking a risk with such a show, maybe this lead the way towards 1xtra?) and I was a avid watcher of the Hitman and Her (check out the episode where pete and michaela go to Eclipse in Covertry!) and after it BPM (which I can't find anywhere). I went to clubs and raves alot, the magazines I read were eternity and knowledge, the music I listened to was rave. I just didn't have the time for other music because rave culture was so involving.
  8. I have never taken any drugs stronger that caffeine and I don't like Beer
    As you can see above, I was well into the rave and dance culture, but never ever took any class A, B or C drugs. Hell I didn't even drink beer. So although I was surrounded by drug taking I never got involved. It was strange because I educated myself about drugs at a early age and while the press was having a field day about evil ravers and dealers following the Leah Betts death I was seeing strange through the bull. Although I could see ecstasy wasn't this evil drug, I just wasn't interested in taking it because for me life was wild and weird enough, plus I just loved the music. About Beer. I bought them for friends when I was younger simply because I looked old enough to pass for 18 when I was 13 (yes I was bad). So I always got a taste of their beers. I never liked it and trust me I've tried loads of beers. I even last week tried white or wheat beer, which I thought was nasty. Once when I was at a holiday camp, I bought a tray full of drinks for a group of new friends and the security guard tapped me on the back and asked if I was buying drinks for underage teenagers? I being a little shocked that he had not guessed I was also underage answered no its all for me. So he stood there for about 10mins while I slowly drank most of the drink! That was the first time I had tasted Bitter and boy oh boy was I almost sick. Beer tastes like crap to me, I don't get why people like it? The only beer I like is Ginger Beer.

I'm nominating…. (without checking to see if they have done this already)

Technorati Tags: , , , , , ,

Comments [Comments]
Trackbacks [0]

Reading the Cluetrain by our PR lady

Our PR lady started reading the Cluetrain Manfesto after I gave her a copy to read. She seems to be reading through it slowly but at the same time its brought up more questions that I'd expected. So I suggested to her that she should blog internally or even externally about reading through the book. So I was kinda of shocked when she agreed… And now reading the cluetrain is born.

I think its really good to see our PR lady blogging, shes quite strange in a nice way. Kind of person who would forgive you at that moment but would never forget what you did. Anyway, shes been willing to learn about the changes online markets make to PR and thats a really good thing. Please check out her blog, as she really wants comments and constructive ideas around what she reads and blogs.

Comments [Comments]
Trackbacks [0]

Cubicgarden.com issues recently

So if you've not noticed already, cubicgarden.com has been up and down like a yoyo on a broken string. This was not the fault of Hub.org, actually Apache 2.x has been running flawlessly through-out the troubles. Thats why you could see the 503 error and not a 404.

Without boring everyone to death, I'm running on a virtual server FreeBSD (thats right Marc?) in which I'm running Resin 3.x and Apache 2.x together connected via jk or something like that. I then use Apache for static files and Resin the Java container for dynamic applications. Currently I have Blojsom, Cocoon and ZK installed in the Java container.

So what was actually happening? Well Resin was running out of memory every few hours. But why? It seems there was a problem with the crazy amount of spam I getting on my pipeline blog and so Blojsom was trying to load most of them into memory for analysis or something. So I have deleted all comments and trackbacks from Flow currently and made it a read only place for now. I was going to use Akismet on it too but decided actually it was better as read only for now. Those really interested will email me and spammers can just die.

Hopefully now cubicgarden.com and all the other blogs hosted on the same install of blojsom will be more reliable and I won't get the floods of emails saying your site is down again. Thanks to everyone who did write. Honestly I would have done this much earlier if I was in my regular timezone.

In the long run, I'm considering putting geekdinner.co.uk on the same blojsom install, but I'm worried this will screw up the other geekdinners around the country. And geez, everyone just can't get enough of WordPress. By the way Photomatt is a very cool, down to earth guy and well worth talking to if you get a chance.

Comments [Comments]
Trackbacks [0]

MT? you might as well be dead to me

From Fowa, do you trust these people?

I've heard about the problems but have not publiclly said much. But I'm sorry as far as I'm concerned, I stopped recommending Movable Type a long time ago and can't understand why people still use it. Suw's post on strange attractor is simply awesome and well worth reading if you also recieved the email from Sixapart. But generally it doesn't scale effectively, and I'm not saying many blogging servers do. But I wonder why everyone seems to think there are only 2 blogging application servers out there?

What about Blojsom, Community Server, Dasblog, B2, Roller, etc. Theres much more to blogging servers that MT and WordPress. Go Explorer, don't be constrained by whats the norm. Thom Shannon recommended http://asymptomatic.net/blogbreakdown.htm

Comments [Comments]
Trackbacks [0]

ASCII vs Binary

Ben Metcalfe writes in response to Brad Templeton's post.

At the blogger panel at Fall VON … Vlogger and blip.tv advocate Dina Kaplan asked bloggers to start vlogging. Its started a minor debate.

My take? Please dont.

Ive written before on what I call the reader-friendly vs. writer-friendly dichotomy. My thesis is that media make choices about where to be on that spectrum, though ideal technology reduces the compromises

Brad actually goes on to advocate podcasting as an acceptable format as it can be listened to whilst jogging, etc. I think hes correct to a degree – podcasting is certainly easier to consume than video/vlogging as it is a multitaskable medium (unlike blogging too). However its still binary – it still requires you to consume at a pace decided by the producer and its far harder to index and search through. Unless it contains a meta-wrapper, there is also no hyperlinking which is arguably one of the greatest keys to the world-wide-webs success.
Its for this reason that Ive been sceptical as to the long-term success of podcasting as an informational medium. And this scepticism certainly extends to vlogging.

My take is that ascii is great but you know what, how bland would the world be if people couldn't express themselves in any form they feel most comfitable? See Ben is taking the reciever (user) point of view and I think thats fine for a lot of people. Even myself, I blog more that I podcast or videocast because I like hyperlinking. But it would be wrong for me to expect everyone to do the same. If podcasting works for you, hey go for it. Yes think about your audience you certainly need to balance that with your own creativity. So in the end I do agree with Ben's final point.

However I think most people would agree that we need to produce work in the format thats right for the content at hand, and for the end consumer whos going to consume it. For the moment at least, I think most people still use the blogosphere more for informational use than they do entertainment and as such that needs to be considered when youre about to produce your next blog/podcast/vlog

Ideally binary searching, scanning, deconstruction would be as easy as acsii but its not at the moment. I think this is mainly a limitation of the tools and services out there. The service and tool creators blame the lack of standards. So the question is where is the microformats of the media? I would have suggested Mpeg4 could have been but its got so screwed up who knows.

Comments [Comments]
Trackbacks [0]

Blogosphere is more international than ever before

I've been meaning to blog this for weeks now. Dave Sifry's latest report on the state of the blogosphere. So generally the blogosphere has becaome a lot more international with english taking a step down in the most used language in the blogosphere. Its actually better that you think too, because english now count for less than 35% of the blogosphere. Theres lots of other interesting things in the report like the Chinese blogosphere growing a lot due to MSN Spaces and Chinese and Bokee.com. Dave suggests that Japanese bloggers blog small posts from there phone, hence the huge jump. In the same post but not really realted Dave talks about how Tags and Categories are used by 47% of the blogosphere now.

Talking about languages and blogs, the BBC blogs has new additions to its own blognetwork. Spanish, Arabic and Persian blogs. The Chinese and new Urdu blog are just around the corner too. I guess this is perfectly fitting with the latest report. I have yet to try out Native text (a free web service that translates RSS feeds from blogs and podcasts into foreign languages) but it certainly sounds useful. I hear the Persian Blog already has a large audience visiting it.

Chinese just launched yesterday in simplfied chinese which causes it own problems because its all in UTF-8. It seems a lot of chinese reading people set there browsers to the encoding GB2312 or Traditional BIG5

Comments [Comments]
Trackbacks [0]