We need a PBS for the Internet age

PBS - Public Broadcasting Service Logo

Its quite amazing to read this opinion piece in the Washington Post recently… (if you like me are reading it in Europe, you might want to try this one)

Some bits I found amazing to read, especially since the united states’s public broadcast networks are so crippled. This says it all..

Americans like public media. NPR still consistently ranks among the most trusted news sources. Likewise, Americans have rated PBS among the most trusted institutions in the United States for the past decade and a half, according to polls conducted on PBS’s behalf. But these services operate in an increasingly challenging environment. Government cuts have forced public media to become far more dependent on listener contributions, sponsorships and private donors. These organizations have had to chase audiences migrating to private platforms along with the rest of the media, meeting audiences “where they’re at.”

To their credit, public media have made an impressive effort to upgrade on a dime. PBS states that its Digital Studios division averaged more than 38 million views per month on YouTube. NPR recently co-published a report about the promise of smart-speaker devices such as Amazon Echo for audience growth.

Rather than let public broadcasters who have accrued so much public trust languish — or, worse, be co-opted by a tech industry that has a vast interest in how its portrayed — both our federal and state governments need to play a more active role in public media’s health and digital future.

What the Internet needs is a fresh infusion of public media, properly funded and paired with federal policy that puts the public interest first.

Reading this piece, further reminds me why the public service internet research work is so critical. Without public media, we are lost. Can’t even really imagine what it must be like working for PBS and NPR consistently being knocked and sliced down. I mean the BBC has troubles but not like these (yet).

Google Stadia for Interactive digital narratives?

Yesterday Google announced Stadia, their cloud gaming project. The interesting parts of the announcement are…

  1. Play now on youtube
    I love the transition from watching to playing, 5 seconds and I’m sure with time it will drop down to even less.
  2. Play on any device and completely cross platform
    Really taking complete advantage of streaming and google’s massive cloud infrastructure.
  3. Record play state to the youtube
    Completing the circle, by sharing your state (not video) back to youtube, maybe even allowing others to play again with… This makes total sense because youtube is where they can start to show adverts too; although because its all generated it would be easy to advertise in the game its self.
  4. Share play state
    As mentioned above, you are not playing a video, but the game again complete with its world state, player position and player inventory.

Google Stadia on every platform

I think its quite a compelling idea and like everyone else, are interested in how much, how easy its to build for and will google get bored and kill it? I’m less interested in the exclusive games, game pad, etc but acknowledge  it will live or die by the games.

I do think theres some incredible possibilities for other types of media especially interactive digital narratives. It certainly could blow netflix’s interactive platform out the water. Said quite a few times, I find netflix’s interactive platform is horrible when you think, theres better more engaging experiences on the console attached to the same TV or even on the mobile you are using to drive it. With Stadia, its all the same thing.

Black Mirror choices can be snooped on?

Magic box

I have so much to say about Bandersnatch, most has been written here. But its clear that Netflix haven’t given up on the medium and even doubling down on it.

Something popped into my feed about some researchers paper saying you can snoop on the choices of people using Netflix’s interactive system. I’m hardly surprised as its typical network analysis and GDPR requests. But it reminds me how important the work we have done with perceptive media is.

I best explain it as delivering (broadcasting) the experience as a contained bundle which unfolds within the safe space (maybe living room) of the audience. Nothing is sent back to the cloud/base. This is closer to the concept of broadcast and means the audience/user(s) and their data isn’t surveil by the provider. This is exactly how podcasts use to work before podcast providers started focusing on metrics and providing apps which spy on their listeners. I would suggest the recent buy out of gimlet media by spotify might point this way too?

Of course the broadcast/delivery model this doesn’t work too well for surveillance capitalism but that frankly not my problem; and all audience interaction should be (especially under HDI) explicitly agreed before data is shared or exported.

I might be idealistic about this all but frankly I know I’m on the right side of history and maybe the coming backlash.

Facial recognition’s ‘dirty little secret’: Millions of online photos scraped without consent

By Olivia Solon

Facial recognition can log you into your iPhone, track criminals through crowds and identify loyal customers in stores.

The technology — which is imperfect but improving rapidly — is based on algorithms that learn how to recognize human faces and the hundreds of ways in which each one is unique.

To do this well, the algorithms must be fed hundreds of thousands of images of a diverse array of faces. Increasingly, those photos are coming from the internet, where they’re swept up by the millions without the knowledge of the people who posted them, categorized by age, gender, skin tone and dozens of other metrics, and shared with researchers at universities and companies.

When I first heard about this story I was annoyed but didn’t think too much about it. Then later down the story, its clear they used creative commons Flickr photos.

“This is the dirty little secret of AI training sets. Researchers often just grab whatever images are available in the wild,” said NYU School of Law professor Jason Schultz.

The latest company to enter this territory was IBM, which in January released a collection of nearly a million photos that were taken from the photo hosting site Flickr and coded to describe the subjects’ appearance. IBM promoted the collection to researchers as a progressive step toward reducing bias in facial recognition.

But some of the photographers whose images were included in IBM’s dataset were surprised and disconcerted when NBC News told them that their photographs had been annotated with details including facial geometry and skin tone and may be used to develop facial recognition algorithms. (NBC News obtained IBM’s dataset from a source after the company declined to share it, saying it could be used only by academic or corporate research groups.)

And then there is a checker to see if your photos were used in the teaching of machines. After typing my username, I found out I have 207 photo(s) in the IBM dataset. This is one of them:

Not my choice of photo, just the one which comes up when using the website

Georg Holzer, uploaded his photos to Flickr to remember great moments with his family and friends, and he used Creative Commons licenses to allow nonprofits and artists to use his photos for free. He did not expect more than 700 of his images to be swept up to study facial recognition technology.

“I know about the harm such a technology can cause,” he said over Skype, after NBC News told him his photos were in IBM’s dataset. “Of course, you can never forget about the good uses of image recognition such as finding family pictures faster, but it can also be used to restrict fundamental rights and privacy. I can never approve or accept the widespread use of such a technology.”

I have a similar view to Georg, I publish almost all my flickr photos under a creative commons non-commercial sharealike licence. I swear this has been broken. I’m also not sure if the pictures are all private or not. But I’m going to find out thanks to GDPR

There may, however, be legal recourse in some jurisdictions thanks to the rise of privacy laws acknowledging the unique value of photos of people’s faces. Under Europe’s General Data Protection Regulation, photos are considered “sensitive personal information” if they are used to confirm an individual’s identity. Residents of Europe who don’t want their data included can ask IBM to delete it. If IBM doesn’t comply, they can complain to their country’s data protection authority, which, if the particular photos fall under the definition of “sensitive personal information,” can levy fines against companies that violate the law.

Expect a GDPR request soon IBM! Anything I can do to send a message I wasn’t happy with this.

Will breaking up GAFFA do any good?

Elizabeth Warren wants to break up the monopoly of the big tech companies (GAFFA) nothing that new as Tim Berners-Lee’s been saying similar for a year or so.

I do find it interesting watching the calls for Europe to get in the game, but then applying the same metrics to the European market? Something is not quite right there? Why would you want a copy of GAFFA’s, therefore recreating the cycle again?

Best of the recent TEDxManchester’s

TedX Manchester 2019

Last year I never got a chance to write about TEDxManchester 2018, partly because I tend to take pictures with my camera and its the new venue (Bridgewater hall) policy not to allow cameras in.

Regardless I went to TedXManchester 2019 (without my DSLR), and thought its about time I got back into blogging some of the best talks, especially as they are put on youtube now. Because they are on youtube so quickly, I created a playlist with the best TedXManchester videos. There are a number missing and its worth saying the list is highly opinioned. Theres some key ones from previous years gone by including my own and Carrie’s super popular one.

But I wanted to give credit to the best ones this year and last year.

2018

Last year the outstanding talk was form Vikas Shah’s How to save your own life.

A year later Vikas tweet is a perfect description.

I felt the talk was extremely brave, powerful and honest. The mental health message was powerful.

2019

This years outstanding talks were difficult to pick one. I was torn but decided although everyone loved Ged Kings talk I wasn’t super keen. I found Andrew Szydlo and Jon Carmichael’s fantastic but its not online yet so decided Katherine Ormerod spoke to me like Emma Harvey’s “Whoops, I changed the world” at TedxBradford.

Although I don’t let social media run my life, and use it a certain way which bother some. I find the continuously running theme of living life with these digital tools interesting. There was a talk just before with Chris Bailey (this is from TedxLiverpool) which was good but felt too preachy for my palliate. As I write this blog post in a coffee shop, I’m watching a woman taking a selfie with her tiny dog to a social network. She took about 12 photos before finally settling on one to post. I find the whole thing strange as posted about before, and I wonder how many are in control, following fashion, doing so out of peer/social pressure, etc…

#web30: The world wide web at 30 years old

We owe a lot to Sir Tim Berners-Lee on the 30th Anniversary of the web.

Tim Berners-Lee helped invent the world wide web 30 years ago. And he has consistently pointed out that the original dream that gave rise to it is under threat.

It is exactly 30 years since Sir Tim submitted a paper to his colleagues at CERN, suggesting a way of sharing data across networks, under the title “Information Management: A Proposal”. The humble title belies the importance of what was contained inside, which included a complete sketch for the networked information system that would on to become the internet we know today.

But its really important to think about the next 30 years.

Surveillance capitalism and governmental/state control are hot topics which very much threaten the fabric of the web. But so does our use of the web and the way we treat each other.

I had a really good 10min talk with Sir Tim Berners-Lee during the last Mozilla Festival, while talking about Solid, Databox and data trust. What got me as we talked, was ultimately we were talking about power and where it lies. Power in the hands of governments (Chinese model) , corporations (American model) or people? (could be the European model?)

I think remembering their are humans, not eyeballs, not lefties/rightwingers, etc is so important. Lets celebrate the people of the web!

https://twitter.com/whynotadoc/status/1105400124447039489

Chidi and The Good Place

The good place cast

A podcast listen to now and then is the Nod, the latest episode (doesn’t seem to be on the site yet) talked about Chidi from the good place.

This week Eric talks with Cord Jefferson, a writer for NBC’s The Good Place, about how Blackness is depicted in the afterlife

Its a  good interview exploring stereotypes of black characters and I got to say Chidi is a excellent black geek. Just what we need more of…

Re-decentralising the internet recording at Futurefest

Futurefest 2018 panel

I had the pleasure of being on the panel of re-decentralising internet at Futurefest, last summer. (when England was still in the world cup and the weather was super warm) Feels like so long ago. I’m quite glad its audio only because I was sat in the sunshine sweating a lot!

The internet isn’t where we want it to be. With power increasingly centralised in the hands of very few players, citizens have little say in where we want the internet to go next. But challenging existing dynamics won’t be easy: we find ourselves caught in the crossfire between the dominant American models (driven by Big Tech) and the increasingly powerful Chinese model (where government reigns supreme). Is there scope to create a third, European model, where citizens and communities are in charge?

In this session, we discuss alternative trust models for the internet. This session is part of the European Commission’s Next Generation Internet initiative. We will hear from Manon den Dunnen, strategic specialist at the Dutch National Police, Ian Forrester, Chief Firestarter at BBC R&D and Marta Arniani, innovation strategist and founder of Futuribile / Curating Futures. Chairing will be Katja Bego, senior researcher at Nesta and coordinator of the Next Generation Internet Engineroom project.

Thanks Katja!

Gender diversity on twitter?

Results of who I follow on twitter

I rarely read twitter due to the API changes which I’ve talked about in the past. But I saw Teknoteacher talking about changing his followers after reading about Male tech CEOs follower accounts. I thought I’d share some things I discovered too. Especially reading this a while back.

So my results are above, using the online tool – https://www.proporti.onl.

But a while ago I used Open Human’s twitter archive analyzer by Bastian Greshake Tzovaras. It was super sobering!

Here is my replies by gender from when I first started using Twitter back in 2017. As you can see there was a massive spike of conversation with males in 2012, I also generally talk to more men than women on twitter.

My replies & gender Likewise when retweeting based on gender its mainly males. Recently its a lot closer to 50% which is great but I wonder with my lack of twitter use, how that will effect things? (I have requested a new update of my twitter data)

My retweets & genderOf course my instant thought is there is noise in the figures as its not always clear if people are male or female for many reasons. But its disappointing to read Elon Musk’s tweet.

And read about others such as…

Sundar Pichai, the CEO of Google, follows 267 accounts on Twitter. Of those, 238 appear to be men. He follows nearly as many Twitter Eggs (15) as women (21).

Satya Nadella, Microsoft CEO, followed the most women (39) of any of the accounts examined by the Guardian, though that is still half the number of men he follows (78) out of a total of 165 accounts.

I’d really like to see this applied to race not just gender too. It reminds me how I was going to learn more Python so I can create this as a Juno personal notebook in Open Humans.

Updated…

I updated Open Humans with my latest Twitter data export and here are the results.
Once again very sobering to see. Got to make some changes.

Screenshot of replies for 2019

Worth adding from TwArχiv site.

The graph shows you the number of replies to Twitter users that are classified as either male or female. The classifications are predictions based on users’ first names as given in their Twitter accounts. The predictions itself are performed by the Python package gender_guesser . It uses name/gender-frequencies from a larger text corpus. mostly male, mostly female, andy and unknown classifications are ignored. To decrease the noise the daily values have been averaged by a daily average over a 180 day window (dataframe.rolling('180d').mean()).

Ideally these graphs would include non-binary folks. Doing this is a bit trickier. It is thus a work in progress.

Screenshot of retweets for 2019Also worth mentioned…

Even more interesting than whether replying to people might be gendered can be the question which voices are being amplified . On Twitter a good indicator of amplification are retweets. These can be gender balanced or show biases, similarly to the replies to other users.

The graph shows you the number of retweets to Twitter users that are classified as either male or female. The classifications are again predictions made by the Python package gender_guesser . To decrease the noise the daily values have again been averaged by a daily average over a 180 day window (dataframe.rolling('180d').mean()).

Ideally these graphs would include non-binary folks. Doing this is a bit trickier. It is thus a work in progress.