Joho the Blog » liveblog

November 15, 2013

[liveblog] Noam Chomsky and Bart Gellman at Engaging Data

I’m at the Engaging Data 2013conference where Noam Chomsky and Pulitzer Prize winner (twice!) Barton Gellman are going to talk about Big Data in the Snowden Age, moderated by Ludwig Siegele of the Economist. (Gellman is one of the three people Snowden vouchsafed his documents with.) The conference aims at having us rethink how we use Big Data and how it’s used.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

LS: Prof. Chomsky, what’s your next book about?

NC: Philosophy of mind and language. I’ve been writing articles that are pretty skeptical about Big Data. [Please read the orange disclaimer: I’m paraphrasing and making errors of every sort.]

LS: You’ve said that Big Data is for people who want to do the easy stuff. But shouldn’t you be thrilled as a linguist?

NC: When I got to MIT at 1955, I was hired to work on a machine translation program. But I refused to work on it. “The only way to deal with machine translation at the current stage of understanding was by brute force, which after 30-40 years is how it’s being done.” A principled understanding based on human cognition is far off. Machine translation is useful but you learn precisely nothing about human thought, cognition, language, anything else from it. I use the Internet. Glad to have it. It’s easier to push some buttons on your desk than to walk across the street to use the library. But the transition from no libraries to libraries was vastly greater than the transition from librarites to Internet. [Cool idea and great phrase! But I think I disagree. It depends.] We can find lots of data; the problem is understanding it. And a lot of data around us go through a filter so it doesn’t reach us. E.g., the foreign press reports that Wikileaks released a chapter about the secret TPP (Trans Pacific Partnership). It was front page news in Australia and Europe. You can learn about it on the Net but it’s not news. The chapter was on Intellectual Property rights, which means higher prices for less access to pharmaceuticals, and rams through what SOPA tried to do, restricting use of the Net and access to data.

LS: For you Big Data is useless?

NC: Big data is very useful. If you want to find out about biology, e.g. But why no news about TPP? As Sam Huntington said, power remains strongest in the dark. [approximate] We should be aware of the long history of surveillance.

LS: Bart, as a journalist what do you make of Big Data?

BG: It’s extraordinarily valuable, especially in combination with shoe-leather, person-to-person reporting. E.g., a colleague used traditional reporting skills to get the entire data set of applicants for presidential pardons. Took a sample. More reporting. Used standard analytics techniques to find that white people are 4x more likely to get pardons, that campaign contributors are also more likely. It would be likely in urban planning [which is Senseable City Labs’ remit]. But all this leads to more surveillance. E.g., I could make the case that if I had full data about everyone’s calls, I could do some significant reporting, but that wouldn’t justify it. We’ve failed to have the debate we need because of the claim of secrecy by the institutions in power. We become more transparent to the gov’t and to commercial entities while they become more opaque to us.

LS: Does the availability of Big Data and the Internet automatically mean we’ll get surveillance? Were you surprised by the Snowden revelations>

NC: I was surprised at the scale, but it’s been going on for 100 years. We need to read history. E.g., the counter-insurgency “pacification” of the Philippines by the US. See the book by McCoy [maybe this. The operation used the most sophisticated tech at the time to get info about the population to control and undermine them. That tech was immediately used by the US and Britain to control their own populations, .g., Woodrow Wilson’s Red Scare. Any system of power — the state, Google, Amazon — will use the best available tech to control, dominate, and maximize their power. And they’ll want to do it in secret. Assange, Snowden and Manning, and Ellsberg before them, are doing the duty of citizens.

BG: I’m surprised how far you can get into this discussion without assuming bad faith on the part of the government. For the most part what’s happening is that these security institutions genuinely believe most of the time that what they’re doing is protecting us from big threats that we don’t understand. The opposition comes when they don’t want you to know what they’re doing because they’re afraid you’d call it off if you knew. Keith Alexander said that he wishes that he could bring all Americans into this huddle, but then all the bad guys would know. True, but he’s also worried that we won’t like the plays he’s calling.

LS: Bruce Schneier says that the NSA is copying what Google and Yahoo, etc. are doing. If the tech leads to snooping, what can we do about it?

NC: Govts have been doing this for a century, using the best tech they had. I’m sure Gen. Alexander believes what he’s saying, but if you interviewed the Stasi, they would have said the same thing. Russian archives show that these monstrous thugs were talking very passionately to one another about defending democracy in Eastern Europe from the fascist threat coming from the West. Forty years ago, RAND released Japanese docs about the invasion of China, showing that the Japanese had heavenly intentions. They believed everything they were saying. I believe these are universals. We’d probably find it for Genghis Khan as well. I have yet to find any system of power that thought it was doing the wrong thing. They justify what they’re doing for the noblest of objectives, and they believe it. The CEOs of corporations as well. People find ways of justifying things. That’s why you should be extremely cautious when you hear an appeal to security. It literally carries no information, even in the technical sense: it’s completely predictable and thus carries no info. I don’t doubt that the US security folks believe it, but it is without meaning. The Nazis had their own internal justifications.

BG: The capacity to rationalize may be universal, but you’ll take the conversation off track if you compare what’s happening here to the Stasi. The Stasi were blackmailing people, jailing them, preventing dissent. As a journalist I’d be very happy to find that our govt is spying on NGOs or using this power for corrupt self-enriching purposes.

NC: I completely agree with that, but that’s not the point: The same appeal is made in the most monstrous of circumstances. The freedom we’ve won sharply restricts state power to control and dominate, but they’ll do whatever they can, and they’ll use the same appeals that monstrous systems do.

LS: Aren’t we all complicit? We use the same tech. E.g., Prof. Chomsky, you’re the father of natural language processing, which is used by the NSA.

NC: We’re more complicit because we let them do it. In this country we’re very free, so we have more responsibility to try to control our govt. If we do not expose the plea of security and separate out the parts that might be valid from the vast amount that’s not valid, then we’re complicit because we have the oppty and the freedom.

LS: Does it bug you that the NSA uses your research?

NC: To some extent, but you can’t control that. Systems of power will use whatever is available to them. E.g., they use the Internet, much of which was developed right here at MIT by scientists who wanted to communicate freely. You can’t prevent the powers from using it for bad goals.

BG: Yes, if you use a free online service, you’re the product. But if you use a for-pay service, you’re still the product. My phone tracks me and my social network. I’m paying Verizon about $1,000/year for the service, and VZ is now collecting and selling my info. The NSA couldn’t do its job as well if the commercial entities weren’t collecting and selling personal data. The NSA has been tapping into the links between their data centers. Google is racing to fix this, but a cynical way of putting this is that Google is saying “No one gets to spy on our customers except us.”

LS: Is there a way to solve this?

BG: I have great faith that transparency will enable the development of good policy. The more we know, the more we can design policies to keep power in place. Before this, you couldn’t shop for privacy. Now a free market for privacy is developing as the providers now are telling us more about what they’re doing. Transparency allows legislation and regulation to be debated. The House Repubs came within 8 votes of prohibiting call data collection, which would have been unthinkable before Snowden. And there’s hope in the judiciary.

NC: We can do much more than transparency. We can make use of the available info to prevent surveillance. E.g., we can demand the defeat of TPP. And now hardware in computers is being designed to detect your every keystroke, leading some Americans to be wary of Chinese-made computers, but the US manufacturers are probably doing it better. And manufacturers for years have been trying to dsign fly-sized drones to collect info; that’ll be around soon. Drones are a perfect device for terrorists. We can learn about this and do something about it. We don’t have to wait until it’s exposed by Wikileaks. It’s right there in mainstream journals.

LS: Are you calling for a political movement?

NC: Yes. We’re going to need mass action.

BG: A few months ago I noticed a small gray box with an EPA logo on it outside my apartment in NYC. It monitors energy usage, useful to preventing brown outs. But it measures down to the apartment level, which could be useful to the police trying to establish your personal patterns. There’s no legislation or judicial review of the use of this data. We can’t turn back the clock. We can try to draw boundaries, and then have sufficient openness so that we can tell if they’ve crossed those boundaries.

LS: Bart, how do you manage the flow of info from Snowden?

BG: Snowden does not manage the release of the data. He gave it to three journalists and asked us to use your best judgment — he asked us to correct for his bias about what the most important stories are — and to avoid direct damage to security. The documents are difficult. They’re often incomplete and can be hard to interpret.

Q&A

Q: What would be a first step in forming a popular movement?

NC: Same as always. E.g., the women’s movement began in the 1960s (at least in the modern movement) with consciousness-raising groups.

Q: Where do we draw the line between transparency and privacy, given that we have real enemies?

BG: First you have to acknowledge that there is a line. There are dangerous people who want to do dangerous things, and some of these tools are helpful in preventing that. I’ve been looking for stories that elucidate big policy decisions without giving away specifics that would harm legitimate action.

Q: Have you changed the tools you use?

BG: Yes. I keep notes encrypted. I’ve learn to use the tools for anonymous communication. But I can’t go off the grid and be a journalist, so I’ve accepted certain trade-offs. I’m working much less efficiently than I used to. E.g., I sometimes use computers that have never touched the Net.

Q: In the women’s movement, at least 50% of the population stood to benefit. But probably a large majority of today’s population would exchange their freedom for convenience.

NC: The trade-off is presented as being for security. But if you read the documents, the security issue is how to keep the govt secure from its citizens. E.g., Ellsberg kept a volume of the Pentagon Papers secret to avoid affecting the Vietnam negotiations, although I thought the volume really only would have embarrassed the govt. Security is in fact not a high priority for govts. The US govt is now involved in the greatest global terrorist campaign that has ever been carried out: the drone campaign. Large regions of the world are now being terrorized. If you don’t know if the guy across the street is about to be blown away, along with everyone around, you’re terrorized. Every time you kill an Al Qaeda terrorist, you create 40 more. It’s just not a concern to the govt. In 1950, the US had incomparable security; there was only one potential threat: the creation of ICBM’s with nuclear warheads. We could have entered into a treaty with Russia to ban them. See McGeorge Bundy’s history. It says that he was unable to find a single paper, even a draft, suggesting that we do something to try to ban this threat of total instantaneous destruction. E.g., Reagan tested Russian nuclear defenses that could have led to horrible consequences. Those are the real security threats. And it’s true not just of the United States.

Follow me

1 Comment »

October 25, 2013

[dplafest] Advanced Research and the DPLA

I’m at a DPLAfest session. Jean Bauer (Digital Humanities Librarian, Brown U.), Jim Egan (English Prof, Brown), Kathryn Shaughnessy (Assoc. Prof, University Libraries, St. John’s U), and David Smth (Ass’t Prof CS, Northeastern).

Rather than liveblogging in this blog, I contributed to the collaboratively-written Google Doc designated for the session notes. It’s here.

Follow me

Categories: libraries, liveblog Tagged with: dpla • dplafest • liveblog • research Date: October 25th, 2013 dw

Be the first to comment »

[dplafest] Dan Cohen opens DPLA meeting

Dan Cohen has some announcements in his welcome to the DPLAfest.

NOTE: Live-blogging. Getting things wrong. Missing points. Omitting key information. Introducing artificial choppiness. Over-emphasizing small matters. Paraphrasing badly. Not running a spellpchecker. Mangling other people’s ideas and words. You are warned, people.

The collection now has 5M items. These come from partner hubs (large institutions) and service hubs (aggregations of smaller providers). Three new hubs have joined, bringing the total to nine, from NY, North Carolina, and Texas. Dan stresses the diversity of contributors.

The DPLA sends visitors back to the contributing organizations. E.g., Minnesota Reflections is up 55% in visitors and 62% in unique visitors over the year since it joined the DPLA.

He also announces the DPLA Bookshelf, which is a contribution from the Harvard Library Innovation Lab that I co-direct. It’s an embedded version of the Stacklife browser, which you can see by going to DP.LA and searching for a book. (You can use the Harvard version here.

Dan announces a $1M grant from the Bill & Melinda Gates Foundation, to help local libraries curate material in the DPLA and start scanning in local collections. Also, an anonymous donor gave $450,000. [I don’t want to say who it was, but, well, you’re welcome.] Dan Cohen suggests we become a sponsor athttp://www.dp.la/donate. T-shirts and, yes, tote bags.

There have been 1,7M uses of the DPLA API as of September 2013. Examples of work already done:

Culture Collage
Open Pics (a mobile app that uses the geocoding of items that the DPLA does)
Serendip-o-automatic (from NEH): paste in some text and it will show you related material.

Dan talks about DPA Local, and idea that would enable local communities to use the services the DPLA provides.

Dan says that all of the sessions have Google Docs already set up for collaborative note-taking [an approach I’m very fond of].

Follow me

Categories: libraries, liveblog Tagged with: dpla • libraries • liveblog Date: October 25th, 2013 dw

Be the first to comment »

June 20, 2013

[lodlam] Topics for Day 2

Here are the sessions people are proposing for the second day of the LODLAM conference in Montreal:

Getty Vocabulary goes open

Linked data on mobiles, wearable devices

Do cool things with the data sets that you have on your laptop – let’s build stuff!

Your tools and solutions

NLP for linked open data for libraries, archives, and museums. Data extraction, taxonomy alignment, context extraction, etc.

World War I in LOD

LOD and accessibility & assistive devices

The Pundit software package

the KARMA mapping tool

Tools and techniques for generating concordances between people

Why Schema.org?

Copying and synching linked data

FRBR and other standards [couldn’t hear]

How to create a new generation of LOD professionals. Getting students involved in projects.

The future of LODLAM

Normalizing ata models and licensing models

The official list is here.

Follow me

Categories: libraries, liveblog Tagged with: linked data • liveblog • lodlam Date: June 20th, 2013 dw

Be the first to comment »

June 19, 2013

[lodlam] Convert to RDF with KARMA

KARMA from University of Southern California takes tools for a wide variety of sources and maps it to your ontologies and generates linked data. It is open source and free. [I have not even re-read this post. Running to the next session.]

They are demo-ing using a folder full of OWL ontology files. [OWL files contain the rules that define ontologies. KARMA runs in your browser. The mapping format is R2RML, which is designed for relational databases, but they’ve extended it to handle more types of databases. You can import from a database, files, or a service. For the demo, they’re using CSV files from a Smithsonian database that consists of display names, IDs represented unique people, and a variant or married name. They want to map it to the Europeana ontology. KARMA shows the imported CSV and lets you (for example) create a URI for every person’s name in the table. You can use Python to transform the variant names into a standard name ontology, e.g. transforming “married name” into aac-ont:married (American Art Consortium), You can model the data and it learns it. E.g., it asks if you want to map the original’s ConstituentID to saam-ont:constituentID or saam-ont:objectId. (It recognizes that the ID is all numerals.) There’s an advanced option that lets you mp it to, for example, a URI for aac-ont:Person1.

He clicks on the “display name” and KARMA suggests that it’s a SKOS altLabel, or a FOAF name, etc. If there are no useful suggestions, you can pick one that’s close and then edit it. You can browse the ontologies in the folders you’ve configured it to load. You can have synonyms (“a FOAF person can be a SKOS person.”) [There’s yet more functionality, but this where I topped out.]

You can save this as a process that can be run in batch mode.

Follow me

Categories: libraries Tagged with: libraries • liveblog • lodlam Date: June 19th, 2013 dw

Be the first to comment »

June 2, 2013

[2b2k] Knowledge in its natural state

I gave a 20 minute talk at the Wired Next Fest in Milan on June 1, 2013. Because I needed to keep the talk to its allotted time and because it was being simultaneously translated into Italian, I wrote it out and gave a copy to the translators. Inevitably, I veered from the script a bit, but not all that much. What follows is the script with the veerings that I can remember. The paragraph breaks track to the slide changes

(I began by thanking the festival, and my progressive Italian publisher, Codice Edizioni Codice are pragmatic idealists and have been fantastic to work with.)

Knowledge seems to fit so perfectly into books. But to marvel at how well Knowledge fits into books…

… is to marvel at how well each rock fits into its hole in the ground. Knowledge fits books because we’ve shaped knowledge around books and paper.

And knowledge has taken on the properties of books and paper. Like books, knowledge is ordered and orderly. It is bounded, just as books stretch from cover to cover. It is the product of an individual mind that then is filtered. It is kept private and we’re not responsible for it until it’s published. Once published, it cannot be undone. It creates a privileged class of experts, like the privileged books that are chosen to be published and then chosen to be in a library

Released from the bounds of paper, knowledge takes on the shape of its new medium, the Internet. It takes on the properties of its new medium just it had taken on the properties of its old paper medium. It’s my argument today that networked knowledge assumes a more natural shape. Here are some of the properties of new, networked knowledge

1. First, because it’s a network, it’s linked.

2. These links have no natural stopping point for your travels. If anything, the network gives you temptations to continue, not stopping points.

3. And, like the Net, it’s too big for any one head, Michael Nielsen, the author of Reinventing Discovery, uses the discovery of the Higgs Boson as an example. That discovery required gigantic networks of equipment and vast networks of people. There is no one person who understands everything about the system that proved that that particle exists. That knowledge lives in the system, in the network.

4. Like the net, networked knowledge is in perpetual disagreement. There is nothing about which everyone agrees. We like to believe this is a temporary state, but after thousands of years of recorded history, we can now see for sure that we are never going to agree about anything. The hope for networked knoweldge is that we’re learning to disagree more fruitfully, in a linked environment

5. And, as the Internet makes very clear, we are fallible creatures. We get everything wrong. So, networked knowledge becomes more credible when it acknowledges fallibility. This is very different from the old paper based authorities who saw fallibility as a challenge to their authority.

6. Finally, knowledge is taking on the humor of the Internet. We’re on the Internet voluntarily and freed of the constrictions of paper, it turns out that we like being with one another. Even when the topic is serious like this topic at Reddit [a discussion of a physics headline], within a few comments, we’re making jokes. And then going back to the serious topic. Paper squeezed the humor out of knowledge. But that’s unnatural.

These properties of networked knowledge are also properties of the Network. But they’re also properties that are more human and more natural than the properties of traditional knowledge.

But there’s one problem:

There is no such thing as natural knowledge. Knowledge is a construct. Our medium may have changed, but we haven’t, at least so it seems. And so we’re not free to reinvent knowledge any way we’d like. Significant problems based on human tendencies are emerging. I’ll point to four quick problem areas.

First, We see the old patterns of concentration of power reemerge on the Net. Some sites have an enormous number of viewers, but the vast majority of sites have very few. [Slide shows Clay Shirky’s Power Law distribution chart, and a photo of Clay]

Albert-László Barabási has shown that this type of clustering is typical of networks even in nature, and it is certainly true of the Internet

Second, on the Internet, without paper to anchor it, knowledge often loses its context. A tweet…

Slips free into the wild…

It gets retweeted and perhaps loses its author

And then gets retweeted and lose its meaning. And now it circulates as fact. [My example was a tweet about the government not allowing us to sell body parts morphing into a tweet about the government selling body parts. I made it up.]

Third, the Internet provides an incentive to overstate.

Fourth, even though the Net contains lots of different sorts of people and ideas and thus should be making us more open in our beliefs…

… we tend to hang out with people who are like us. It’s a natural human thing to prefer people “like us,” or “people we’re comfortable with.” And this leads to confirmation bias — our existing beliefs get reinforced — and possibly to polarization, in which our beliefs become more extreme.

This is known as the echo chamber problem, and it’s a real problem. I personally think it’s been overstated, but it is definitely there.

So there are four problems with networked knowledge. Not one of them is new. Each has a analog from before the Net.

The loss of context has always been with us. Most of what we believe we believe because we believe it, not because of evidence. At its best we call it, in English, common sense. But history has shown us that common sense can include absurdities and lead to great injustices.
Yes, the Net is not a flat, totally equal place. But it is far less centralized than the old media were, where only a handful of people were allowed to broadcast their ideas and to choose which ideas were broadcast.
Certainly the Internet tends towards overstatement. But we have had mass media that have been built on running over-stated headlines. This newspaper [Weekly World News] is a humor paper, but it’s hard to distinguish from serious broadcast news.
And speaking of Fox, yes, on the Internet we can simply stick with ideas that we already agree with, and get more confirmed in our beliefs. But that too is nothing new. The old media actually were able to put us into even more tightly controlled echo chambers. We are more likely to run into opposing ideas — and even just to recognize that there are opposing ideas — on the Net than in a rightwing or leftwing newspaper.

It’s not simply that all the old problems with knowledge have reemerged. Rather, they’ve re-emerged in an environment that offers new and sometimes quite substantial ways around them.

For example, if something loses its context, we can search for that context. And links often add context.
And, yes, the Net forms hubs, but as Clay Shirky and Chris Anderson have pointed out, the Net also lets a long tail form, so that voices that in the past simply could not have been heard, now can be. And the activity in that long tail surpasses the attention paid to the head of the tail.
Yes, we often tend to overstate things on the Net, but we also have a set of quite powerful tools for pushing back. We review our reviews. We have sites like the well-regarded American site, Snopes.com, that will tell you if some Internet rumor is true. Snopes is highly reliable. Then we have all of the ways we talk with one another on the Net, evaluating the truth of what we’ve read there.
And, the echo chamber is a real danger, but we also have on the Net the occasional fulfillment of our old ideal of being able to have honest, respectful conversations with people with whom we fundamentally disagree. These examples are from Reddit, but there are others.

So, yes, there are problems of knowledge that persist even when our technology of knowledge changes. That’s because these are not technical problems so much as human problems…

…and thus require human solutions. And the fundamental solution is that we need to become more self-aware about knowledge.

Our old technology — paper — gave us an idea of knowledge that said that knowledge comes from experts who are filtered, printed, and then it’s settled, because that’s how books work. Our new technology shows us we are complicit in knowing. In order to let knowledge get as big as our new medium allows, we have to recognize that knowledge comes from all of us (including experts), it is to be linked, shared, discussed, argued about, made fun of, and is never finished and done. It is thoroughly ours – something we build together, not a product manufactured by unknown experts and delivered to us as if it were more than merely human.

The required human solution therefore is to accept our human responsibility for knowledge, to embrace and improve the technology that gives knowledge to us –- for example, by embracing Open Access and the culture of linking and of the Net, and to be explicit about these values.

Becoming explicit is vital because our old medium of knowledge did its best to hide the human qualities of knowledge. Our new medium makes that responsibility inescapable. With the crumbling of the paper authorities, it bcomes more urgent than ever that we assume personal and social responsibility for what we know.

Knowing is an unnatural act. If we can remember that –- remember the human role in knowing — we now have the tools and connections that will enable even everyday knowledge to scale to a dimension envisioned in the past only by the mad and the God-inspired.

Thank you.

Follow me

Categories: culture, liveblog, too big to know Tagged with: 2b2k • conferences • italy • liveblog • milan Date: June 2nd, 2013 dw

3 Comments »

May 15, 2013

[meshcon] Ryan Carson of Treehouse

Ryan Carson [twitter:RyanCarson] of Treehouse at the Mesh Conference is keynoting the Mesh Conference. He begins his introduction of himself by saying he is a father, which I appreciate. Treehouse is an “online education company that teaches technology. We hope we can remove the need to go to university to do technology.”

Treehouse “treasures personal time.” They work a 4-day week, 8 hours a day, although they pay for a full 40-hour week. He asks how many people in the audience work for themselves or run their own company; half the people raise their hands. “We have a fundamental belief that people can work smarter, and thus faster…We use a lot of tools that decrease drag.” E.g., they have an internal version of Reddit called “Convoy.” It keeps conversation out of email. “We ask people to never put anything in email that isn’t actionable.” A 4 day week also makes recruiting easy.

“As a father, I realize I’m going to die, sooner rather than later. If I work four days a week, I can send 50% more of my life with my wife and kids.”

Q: Why not a 3 day week?

A: It’s a flag to say “We believe personal time is important.” We’ll do whatever we have to. I’ve told people not to send email over the weekend because it makes work for others.

Q: How about flex time instead?

A: We have tried that, and we let people work from home. “People are smart and motivated and want to succeed. We presume that about people.” We’re demanding, and we’ll fire people if they don’t perform. But you have to institute practices, and not just say that you believe in personal time.

Q: Do you have investors? How do they respond?

A: We have $12M in investment. But we didn’t raise money until after we were profitable. I used my experience running 3 prior companies to give investors confidence. And no one asked about the 4 day week. It doesn’t seem to matter to them. My prior company was an events company and it got bought by a company that worked 5 days a week, and it was messy. I think our team there is now working 5 days.

Q: How do you provide 7 day a week support?

A: Our support team time shifts.

Q: How do you control email so that it’s only actionable?

A: It’s a policy. Also, we use Boomerang which lets us schedule when email is sent.

Now Ryan talks about the tools they use to facilitate a distributed team: about 30 people in Orlando, 8 in Portland, and the rest are distributed in the US and UK. “We don’t have a headquarters.” We are an Internet company. We use Convoy: part water cooler, part news distribution. Notes from meetings go there. It took a dev about a day to create Convoy.

We also use Campfire, a chat program. And Trello for task management. And Google Hangouts. (He notes that you have to be wired, not wifi, and have good gear, for Hangouts to work well.)

Q: Do you have to work over the weekend when there’s a hard deadline? And do you put more of an emphasis on planning?

A: Yes, we sometimes have worked over the weekend. And we’ve sometimes had a problem with people working too much. I think some people work without telling us, especially developers and designers. But if they have to work, their managers have failed. And it does mean we have to plan carefully.

Q: What are your annual meetups like?

A: It’s a full week. No agenda, no working. Pure get drunk, have fun. People work much harder if they like each other and believe in each other.

Now on education. By 2020, there will be 1,000,000 jobs in tech than students. Nine out of ten high schools don’t even offer computer programming classes. [Really? Apparently so. Wow.] Treehouse tries to address this, along with Udacity, CodeAcademy, Code School. In a video, Ryan says that Treehouse will cost you about $300 for an entire course of tech education, making you ready to enter the workforce. “The education system is a racket. Universities have milked us dry for ten years.” 40% of jobs in STEM are in computer science, but only 2% of STEM students are studying it. “In 41 out of 50 states coding classes don’t count toward high school graduation math or science requirements.” “In the future, most students won’t get a four year degree, and I think that’s a good thing. We are moving toward a trade school model.”

Q: Many companies use college degrees as a filter. How do you filter?

A: In 5 yrs there won’t be enough graduates for you to hire anyone because Google and FB will pay them $500,000/year. At Treehouse we apply points. You can see someone’s skills.

Q: What will people miss out on if they don’t go to college?

A: People will miss out on the social aspect, but people can’t afford to go into debt for that. College as the next step is a new idea in the past 15 years. [Really?] You’ll have free liberal arts education available through free online courses. You’ll pay for trade school training. “We’ll just have to have faith that people can be responsible adults without going to university.”

Q: How do you help people who complete your courses find job?

A: We’re rolling out an entire department for this. As you learn on Treehouse, you get points and start to establish your rank. Employers will be able to search our database saying, e.g., “I want someone with over 1,000 points in CSS, 800 points in Javascript, and 500 points in business.”

Q: How are you going to mesh these ideas into traditional education?

A: Sub-par universities will die. Education will be completely different in 10 years. We don’t know what it will be.

Ryan says that he’s not doing this for the money. “People who need education can’t afford it.”

[Judy Lee tweeted that Ryan should have asked us how many in the audience have a university degree, and how many of us regret it. Nice.]

Follow me

Categories: education, liveblog Tagged with: education • liveblog • meshcon Date: May 15th, 2013 dw

3 Comments »

April 2, 2013

[berkman] Anil Dash on “The Web We Lost”

Anil Dash is giving a Berkman lunchtime talk, titled “The Web We Lost.” He begins by pointing out that the title of his talk implies a commonality that at least once was.

[Light editing on April 3 2013.]

Anil puts up an icon that is a symbol of privately-owned public spaces in New York City. Businesses create these spaces in order to be allowed to build buildings taller than the zoning requirements allow. These are sorta kinda like parks but are not. E.g., Occupy isn’t in Zuccotti Park any more because the space is a privately-own public space, not a park. “We need to understand the distinction” between the spaces we think are public and the ones that are privately owned.

We find out about these when we transgress rules. We expect to be able to transgress in public spaces, but in these privately-owned spaces we cannot. E.g., Improv Everywhere needs to operate anonymously to perform in these spaces. Anil asks us to imagine “a secretive, private ivy league club.” He is the son of immigrants and didn’t go to college. “A space even as welcoming as this one [Harvard Berkman] can seem intimidating.” E.g., Facebook was built as a private club. It welcomes everyone now, but it still doesn’t feel like it’s ours. It’s very hard for a business to get much past its origins.

One result of online privately-owned public spaces is “the wholesale destruction of your wedding photos.” When people lose them in a fire, they are distraught because those photos cannot be replaced. Yet everyday we hear about a startup that “succeeds” by selling out, and then destroying the content that they’d gathered. We’ve all gotten the emails that say: “Good news! 1. We’re getting rich. 2. You’re not. 3. We’re deleting your wedding photos.” They can do this because of the terms of service that none of us read but that give them carte blanche. We tend to look at this as simply the cost of doing business with the site.

But don’t see it that way, Anil urges. “This is actually a battle” against the values of the early Web. In the mid to late 1990s, the social Web arose. There was a time when it was meaningful thing to say that you’re a blogger. It was distinctive. Now being introduced as a blogger “is a little bit like being introduced as an emailer.” “No one’s a Facebooker.” The idea that there was a culture with shared values has been dismantled.

He challenges himself to substantiate this:

“We have a lot of software that forbids journalism.” He refers to the IoS [iphone operating system] Terms of Service for app developers that includes text that says, literally: “If you want to criticize a religion, write a book.” You can distribute that book through the Apple bookstore, but Apple doesn’t want you writing apps that criticize religion. Apple enforces an anti-journalism rule, banning an app that shows where drone strikes have been.

Less visibly, the laws is being bent “to make our controlling our data illegal.” All the social networks operate as common carriers — neutral substrates — except when it comes to monetizing. The boundaries are unclear: I can sing “Happy Birthday” to a child at home, and I can do it over FaceTime, but I can’t put it up at YouTube [because of copyright]. It’s very open-ended and difficult to figure. “Now we have the industry that creates the social network implicitly interested in getting involved in how IP laws evolve.” When the Google home page encourages visitors to call their senators against SOPA/PIPA, we have what those of us against Citizens United oppose: we’re asking a big company to encourage people to act politically in a particular way. At the same time, we’re letting these companies capture our words and works and put them under IP law.

A decade ago, metadata was all the rage among the geeks. You could tag, geo-tag, or machine-tag Flickr photos. Flickr is from the old community. That’s why you can still do Creative Commons searches at Flickr. But you can’t on Instagram. They don’t care about metadata. From an end-user point of view, RSS is out of favor. The new companies are not investing in creating metadata to make their work discoverable and shareable.

At the old Suck.com, hovering on a link would reveal a punchline. Now, with the introduction of Adlinks and AdSense, Google transformed links from the informative and aesthetic, to an economic tool for search engine optimization (SEO). Within less than 6 months, linkspam was spawned. Today Facebook’s EdgeRank is based on the idea that “Likes” are an expression of your intent, which determines how FB charges for ads. We’ll see like-spammers and all the rest we saw with links. “These gestural things that were editorial or indicators of intent get corrupted right away.” There are still little islands, but for the most part these gestures that used to be about me telling you that I like your work are becoming economic actions.

Anil says that a while ago when people clicked on a link from Facebook to his blog, FB popped up a warning notice saying that it might be dangerous to go there. “The assumption is that my site is less trustworthy than theirs. Let’s say that’s true. Let’s say I’m trying to steal all your privacy and they’re not.” [audience laughs] He has FB comments on his site. To get this FB has to validate your page. “I explicitly opted in to the Facebook ecology” in part to prove he’s a moderate and in part as a convenience to his readers. At the same time, FB was letting the Washington Post and The Guardian publish within the FB walls, and FB never gave that warning when you clicked on their links. A friend at FB told Anil that the popup was a bug, which might be. But that means “in the best case, we’re stuck fixing their bugs on our budgets.” (The worst case is that FB is trying to shunt traffic away from other sites.)

And this is true for all things that compete with the Web. The ideas locked into apps won’t survive the company’s acquisition, but this is true when we change devices as well. “Content tied to devices dies when those devices become obsolete.” We have “given up on standard formats.” “Those of us who cared about this stuff…have lost,” overall. Very few apps support standard formats, with jpg and html as exceptions. Likes and follows, etc., all use undocumented proprietary formats. The most dramatic shift: we’ve lost the expectation that they would be interoperable. The Web was built out of interoperability. “This went away with almost no public discourse about the implications of it.”

The most important implication of all this comes when thinking about the Web as a public space. When the President goes on FB, we think about it as a public space, but it’s not, and dissent and transgression are not permitted. “Terms of Service and IP trump the Constitution.” E.g., every single message you put on FB during the election FB could have transformed into its opposite, and FB would be within its ToS rights. After Hurricane Sandy, public relief officials were broadcasting messages only through FB. “You had to be locked into FB to see where public relief was happening. A striking change.”

What’s most at risk are the words of everyday people. “It’s never the Pharaoh’s words that are lost to history.” Very few people opt out of FB. Anil is still on FB because he doesn’t want to lose contact with his in-laws. [See Dan Gillmor’s talk last week.) Without these privately-owned public spaces, Anil wouldn’t have been invited to Harvard; it’s how he made his name.

“The main reason this shift happened in the social web is the arrogance of the people who cared about the social web in the early days…We did sincerely care about enabling all these positive things. But the way we went about it was so arrogant that Mark Zuckerberg’s vision seemed more appealing, which is appalling.” An Ivy League kid’s software designed for a privileged, exclusive elite turned out to be more appealing than what folks like Anil were building. “If we had been listening more, and a little more open in self-criticism, it would have been very valuable.”

There was a lot of triumphalism after PIPA/SOPA went down, but it took a huge amount of hyperbole: “Hollywood wants to destroy the First Amendment, etc.” It worked once but it doesn’t scale. The willingness to pat ourselves on our back uncritically led us to vilify people who support creative industries. That comes from the arrogance that they’re dinosaurs, etc. People should see us being publicly critical of ourselves. For something to seem less inclusive than FB or Apple — incredibly arrogant, non-egalitarian cultures — that’s something we should look at very self-critically.

Some of us want to say “But it’s only some of the Web.” We built the Web for pages, but increasingly we’re moving from pages to streams (most recently-updated on top, generally), on our phones but also on bigger screens. Sites that were pages have become streams. E.g., YouTube and Yahoo. These streams feel like apps, not pages. Our arrogance keeps us thinking that the Web is still about pages. Nope. The percentage of time we spend online looking at streams is rapidly increasing. It is already dominant. This is important because these streams are controlled access. The host controls how we experience the content. “This is part of how they’re controlling the conversation.” No Open Web advocate has created a stream that’s anywhere near as popular as the sites we’re going to. The geeks tend to fight the last battle. “Let’s make an open source version of the current thing.” Instead, geeks need to think about creating a new kind of stream. People never switch to more open apps. (Anil says Firefox was an exception.)

So, what do we do? Social technologies follow patterns. It’s cyclical. (E.g., “mainframes being rebranded as The Cloud.”) Google is doing just about everything Microsoft was doing in the late 1990s. We should expect a reaction against their overreach. With Microsoft, “policy really worked.” The Consent Decree made IE an afterthought for developers. Public policy can be an important of this change. “There’s no question” that policy over social software is coming.

Also, some “apps want to do the right thing.” Anil’s ThinkUp demonstrates this. We need to be making apps that people actually want, not ones that are just open. “Are you being more attentive to what users want than Mark Zuckerberg is?” We need to shepherd and coach the apps that want to do the right thing. We count on 23 yr olds to do this, but they were in 5th grade when the environment was open. It’s very hard to learn the history of the personal software industry and how it impacted culture. “What happened in the desktop office suite wars ?” [Ah, memories!] We should be learning from such things.

And we can learn things from our own data. “It’s much easier for me to check my heart-rate than how often I’m reading Twitter.”

Fortunately, there are still institutions that care about a healthy Web. At one point there was a conflict between federal law and Terms of Service: the White House was archiving coments on its FB wall, whereas FB said you couldn’t archive for more than 24 hrs.

We should remember that ToS isn’t law. Geeks will hack software but treat ToS as sacred. Our culture is negatively impacted by ToS and we should reclaim our agency over them. “We should think about how to organize action around specific clauses in ToS.” In fact, “people have already chosen a path of civil disobedience.” E.g., search YouTube for “no infringement intended.” “It’s like poetry.” They’re saying “I’m not trying to step on your toes, but the world needs to see this.” “I’m so inspired by this.” If millions of teenagers assembled to metformin without prescription engage in civil disobedience, we’d be amazed. They do on line. They feel they need to transgress because of a creative urge, or because it’s speech with a friend not an act of publishing. “That’s the opportunity. That’s the exciting part. People are doing this every single day.

[I couldn’t capture the excellent Q&A because I was running the microphone around.]

The video of the talk will be posted here.

Follow me

Categories: culture, liveblog Tagged with: berkman • liveblog • social media • web 2.0 Date: April 2nd, 2013 dw

38 Comments »

March 28, 2013

[annotation][2b2k] Critique^it

Ashley Bradford of Critique-It describes his company’s way of keeping review and feedback engaging.

To what extent can and should we allow classroom feedback to be available in the public sphere? The classroom is a type of Habermasian civic society. Owning one’s discourse in that environment is critical. It has to feel human if students are to learn.

So, you can embed text, audio, and video feedback in documents, video and images. It translates docs into HTML. To make the feedback feel human, it uses slightly stamps. You can also type in comments, marking them as neutral, positive, or critique. A “critique panel” follows you through the doc as you read it, so you don’t have to scroll around. It rolls up comments and stats for the student or the faculty.

It works the same in different doc types, including Powerpoint, images, and video.

Critiques can be shared among groups. Groups can be arbitrarily defined.

It uses HTML 5. It’s written in Javascript, PHP, and uses Mysql.

“We’re starting with an environment. We’re building out tools.” Ashley aims for Critique^It to feel very human.

Follow me

Categories: interop, too big to know Tagged with: 2b2k • annotation • interop • liveblog Date: March 28th, 2013 dw

2 Comments »

[annotation][2b2k] Mediathread

Jonah Bossewich and Mark Philipsonfrom Columbia University talk about Mediathread, an open source project that makes it easy to annotate various digital sources. It’s used in many courses at Columbi, as well as around the world.

It comes from Columbia’s Center for New Media Teaching and Learning. It began with Vital, a video library tool. It let students clip and save portions of videos, and comment on them. Mediathread connects annotations to sources by bookmarking, via a bookmarklet that interoperates with a variety of collections. The bookmarklet scrapes the metadata because “We couldn’t wait for the standards to be developed.” Once an item is in Mediathread, it embeds the metadata as well.

It has always been conceived of a “small-group sharing and collaboration space.” It’s designed for classes. You can only see the annotations by people in your class. It does item-level annotation, as well as regions.

Mediathread connects assignments and responses, as well as other workflows. [He’s talking quickly :)]

Mediathread’s bookmarklet approach requires it to have to accommodate the particularities of sites. They are aiming at making the annotations interoperable in standard forms.

Follow me

Categories: interop, liveblog, too big to know Tagged with: 2b2k • annotation • interop • liveblog Date: March 28th, 2013 dw

Be the first to comment »

« Previous Page | Next Page »