Joho the Blog » everythingIsMiscellaneous

December 11, 2010

Boston Public Library has 15,827 photos on Flickr

The Boston Public Library has put 15,827 photos into Flickr, using the least restrictive Creative Commons licenses possible. Tom Blake, the Digital Projects Manager at the BPL, reports: “[T]he images on our Flickr account have been viewed collectively over 1.6 million times since we launched the account in March of 2008.”

The photos I dipped into were well marked up with metadata, and tagged. (Their new collection is called “Misc.” :) Some great stuff there. E.g., if you’re interested in the early Red Sox, try these. Or stereopticon images.

[the next day:] Jon Udell, in a tweet [twitter: judell], points to Keene Public Library’s recent Flickr uploading. “KPL nicely models photo curation,” Jon tweets.

1 Comment »

August 19, 2009

Dilbert goes miscellaneous

Amusing Dilbert today, for those who can’t resist a good taxonomy joke. (Thanks for the tip, Helena!)

[Tags: everything_is_miscellaneous comics dilbert humor taxonomy ]

Categories: Uncategorized Tagged with: comics • dilbert • everythingIsMiscellaneous • everything_is_miscellaneous • humor • taxonomy Date: August 19th, 2009 dw

1 Comment »

August 18, 2009

RecapTheLaw.org

RecapTheLaw.org has a Firefox extension that both gives access to public docket records and makes them actually publicly accessible. The courts charge for access to these dockets, including every time you search and for every page of search results. The system is called PACER. RECAP gives you access to PACER (and is PACER spelled backwards). When you use RECAP to view a docket through PACER, RECAP uploads it into the Internet Archive, since the docket info is in the public domain even though the courts charge you for accessing it. The next time someone goes through RECAP to find that docket, she’ll get it for free from the Internet Archive. RECAP also adds helpful headers and other metadata.
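
To make that flow concrete, here is a minimal sketch of the pay-once, free-thereafter pattern. It is not RECAP's actual code: the DocketCache class, the plain dict standing in for the Internet Archive, and the fake_pacer function are all hypothetical names made up for illustration.

```python
from typing import Callable, Dict


class DocketCache:
    """Toy model of the flow described above (not RECAP's real implementation)."""

    def __init__(self, fetch_from_pacer: Callable[[str], str]) -> None:
        # Hypothetical callback standing in for a paid PACER lookup.
        self._fetch_from_pacer = fetch_from_pacer
        # A plain dict stands in for the public archive.
        self._archive: Dict[str, str] = {}
        # Counts how many times the paid source was actually hit.
        self.paid_lookups = 0

    def get(self, docket_id: str) -> str:
        # Serve the free archived copy if someone has already paid for it.
        if docket_id in self._archive:
            return self._archive[docket_id]
        # Otherwise pay once, then share the result with everyone after.
        document = self._fetch_from_pacer(docket_id)
        self.paid_lookups += 1
        self._archive[docket_id] = document
        return document


def fake_pacer(docket_id: str) -> str:
    # Made-up stand-in; a real lookup would be billed by the courts.
    return f"docket text for {docket_id}"


cache = DocketCache(fake_pacer)
first = cache.get("example-docket-1")   # paid lookup, then archived
second = cache.get("example-docket-1")  # served from the archive
```

The second request never touches the paid source, which is the whole trick: the first reader's fee ends up covering everyone who comes later.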

RecapTheLaw comes out of the Princeton Center for Information Technology Policy. Well done!

[Tags: law courts dockets ]

Categories: Uncategorized Tagged with: courts • digital rights • dockets • egov • everythingIsMiscellaneous • expertise • law • metadata Date: August 18th, 2009 dw

2 Comments »

August 14, 2009

Search Pidgin

I know I’m not the only one who’s finding WolframAlpha sometimes frustrating because I can’t figure out the magic words to use to invoke the genie. To give just one example, I can’t figure out how to see the frequency of the surnames Kumar and Weinberger compared side-by-side in WolframAlpha’s signature fashion. It’s a small thing because “surname Kumar” and “surname Weinberger” will get you info about each individually. But over and over, I fail to guess the way WolframAlpha wants me to phrase the question.

Search engines are easier because they have already trained us how to talk to them. We know that we generally get the same results whether or not we use stop words (“when,” “the,” etc.) and question marks. We eventually learn that quoting a phrase searches for exactly that phrase. We may even learn that in many engines, putting a dash in front of a word excludes pages containing it from the results, or that we can do marvelous and magical things with prefixes that end in a colon, such as site: and define:. We also learn the semantics of searching: If you want to find out the name of that guy who’s Ishmael’s friend in Moby-Dick, you’ll do best to include some words likely to be on the same page, so “What was the name of that guy in Moby-Dick who was the hero’s friend?” is way worse than “Moby-Dick harpoonist.” I have no idea what the curve of query sophistication looks like, but most of us have been trained to one degree or another by the search engines who are our masters and our betters.
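
For the curious, the conventions just listed are simple enough to sketch in a few lines. What follows is a toy parser, not how any real search engine works: the stop-word list, the ParsedQuery structure, and the parse_query function are my own assumptions, and it only understands straight double quotes.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List

# Illustrative stop-word list (an assumption; real engines keep their own).
STOP_WORDS = {"a", "an", "the", "of", "in", "when", "what", "was", "who", "that"}

# Alternatives are tried in order: "quoted phrase", prefix:value,
# -excluded (only at the start of a token, so Moby-Dick stays intact),
# and finally a bare word.
TOKEN = re.compile(r'"([^"]+)"|(\w+):(\S+)|(?<!\S)-(\w+)|(\w+)')


@dataclass
class ParsedQuery:
    phrases: List[str] = field(default_factory=list)
    operators: Dict[str, str] = field(default_factory=dict)
    excluded: List[str] = field(default_factory=list)
    terms: List[str] = field(default_factory=list)


def parse_query(query: str) -> ParsedQuery:
    """Split a query into exact phrases, prefix operators, exclusions, and terms."""
    parsed = ParsedQuery()
    for phrase, op, value, minus, word in TOKEN.findall(query):
        if phrase:
            parsed.phrases.append(phrase)
        elif op:
            parsed.operators[op.lower()] = value
        elif minus:
            parsed.excluded.append(minus.lower())
        elif word and word.lower() not in STOP_WORDS:
            # Stop words are dropped, so they do not change the result.
            parsed.terms.append(word.lower())
    return parsed


example = parse_query('the Moby-Dick harpooner "white whale" -movie site:example.org')
```

Here example.terms comes out as the three plain words, while the quoted phrase, the excluded word, and the site: prefix each land in their own slot. The pidgin is small enough that a single regular expression covers most of it.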

In short, we’re being taught a pidgin language: a simplified language for communicating across cultures. In this case, the two cultures are humans and computers. I only wish the pidgin were more uniform and useful. Google has enough dominance in the market that its syntax influences other search engines. Good! But we could use some help taking the next step, formulating more complex natural language queries in a pidgin that crosses application boundaries, and that isn’t designed for standard database queries.

Or does this already exist?

[Tags: search pidgin nlp natural_language_processing google everything_is_miscellaneous ]

Categories: Uncategorized Tagged with: everythingIsMiscellaneous • everything_is_miscellaneous • google • metadata • natural_language_processing • nlp • pidgin • search Date: August 14th, 2009 dw

4 Comments »

August 11, 2009

The universality of names

There’s a terrific article by Carol Kaesuk Yoon in the NY Times about research that shows that humans around the world tend to cluster the natural world in highly similar ways, even using similar-ish names.

[Tags: everything_is_miscellaneous taxonomy ]

Categories: Uncategorized Tagged with: everythingIsMiscellaneous • everything_is_miscellaneous • folksonomy • taxonomy Date: August 11th, 2009 dw

1 Comment »

August 9, 2009

Twitterelevancy

With its new Fresh view, Delicious builds on the TweetNews idea of using links in Tweets (and other measures) as a way to find what’s newest and most interesting. As the blog post about it says:

Underneath the hood, Fresh factors several features into the ranking like related bookmark and tweet counts, “eats our own dogfood” by leveraging BOSS to filter for high quality results, as well as stitches tweets to related articles even if the tweets do not provide matching URLs (as ~81% of tweets do not contain URLs). Try clicking the ‘x Related Tweets’ link for any given story to see the Twitter conversation appear instantly inline.

It’s a welcome reslicing, not a whole new beast, but it seems useful.

[Tags: delivious everything_is_miscellaneous twitter news ]

Categories: Uncategorized Tagged with: delivious • everythingIsMiscellaneous • everything_is_miscellaneous • metadata • news • social networks • tagging • twitter Date: August 9th, 2009 dw

1 Comment »

August 7, 2009

Tags again

Jeez, it would save me a lot of time if Keynote (or PowerPoint, if you insist) let me tag slides and objects in slides (especially images). I spend way too much time looking for that slide of a “smart room” or the one that shows business vs. end-user use of Web 2.0, or that photo of an old broadcast tower. (Later that day: Maybe I should add, having just rewritten the Wikipedia entry on Interleaf, that back in the early 1990s, Interleaf gave us exactly that capability.)

Instead, I have two hacks, both a pain in the butt. First, I keep a humungous file of slides I think I’ll want to use again. Second, I’ve started putting bracketed tags into the speaker notes. But I use the speaker notes to speak from, so larding them up with tags is sub-optimal.

And especially if you save Keynote files in the pre-2009 multi-file formats, then it’d be a snap for third parties to build tools that extract the tags and manage them. (I have a fussy home-made utility that extracts the text from the speaker notes and builds an editable file of them. If you want it, let me know.)
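
Here is roughly what such a tool could look like. This is a minimal sketch and not my utility: it assumes the speaker notes have already been pulled out of the presentation as plain text keyed by slide number, and the function names split_tags and build_tag_index are made up for the example.

```python
import re
from collections import defaultdict
from typing import Dict, List, Tuple

# Tags are written in square brackets inside the notes, e.g. "[smart room]".
TAG = re.compile(r"\[([^\[\]]+)\]")


def split_tags(notes: str) -> Tuple[List[str], str]:
    """Return (tags, notes with the bracketed tags stripped out)."""
    tags = [tag.strip().lower() for tag in TAG.findall(notes)]
    # Remove the tags and collapse the leftover whitespace.
    clean = " ".join(TAG.sub(" ", notes).split())
    return tags, clean


def build_tag_index(notes_by_slide: Dict[int, str]) -> Dict[str, List[int]]:
    """Map each tag to the slide numbers whose notes carry it."""
    index: Dict[str, List[int]] = defaultdict(list)
    for slide_number in sorted(notes_by_slide):
        tags, _ = split_tags(notes_by_slide[slide_number])
        for tag in tags:
            if slide_number not in index[tag]:
                index[tag].append(slide_number)
    return dict(index)


# Made-up speaker notes for three slides.
notes = {
    1: "Open with the joke. [smart room] [web 2.0]",
    2: "[broadcast tower] Pause here.",
    3: "Recap. [Smart Room]",
}
index = build_tag_index(notes)
tags, speakable = split_tags(notes[2])
```

Looking up index["smart room"] then gives the slides to pull, and split_tags hands back a tag-free copy of the notes to speak from, which takes care of the larding problem.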

Tags are easy! Tags are useful! Let tags be tags!

[Tags: tags everything_is_miscellaneous keynote powerpoint metadata whines ]

Categories: Uncategorized Tagged with: everythingIsMiscellaneous • everything_is_miscellaneous • keynote • metadata • powerpoint • tagging • tags • whines Date: August 7th, 2009 dw

2 Comments »

August 5, 2009

Media Cloud unclouds media

The NY Times has a terrific article about Media Cloud, a Berkman Center project (hats off to Ethan Zuckerman, Yochai Benkler, Hal Roberts, among others) that will let researchers track the actual movement of ideas through the mediasphere and blogosphere.

Data about concepts! What a concept!

[Tags: berkman media blogs memes research media_cloud ]

Categories: blogs Tagged with: berkman • blogs • everythingIsMiscellaneous • media • memes • research Date: August 5th, 2009 dw

Be the first to comment »

July 29, 2009

Office hours for non-academics

Academics hold office hours (set periods when they can be found in their on-campus offices, available to talk) because they are not required to be on campus except for when they teach. Since more of the workforce is adopting that work-wherever-you-want approach, I really like Whitney Hess’ idea of establishing office hours for herself. She’s a user experience designer, and if you want to talk with her, you can count on her being in her office, with a chat client open, on Sunday mornings, 9-10 EDT.

More of us ought to be doing that. I think it’s a keeper of an idea.

[Tags: office_hours free_agent_nation business ]

Categories: Uncategorized Tagged with: business • cluetrain • digital culture • everythingIsMiscellaneous • free_agent_nation • office_hours Date: July 29th, 2009 dw

5 Comments »

July 28, 2009

Annals of openness in peril

1. The court has rejected Charlie Nesson’s basic defense of Joel Tenenbaum’s sharing of music files. The case is going to the jury, which may levy the same sort of insanely excessive fines as in the Jammie Thomas-Rasset trial. I hope Charlie’s team can convince the jury that the fines and the entire process are so onerous and disproportionate that the RIAA has been abusing the court system. Of course, IANAL, and IANAOTJ (I am not on the jury).

2. Barnes and Noble has launched its e-book software. It runs on iPhones as well as on PCs and Macs. I’m having trouble finding which formats it supports, but judging from its Open dialog, not PDF, .doc, .html, .mobi, or text. It does support .PDB books.

After a very, very quick session playing with it, it seems quite competitive with the Kindle. Because I’m running it on my Mac and not on the little piece of crippled hardware I bought from Amazon (the Kindle is just barely adequate as a reader, and is still overpriced by more than 100% in terms of its value, imo), having the use of a keyboard and a mouse is a big step up. And, unlike the Kindle, you can use whatever fonts you have on your machine. Still, it’s only incrementally better than the Kindle’s software (again, on a quick look), not a great leap forward for readers.

One of B&N’s big advantages is that it’s hooked into Google Books, enabling you to download public domain books that Google has scanned in. You do this by searching for a book on the B&N site and noticing the “free from Google Books” label. Be sure to sort by price; otherwise B&N lists the for-pay versions first. If B&N wants to be aggressive in this space (= succeed), it should create an easy-to-find section that lets you browse Google’s free books. Get us using the ereader and then sell us the copyrighted books. (If B&N has such a section, I couldn’t find it quickly enough.)

BTW, I presume (and thus may be wrong) that Google did a special deal with B&N to enable this. If so, I find it worrisome. If Google is going to be granted a special right to scan in books without fear of copyright reprisals, it will be the de facto national e-library, discouraging others from undertaking similarly scaled scanning projects, and thus should be making its public domain books equally and maximally freely available. IMO.

2a. [Later that evening:] B&N stores are now providing free Wifi. Yay!

3. Apple is not permitting the Google telephone service into the Apple App Store, thus simultaneously and inadvertently making the case for Zittrainian generativity.

4. [Later that day]: On the happy front, Google has open-sourced an implementation of Wave.

[Tags: copyright copyleft books e-books google libraries everything_is_miscellaneous charles_nesson jonathan_zittrain law fair_use amazon kindle b&n ]