Paolo Mottadelli (MOZ)

And you know what they call a… Quarter Pounder with Cheese in Paris?
With my coach Roberto
1_Roman_Sebrle_poids02
8_Agustin_Felix_110h
Pole vault competition


Archive for the 'Sourcesense' Category

05 24th, 2008

Well, the title of this post is cut and pasted from one of my last received emails.

Good news today by ApacheCon Schedule: I’ll be a speaker at the next ApacheConUS 2008 in New Orleas.

Here is an anticipation of what my session will be about. Please fell free to send me or comment here any suggestions you wish! I really would like to build the session presentation through a community-like process, i would like to be the spokesman of an interested community!
Do you like the idea?Here is the session abstract.

Content analysis for ECM with Apache Tika

Apache TIKAApache Tika is an extensible content analysis toolkit designed for detecting and extracting metadata and structured text content from a large number of document formats. It represents an higher level layer over existing parser libraries.
World-class content management systems, and most of all enterprise document management focused ones, always have to face the challenge of detecting, extracting and indexing as more various media content types as possible.
This session provides a technical presentation on how you can integrate Tika inside an Enterprise Document Management System, in order to centralize media type detection implementation and leverage dedicated parsers behind a common extraction layer.
Partecipants will also learn how Tika is integrated with Lucene in order to provide high performance document indexing and searching features.
As a real life demostration, the most recent supported media types, Office Open XML (OOXML), will be detected, extracted and indexed.

Today at 8 p.m CET Microsoft has announced its partnership with Sourcesense.
Visit microsoft PR page to know all the details.

I’m very proud to announce this partnership as I am one of the involved developers. It is going to be an important step for my company on the way to spread Open Source in the Enterprise industry.

We are developing the Office open XML support inside the POI Apache project. New Microsoft Spreadsheets (Office 2007 format) will be soon java manageable!