Refining the process of gathering information

Bombarded by information from numerous sources, many people today turn to electronic news-aggregation services to find what they want. One European team of researchers claims to have developed a flexible and innovative tool that enables journalists and other users to fine-tune the process of news-gathering and delivery.

Early trials of this open-source, distributed system, developed under the IST project PENG, were completed only recently. But coordinator Gabriella Pasi says the participants were impressed with the results.

"Selected Swiss journalists and students assessed the performance of the system's various modules," she says. "For example they checked the effectiveness and accuracy of the information filtering, comparing the results with those from existing systems. They then looked at the integrated system and praised its user-friendliness."

Pasi adds that they liked the system's ability to find relevant information – measured in terms of recall (the proportion of all relevant documents in the collection that the system actually retrieves) and precision (the proportion of retrieved documents that are relevant). A more detailed system trial is due for completion in November 2006.
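
The two measures are easy to mix up, so a small worked example may help. The sketch below is not PENG code; the function name and the document identifiers are made up for illustration. It uses the standard definitions: precision divides the relevant retrieved documents by everything retrieved, while recall divides them by everything relevant in the collection.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for two collections of document ids."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = retrieved & relevant  # documents that are both retrieved and relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical example: the system returns four documents,
# while three documents in the collection are actually relevant.
retrieved = {"d1", "d2", "d3", "d4"}
relevant = {"d2", "d4", "d5"}

precision, recall = precision_recall(retrieved, relevant)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Here two of the four retrieved documents are relevant (precision 0.50), and two of the three relevant documents were found (recall about 0.67).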

More than just 'push'
The project originated in research carried out by several partners on information retrieval and filtering. Pasi notes: "Our project proposal predated the launch of present news-aggregation services, which focus on 'pushing' out information based on user needs." The PENG system, by contrast, offers two distinct techniques: information filtering (push) and information retrieval (pull).

Current news-aggregation systems work very much like internet search engines, pushing out information based on certain user criteria. If users require further filtering, they must create a profile for themselves – which can result in the generation of somewhat limited lists. This process works well for journalists receiving information from online news agencies that produce a continuous news stream; but they still face the problem of selecting the most relevant news.

The PENG system enables users to go much further. By personalising filters, they can pick up targeted information from agencies and combine this with data retrieved from the web or specialised archives. They can also place constraints on the content they seek – such as the media category or trustworthiness of sources – to generate highly specific information. The system then calls on various modules to edit and summarise all this information automatically, before presenting it as the user wishes.
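
As a rough illustration of combining pushed and pulled items under user constraints, consider the sketch below. It is an assumption-laden toy, not the PENG implementation: the `NewsItem` fields, the `apply_constraints` function, and the numeric trust scores are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class NewsItem:
    title: str
    origin: str   # "agency" for pushed items; "web" or "archive" for pulled ones
    media: str    # media category, e.g. "text" or "video"
    trust: float  # hypothetical source trust score in [0, 1]


def apply_constraints(items, media=None, min_trust=0.0):
    """Keep items that match the media category and meet the trust threshold."""
    return [
        item for item in items
        if (media is None or item.media == media) and item.trust >= min_trust
    ]


# Items delivered by the agency filter (push).
pushed = [
    NewsItem("Election results announced", "agency", "text", 0.9),
    NewsItem("Election night footage", "agency", "video", 0.9),
]
# Items the user retrieved from the web or an archive (pull).
pulled = [
    NewsItem("Background on the candidates", "archive", "text", 0.8),
    NewsItem("Anonymous blog rumour", "web", "text", 0.3),
]

selected = apply_constraints(pushed + pulled, media="text", min_trust=0.7)
for item in selected:
    print(item.origin, "-", item.title)
```

Only the two trusted text items survive: one pushed by the agency and one pulled from the archive.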

Innovative fuzzy algorithm
Pasi highlights the system's ability to learn user preferences over time. It can also deal with human vagueness or imprecision, for example in how users specify filters or interact with the software.

The partners have also developed a new filtering algorithm. It organises news by category, clustering agency items into thematic groups such as sports or politics, so that each subset shares common characteristics (by analogy, grouping people by hair colour). Once these subsets are defined, the system can describe each group (in the analogy, "this is the group with black hair").
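
The article gives no details of the PENG algorithm, so the following is only a generic sketch of the idea of grouping items without categories fixed in advance and then describing each group. It swaps in a simple single-pass "leader" clustering over word overlap (Jaccard similarity), which is not fuzzy and is not the project's method; the headlines, stopword list, and threshold are invented for illustration.

```python
from collections import Counter

STOPWORDS = {"the", "a", "in", "of", "to", "on", "and", "for"}


def tokens(text):
    """Lower-case a headline and return its set of non-stopword words."""
    return {w for w in text.lower().split() if w not in STOPWORDS}


def jaccard(a, b):
    """Word-overlap similarity between two sets, in [0, 1]."""
    return len(a & b) / len(a | b) if a | b else 0.0


def cluster(docs, threshold=0.2):
    """Assign each document to the most similar existing cluster, or start a new one."""
    clusters = []  # each cluster is a list of document indices
    for i, doc in enumerate(docs):
        toks = tokens(doc)
        best, best_sim = None, threshold
        for members in clusters:
            vocab = set().union(*(tokens(docs[j]) for j in members))
            sim = jaccard(toks, vocab)
            if sim >= best_sim:
                best, best_sim = members, sim
        if best is None:
            clusters.append([i])
        else:
            best.append(i)
    return clusters


def describe(docs, members, k=2):
    """Label a cluster with its k most frequent words."""
    counts = Counter(w for j in members for w in tokens(docs[j]))
    return [w for w, _ in counts.most_common(k)]


headlines = [
    "Striker scores twice in cup final",
    "Parliament votes on budget bill",
    "Cup final striker wins award",
    "Budget bill passes parliament vote",
]

groups = cluster(headlines)
for members in groups:
    print(describe(headlines, members), [headlines[j] for j in members])
```

With these four headlines the sketch forms one sports-like group and one politics-like group, and labels each with words its members share. No category names were supplied beforehand; the labels come from the data.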

"Of the two possible approaches to data clustering," says Pasi, "we chose 'unsupervised' because this approach does not force us to select a priori categories." She adds that the PENG system can display audiovisual content, but works mainly with textual information.

PENG was completed in August 2006. Though the complete system exists only as a prototype, project partner ATOS Origin is examining the possibility of using certain modules in standalone applications. The company is also interested in marketing the project's clustering algorithm, which could be used not only for filtering news but also for image gathering or e-commerce applications.

Contact:
Professor Gabriella Pasi
Consiglio Nazionale Delle Ricerche ITC-CNR
Via Bassini N. 15
I-20131 Milan
Italy
Tel: +39 02 2369 9489

Source: IST Results Portal