Portuguese Text Simplification for Digital Inclusion and Accessibility
- π€ Speaker: Caroline Gasperin
- π Date & Time: Wednesday 24 June 2009, 12:00 - 13:00
- π Venue: SW01, Computer Laboratory
Abstract
I will present PorSimples, a project for developing text simplification technology for the Portuguese language. We focus on syntactic simplification, which consists of breaking complex syntactic constructs in order to make sentences easier to read by people with poor reading skills. Our text simplification system has two modules: a machine learning-based module that decides when a sentence needs to be simplified, and a rule-based module that simplifies the sentences. The machine-learning module collects features of the sentences from a corpus of manually simplified texts and decides when a simplification operation is required, so that the output text is “natural” and not over simplified. The rule-based module executes simplification operations for the syntactic phenomena that are considered complex. I will detail both modules and present our experimental results so far.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- SW01, Computer Laboratory
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Caroline Gasperin
Wednesday 24 June 2009, 12:00-13:00