Word sense disambiguation is an essential, yet a very difficult task in natural language processing. While several other NLP tasks, such as POS tagging, can provide more than fairly good results (highly accurate, with almost 100% rate of...
moreWord sense disambiguation is an essential, yet a very difficult task in natural language processing. While several other NLP tasks, such as POS tagging, can provide more than fairly good results (highly accurate, with almost 100% rate of successfully labeled words), disambiguation is far from achieving such performances. However, we will demonstrate the need of word sense disambiguation in computing the lexical chains on a special kind of text (chats) using a WordNet-based approach. In addition, we will try to identify the bottlenecks (mostly in respect to accuracy) in such an approach and provide possible improvements.
Public data can be considered large and important sources of data that can be used for different purposes. In this paper we present a method for collecting and analyzing data within urban settlements. For more focused analysis and...
morePublic data can be considered large and important sources of data that can be used for different purposes. In this paper we present a method for collecting and analyzing data within urban settlements. For more focused analysis and gathering of large amount of data we considered a case study of Bucharest. The main purpose of this analysis is to pick up important information about different streets, points of interests, details about urban planning, etc., with the goal of facilitating a quick and correct evaluation of specific areas and identifying suitable location for adding new points of interest. The prediction of suitable location involves using heuristics and data mining technics such as clustering algorithms, association rules.
In this paper, we present an application that can be used for assessing the contributions of participants to multiple chat conversations (that debate the same topics) according to different criteria, along with the ranking of the...
moreIn this paper, we present an application that can be used for assessing the contributions of participants to multiple chat conversations (that debate the same topics) according to different criteria, along with the ranking of the conversations considering a list of important concepts to be debated. Introduction Chat is one of the favorite environments for Computer Supported Collaborative Learning (CSCL) tasks that require online and synchronous textual interactions among participants (Stahl, 2006). Most of the existing tools for supporting chats are only aiming at facilitating the conversation without offering analysis instruments. One exception is PolyCAFe (Rebedea et al., 2011), a system which analyzes each user contribution and provides abstraction, visualization and feedback services for supporting learners and tutors. In another context, Chiru et al. (2011) start from the idea of topics rhythmicity in the participants' utterances in order to evaluate the chat quality and th...
It is well known that there is a world-wide gender gap in most STEM domains. We propose a study of the participation of undergraduate students in computer science to a CSCL experiment in order to detect possible differences between female...
moreIt is well known that there is a world-wide gender gap in most STEM domains. We propose a study of the participation of undergraduate students in computer science to a CSCL experiment in order to detect possible differences between female and male students. Moreover, we have tried to determine if the composition of the groups influences the value of the factors proposed for analyzing the activity of a student. The factors used for analysis are qualitative heuristics used to determine the activity of the users with regard to both involvement and content. Thus, we have been able to identify differences between the knowledge, innovation, involvement and vocabulary manifested by each gender group in several cases: chats only with males, with a majority of males or females and with an equal distribution of genders. The main conclusions of the research are that females are innovative and that equally distributed groups have higher scores than the others for several indicators.
The wider acceptance and usage of instant messaging (chat) represents one of the consequences of undertaking Computer-Supported Collaborative Learning (CSCL) practices in formal education settings. However, the difficulty of analyzing...
moreThe wider acceptance and usage of instant messaging (chat) represents one of the consequences of undertaking Computer-Supported Collaborative Learning (CSCL) practices in formal education settings. However, the difficulty of analyzing these textual artifacts of learners in order to offer them feedback represents a serious problem in further extending the usage of chat conversations. PolyCAFe is a system that was designed to support the tutors and to provide automatic feedback for the learners engaged in ...
In this paper we present a method that combines the cognitive and socio-cultural paradigms for automatically identifying the most important moments (the so-called pivotal moments) from a Computer Supported Collaborative Learning chat. The...
moreIn this paper we present a method that combines the cognitive and socio-cultural paradigms for automatically identifying the most important moments (the so-called pivotal moments) from a Computer Supported Collaborative Learning chat. The existing applications do not identify these moments and we propose a flexible visual method for filling this gap. Since these moments may have different roles in a discourse, we also propose a classification of the identified types of important moments from chat conversations.
ABSTRACT Because of the ubiquity of metaphors in language, metaphor processing is a very important task in the field of natural language processing. The first step towards metaphor processing, and probably the most difficult one, is...
moreABSTRACT Because of the ubiquity of metaphors in language, metaphor processing is a very important task in the field of natural language processing. The first step towards metaphor processing, and probably the most difficult one, is metaphor detection. In the first part of this paper, we review the theoretical background for metaphors and the models and implementations that have been proposed for their detection. We then build corpora for detecting three types of metaphors: IS-A metaphors, metaphors formed with the preposition 'of' and metaphors formed with a verb. For the first two tasks, we train supervised classifiers using semantic features. For the third task, we use features commonly used in text categorization.
Abstract In this paper we present a system that combines the cognitive and socio-cultural paradigms existent in the field of discourse analysis in order to analyze both narrations and conversations. The novelty of our approach is that...
moreAbstract In this paper we present a system that combines the cognitive and socio-cultural paradigms existent in the field of discourse analysis in order to analyze both narrations and conversations. The novelty of our approach is that existing applications are oriented on analyzing only one of these two types, an adaptation being necessary for the analysis of the other type.
... Independetei, Bucharest, Romania, costin. chiru@ cs. pub. ro, alexandru. janca@ cti. pub. ro, traian. rebedea@ cs. pub. ro Abstract. Word sense disambiguation is an essential, yet a very difficult task in natural language processing....
more... Independetei, Bucharest, Romania, costin. chiru@ cs. pub. ro, alexandru. janca@ cti. pub. ro, traian. rebedea@ cs. pub. ro Abstract. Word sense disambiguation is an essential, yet a very difficult task in natural language processing. ...
logo IndexCopernicus, Current language: English. ...
The Web pages have been intensively used lately for automatic or semiautomatic extraction of useful information. Because of the open nature of the Web, the texts that have no spelling errors are very rare exceptions. One of the most...
moreThe Web pages have been intensively used lately for automatic or semiautomatic extraction of useful information. Because of the open nature of the Web, the texts that have no spelling errors are very rare exceptions. One of the most wide-spread errors in Internet texts is the malapropism. Thus, malapropism detection and correction algorithms have been investigated. The detection algorithms are based on text cohesion while the correction algorithms use precompiled paronyms dictionaries. Therefore, it is very important to ...
This report is about the way to deliver relevant feedback to students' written production, either for free texts (eg, essays, syntheses, notes) or chat conversation, in order for the students to build knowledge. This report presents...
moreThis report is about the way to deliver relevant feedback to students' written production, either for free texts (eg, essays, syntheses, notes) or chat conversation, in order for the students to build knowledge. This report presents an overview and a selection of existing models, methods and resources for: 1) the automatic analysis of learner interactions using language technologies or social network analysis (Task 5.1) and 2) the automatic analysis of learner text (Task 5.2). Secondly, a proposition of the tools to be developed for the project ...
This report presents Version 1 of the support and feedback services (delivering recommendations based on interaction analysis and on students' textual production) that can be integrated within an e-learning environment. Further steps...
moreThis report presents Version 1 of the support and feedback services (delivering recommendations based on interaction analysis and on students' textual production) that can be integrated within an e-learning environment. Further steps toward the implementation of Version 2 of these services and their future integration with all the LTfLL services are also suggested.
The perception of environmental information (EI) and the way that it is structured within Environmental Information Portals (EIP) are of major importance for the design of such information systems. The present paper investigates the...
moreThe perception of environmental information (EI) and the way that it is structured within Environmental Information Portals (EIP) are of major importance for the design of such information systems. The present paper investigates the concepts of air quality perception and applies natural language processing for developing a framework of concepts, phrases, patterns, collocations, and metaphors around an air quality ontology. The results reveal the frequency of appearance and the relations between basic terms, thus reflecting domain ...
ABSTRACT Starting from the socio-constructivist concepts of (virtual) community of practice (vCoP) and internet-based argumentative open-ended learning environments, this study proposes and validates two tools for automated dialogue...
moreABSTRACT Starting from the socio-constructivist concepts of (virtual) community of practice (vCoP) and internet-based argumentative open-ended learning environments, this study proposes and validates two tools for automated dialogue assessment, ReaderBench and Important Moments, developed on the ground of the polyphonic social knowledge building model. The analyzed corpus was the dialogue produced by an academic vCoP with N = 179 community members in 23 months, and consisting of 3685 interventions in 292 text-based discussion threads. The analysis results uncovered significant differences in the discussion threads produced by central and peripheral participants, such that central participants produced more interventions with higher collaborative dialogue quality, and the discussion threads they initiated were longer and involved a larger number of participants. Moreover, based on the automated analysis result, the vCoP participants could be classified in two clusters corresponding to the well-known core-periphery structure of CoPs. These findings are consistent with those revealed by other methods, and suggest that the employed tools are appropriate for identifying virtual communities that are appropriate as open-ended learning environments. Further research and development is needed to deepen quantitative vCoP models and test communication strategies recommended to students in vCoP-based argumentative open-ended learning environments.
This paper presents a model and an application that can be used to assess chat conversations according to their content, which is related to a number of imposed topics, and to the personal involvement of the participants. The main...
moreThis paper presents a model and an application that can be used to assess chat conversations according to their content, which is related to a number of imposed topics, and to the personal involvement of the participants. The main theoretical ideas that stand behind this application are Bakhtin’s polyphony theory and Tannen’s ideas related to the use of repetitions. The results of the application are validated against the gold standard provided by two teachers from the Human-Computer Interaction evaluating the same chats and after that the verification is done using another teacher from the same domain. During the verification we also show that the model used for chat evaluation is dependent on the number of participants to that chat.
ABSTRACT Conference code: 81593, Export Date: 16 August 2011, Source: Scopus, Language of Original Document: English, Correspondence Address: Rebedea, T.; Department of Computer Science and Engineering, Politehnica University of...
moreABSTRACT Conference code: 81593, Export Date: 16 August 2011, Source: Scopus, Language of Original Document: English, Correspondence Address: Rebedea, T.; Department of Computer Science and Engineering, Politehnica University of Bucharest, 313 Splaiul Independetei, Bucharest, Romania; email:
traian.rebedea@cs.pub.ro, References: Adams, P.H., Martell, C.H., Topic detection and extraction in chat (2008) Proceedings of the 2008 IEEE International Conference on Semantic Computing, pp. 581-588;
Although Computer-Supported Collaborative Learning (CSCL) advocates the use of instant messaging and discussion forums for collaboration between learners, there is a scarcity of tools for leveraging the information in this kind of...
moreAlthough Computer-Supported Collaborative Learning (CSCL) advocates the use of instant messaging and discussion forums for collaboration between learners, there is a scarcity of tools for leveraging the information in this kind of conversations. Thus, these technologies are primarily used for communication and, once the conversation is over, the raw data is rarely manually analyzed by tutors, teachers and other learners. This paper presents a methodology and a system that can be used for providing feedback and support to learners and tutors that are involved in tasks that make use of chats and forums. In order to achieve this objective, PolyCAFe employs Natural Language Processing and Social Network Analysis techniques to discover polyphony and inter-animation in textual collaborations. To evaluate the proposed approach and the designed system a first validation experiment has been performed and the results are discussed and analyzed in the end of the paper.
Abstract Depending on the user's intention, the queries processed by a search engine can be classified in transactional, informational and navigational [1]. In order to meet the three types of searches, at...
moreAbstract Depending on the user's intention, the queries processed by a search engine can be classified in transactional, informational and navigational [1]. In order to meet the three types of searches, at this moment search engines basically use algorithmic analysis of the links between pages improved by a factor that depends on the number of occurrences of the keywords in the query and the order of these words on each web page returned as a result. For transactional and informational queries, the relevance of the results returned by the ...