{"id":710,"date":"2017-12-06T06:12:48","date_gmt":"2017-12-06T06:12:48","guid":{"rendered":"https:\/\/www.science.nus.edu.sg\/?p=710"},"modified":"2019-11-06T06:21:35","modified_gmt":"2019-11-06T06:21:35","slug":"unlocking-the-power-of-web-text-data","status":"publish","type":"post","link":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/","title":{"rendered":"Unlocking the power of web text data"},"content":{"rendered":"<p>6\u00a0Dec 2017. NUS statisticians have developed the Regularised Text Logistic (RTL) regression model to extract informative word features from digital text for decision-making.<\/p>\n<p>The world is becoming increasingly connected through the internet and social media applications, creating vast amounts of data. With the massive increase in web posts, user reviews and feedback shared around the world via electronic word-of-mouth, web text data has been shown to provide important information for content analysis and to influence decision-making processes. Businesses and organisations need to be able to analyse and make sense of this data to remain competitive and relevant.<\/p>\n<p>Prof CHEN Ying from the Department of Statistics and Applied Probability, NUS and her research team have developed a text mining and analysis model which can automatically identify and extract informative textual data of interest from public postings on the internet (e.g. social media comments). This is known as the Regularised Text Logistic (RTL) regression model.<\/p>\n<p>Web text data comes from many distributed sources and is often unstructured. This makes it difficult to analyse using conventional approaches. The RTL regression model is a machine learning classifier that accurately classifies the polarity of customer reviews (positive or negative) based on their textual content. 
It is also capable of automatically detecting a small set of informative word features that help business decision-makers easily pinpoint the key aspects of customer reviews.<\/p>\n<p>Prof Chen said, \u201cThis automated feature saves time which would otherwise be spent reading the review information online. With this feature, business decision-makers can obtain immediate feedback on customer sentiments towards their products or services, so that they can tailor their offerings to improve the customer experience.\u201d<\/p>\n<p>\u201cTo our knowledge, the RTL model is the first supervised sentiment classifier for large amounts of web-based text using the logistic regression framework with theoretical derivation,\u201d added Prof Chen.<\/p>\n<p><img decoding=\"async\" alt=\"\" src=\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2019\/11\/120._Chen_Y_STA_20171108_1.jpg\" \/><\/p>\n<p>The figure shows the word features found in online customer ratings and reviews for nine major restaurants managed by a hotel chain in Singapore. Each word feature in the corpus is displayed as a grey dot. The features selected by the RTL classifier are highlighted as black dots.<\/p>\n<p><strong>Reference<\/strong><\/p>\n<p>P Liu; Y Chen; CP Teo, \u201cSentiment Analysis for Online Reviews with Regularized Text Logistic Regression\u201d working paper (2017).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>6\u00a0Dec 2017. 
NUS statisticians have developed the Regularised Text Logistic (RTL) regression model to extract informative word features from digital&#8230;<\/p>\n","protected":false},"author":1,"featured_media":720,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[17,13],"tags":[],"class_list":["post-710","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-science","category-research-news"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v23.6 (Yoast SEO v23.6) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Unlocking the power of web text data - NUS Faculty of Science<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Unlocking the power of web text data\" \/>\n<meta property=\"og:description\" content=\"6\u00a0Dec 2017. 
NUS statisticians have developed the Regularised Text Logistic (RTL) regression model to extract informative word features from digital...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/\" \/>\n<meta property=\"og:site_name\" content=\"NUS Faculty of Science\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/NUSFacultyofScience\/\" \/>\n<meta property=\"article:published_time\" content=\"2017-12-06T06:12:48+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2019-11-06T06:21:35+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"550\" \/>\n\t<meta property=\"og:image:height\" content=\"431\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"nussdo\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"nussdo\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/\"},\"author\":{\"name\":\"nussdo\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/#\/schema\/person\/49b52e0adf731af61b2678f59bb0e364\"},\"headline\":\"Unlocking the power of web text data\",\"datePublished\":\"2017-12-06T06:12:48+00:00\",\"dateModified\":\"2019-11-06T06:21:35+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/\"},\"wordCount\":380,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg\",\"articleSection\":[\"Data science\",\"Research News\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/\",\"url\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/\",\"name\":\"Unlocking the power of web text data - NUS Faculty of 
Science\",\"isPartOf\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg\",\"datePublished\":\"2017-12-06T06:12:48+00:00\",\"dateModified\":\"2019-11-06T06:21:35+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage\",\"url\":\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg\",\"contentUrl\":\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg\",\"width\":550,\"height\":431},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.science.nus.edu.sg\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Unlocking the power of web text data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/#website\",\"url\":\"https:\/\/www.science.nus.edu.sg\/\",\"name\":\"NUS Faculty of Science\",\"description\":\"Leading in Science Education, Research and Innovation to Transform Our 
Future\",\"publisher\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.science.nus.edu.sg\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/#organization\",\"name\":\"NUS Faculty of Science\",\"url\":\"https:\/\/www.science.nus.edu.sg\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2022\/06\/fos-logo-768x190-1.png\",\"contentUrl\":\"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2022\/06\/fos-logo-768x190-1.png\",\"width\":768,\"height\":190,\"caption\":\"NUS Faculty of Science\"},\"image\":{\"@id\":\"https:\/\/www.science.nus.edu.sg\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/NUSFacultyofScience\/\",\"https:\/\/www.instagram.com\/nus.fos\/?hl=en\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/#\/schema\/person\/49b52e0adf731af61b2678f59bb0e364\",\"name\":\"nussdo\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.science.nus.edu.sg\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/f516fb037b7f05bc070a0f7a9d8f5811?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/f516fb037b7f05bc070a0f7a9d8f5811?s=96&d=mm&r=g\",\"caption\":\"nussdo\"},\"url\":\"https:\/\/www.science.nus.edu.sg\/blog\/author\/nussdo\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. 
-->","yoast_head_json":{"title":"Unlocking the power of web text data - NUS Faculty of Science","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/","og_locale":"en_US","og_type":"article","og_title":"Unlocking the power of web text data","og_description":"6\u00a0Dec 2017. NUS statisticians have developed the Regularised Text Logistic (RTL) regression model to extract informative word features from digital...","og_url":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/","og_site_name":"NUS Faculty of Science","article_publisher":"https:\/\/www.facebook.com\/NUSFacultyofScience\/","article_published_time":"2017-12-06T06:12:48+00:00","article_modified_time":"2019-11-06T06:21:35+00:00","og_image":[{"width":550,"height":431,"url":"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg","type":"image\/jpeg"}],"author":"nussdo","twitter_card":"summary_large_image","twitter_misc":{"Written by":"nussdo","Est. 
reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#article","isPartOf":{"@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/"},"author":{"name":"nussdo","@id":"https:\/\/www.science.nus.edu.sg\/#\/schema\/person\/49b52e0adf731af61b2678f59bb0e364"},"headline":"Unlocking the power of web text data","datePublished":"2017-12-06T06:12:48+00:00","dateModified":"2019-11-06T06:21:35+00:00","mainEntityOfPage":{"@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/"},"wordCount":380,"commentCount":0,"publisher":{"@id":"https:\/\/www.science.nus.edu.sg\/#organization"},"image":{"@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage"},"thumbnailUrl":"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg","articleSection":["Data science","Research News"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/","url":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/","name":"Unlocking the power of web text data - NUS Faculty of 
Science","isPartOf":{"@id":"https:\/\/www.science.nus.edu.sg\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage"},"image":{"@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage"},"thumbnailUrl":"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg","datePublished":"2017-12-06T06:12:48+00:00","dateModified":"2019-11-06T06:21:35+00:00","breadcrumb":{"@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#primaryimage","url":"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg","contentUrl":"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2017\/11\/120._Chen_Y_STA_20171108.jpg","width":550,"height":431},{"@type":"BreadcrumbList","@id":"https:\/\/www.science.nus.edu.sg\/blog\/2017\/12\/unlocking-the-power-of-web-text-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.science.nus.edu.sg\/"},{"@type":"ListItem","position":2,"name":"Unlocking the power of web text data"}]},{"@type":"WebSite","@id":"https:\/\/www.science.nus.edu.sg\/#website","url":"https:\/\/www.science.nus.edu.sg\/","name":"NUS Faculty of Science","description":"Leading in Science Education, Research and Innovation to Transform Our 
Future","publisher":{"@id":"https:\/\/www.science.nus.edu.sg\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.science.nus.edu.sg\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.science.nus.edu.sg\/#organization","name":"NUS Faculty of Science","url":"https:\/\/www.science.nus.edu.sg\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.science.nus.edu.sg\/#\/schema\/logo\/image\/","url":"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2022\/06\/fos-logo-768x190-1.png","contentUrl":"https:\/\/www.science.nus.edu.sg\/wp-content\/uploads\/2022\/06\/fos-logo-768x190-1.png","width":768,"height":190,"caption":"NUS Faculty of Science"},"image":{"@id":"https:\/\/www.science.nus.edu.sg\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/NUSFacultyofScience\/","https:\/\/www.instagram.com\/nus.fos\/?hl=en"]},{"@type":"Person","@id":"https:\/\/www.science.nus.edu.sg\/#\/schema\/person\/49b52e0adf731af61b2678f59bb0e364","name":"nussdo","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.science.nus.edu.sg\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/f516fb037b7f05bc070a0f7a9d8f5811?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/f516fb037b7f05bc070a0f7a9d8f5811?s=96&d=mm&r=g","caption":"nussdo"},"url":"https:\/\/www.science.nus.edu.sg\/blog\/author\/nussdo\/"}]}},"_links":{"self":[{"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/posts\/710"}],"collection":[{"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true
,"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/comments?post=710"}],"version-history":[{"count":1,"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/posts\/710\/revisions"}],"predecessor-version":[{"id":712,"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/posts\/710\/revisions\/712"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/media\/720"}],"wp:attachment":[{"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/media?parent=710"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/categories?post=710"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.science.nus.edu.sg\/wp-json\/wp\/v2\/tags?post=710"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}