{"id":133,"date":"2016-03-11T02:37:47","date_gmt":"2016-03-11T02:37:47","guid":{"rendered":"http:\/\/intelligentonlinetools.com\/blog\/?p=133"},"modified":"2016-03-24T02:11:53","modified_gmt":"2016-03-24T02:11:53","slug":"133","status":"publish","type":"post","link":"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/","title":{"rendered":"7 Ideas for Building Text Mining Application"},"content":{"rendered":"<p>It is no doubt that the web is growing at an incredible pace. And as the most documents of the web consist of the text, the applications of text analytics or text mining are getting more use. In such applications the textual data are used for extracting intelligence from a large collection of documents.  Here are 7 ideas for building this type of applications. Later on during this year 2016 some online working demo examples on this site will be built to test the ideas. The focus is on applications for personal use. Business applications of text mining can be found in [1]<\/p>\n<p>1. Trending is collecting historical data in order to find pattern or predict future. If the usage of word phrase &#8220;python programming&#8221; is going up from month to month then it is good signal for paying attention to this. There is the tool Google Trends is a public web facility of Google Inc., based on Google Search, that shows how often a particular search-term is entered relative to the total search-volume across various regions of the world, and in various languages. The horizontal axis of the main graph represents time (starting from 2004), and the vertical is how often a term is searched for relative to the total number of searches, globally. [2] <\/p>\n<p>However what if we more focused on the future and we want know what terms will be popular in the future. For example &#8220;data science&#8221; or &#8220;big data&#8221; are now popular search terms but back to the time when the usage was the lowest &#8211; if the tool can predict high increase of usage in such situation &#8211; that would be very useful.<\/p>\n<p>2. Building news feed is another example of application for text mining. Newly published web based content if it matches the user interest has a great value. So the application should allow for user to set the topics for the desired content.<br \/>\nFor the same topic the user might be interesting in the ability to set other filters such as source of content, type of content and some other characteristics of content.<br \/>\nOver the time the user interests will change and so the application should learn and adapt to user interests too.<\/p>\n<p>3.Post editing. While someone online is typing article for blog or paper there is always the need to find something related to topic on the web. Imagine that it will automatically show an additional text box with similar content from the web. This would eliminate switching back to search engine and also can bring something new that the author even did not think about.<\/p>\n<p>4. Automatic creation of data reports is very helpful for people who needs data for their research or business needs. There a lot of data and information freely available on the web however it is tedious go to online on weekly or monthly basis and manually extract data and put in some file or database for further analysis. Such task often will include such things as formatting data, merging the information from different sources and some other processing operations.<\/p>\n<p>5. Sentiment analysis (also known as opinion mining) refers to the use of natural language processing, text analysis and computational linguistics to identify and extract subjective information in source materials. Sentiment analysis is widely applied to reviews and social media for a variety of applications, ranging from marketing to customer service. [3]<br \/>\nAlmost each online text analytics web based service is offering sentiment analysis option for users. There are also many online examples how to do sentiment analysis using python or R as programming language. [4]<br \/>\nSentiment analysis application would be useful for predicting some financial event or collecting opinion about some product.<\/p>\n<p>6. Content Organizing application can organize documents into groups by topic, keywords or by some other means.<br \/>\nWe can subscribe and receive email notifications about site news, latest post or new article. The application that is saving links that we liked or decided to review later can help us be more productive. In addition to links it would save some information about links like keywords, topics or description. Such application could also group information by topic or keyword and automatically assign additional keywords.<br \/>\nText document clustering and classification would be used a lot for this application.<\/p>\n<p>7. Topic detection application can be used for automatic text categorization, for understanding what people are talking about, for automatic processing or preprocessing emails or user submitted online articles, comments.<br \/>\nThe task of topic detection might also require the<br \/>\ndevelopment of approaches related to the presentation<br \/>\nof topics: topic ranking, relevant image retrieval, title<br \/>\nand keyword extraction. One of the example of using topic detection is shown in [5]<br \/>\nObviously one document can consist of several segments on different topics. In one of researches simple clustering algorithm was used to group the semantically-related sentences. The distance between two sentences was calculated based on the distance between<br \/>\nall nouns that appear in the sentences.  The distance between two nouns was calculated using the Wordnet thesaurus. [6],[7]<\/p>\n<p><strong>References<\/strong><\/p>\n<p>1. Text Mining and its Business Applications    http:\/\/www.codeproject.com\/Articles\/822379\/Text-Mining-and-its-Business-Applications <\/p>\n<p>2. https:\/\/en.wikipedia.org\/wiki\/Google_Trends  Google Trends<br \/>\nFrom Wikipedia<\/p>\n<p>3. https:\/\/en.wikipedia.org\/wiki\/Sentiment_analysis  Sentiment analysis<br \/>\nFrom Wikipedia<\/p>\n<p>4. https:\/\/support.sas.com\/resources\/papers\/proceedings14\/1288-2014.pdf<br \/>\nAnalysis of Unstructured Data: Applications of Text Analytics<br \/>\nand Sentiment Mining, Dr. Goutam Chakraborty,  Murali Krishna Pagolu<\/p>\n<p>5. http:\/\/ceur-ws.org\/Vol-1150\/petkos.pdf<br \/>\nTwo-level message clustering for topic detection in<br \/>\nTwitter. Georgios Petkos ,Symeon Papadopoulos, Yiannis Kompatsiaris<\/p>\n<p>6. A Non-Linear Topic Detection Method for Text<br \/>\nSummarization Using Wordnet<\/p>\n<p><a href=\"https:\/\/www.cs.kent.ac.uk\/people\/staff\/aaf\/pub_papers.dir\/Wksp-Tec-Info-Ling-Silla-2003.pdf\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">Click to access Wksp-Tec-Info-Ling-Silla-2003.pdf<\/a><\/p>\n<p>Carlos N. Silla Jr. , Celso A. A. Kaestner,  Alex A. Freitas<\/p>\n<p>7. https:\/\/www.uni-weimar.de\/medien\/webis\/events\/tir-08\/tir08-papers-final\/wartena08-topic-detection-by-clustering-keywords.pdf<br \/>\nTopic Detection by Clustering Keywords.   Christian Wartena and Rogier Brussee<\/p>\n","protected":false},"excerpt":{"rendered":"<p>It is no doubt that the web is growing at an incredible pace. And as the most documents of the web consist of the text, the applications of text analytics or text mining are getting more use. In such applications the textual data are used for extracting intelligence from a large collection of documents. Here &#8230; <a title=\"7 Ideas for Building Text Mining Application\" class=\"read-more\" href=\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":[]},"categories":[2],"tags":[],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>7 Ideas for Building Text Mining Application - Machine Learning Applications<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"7 Ideas for Building Text Mining Application - Machine Learning Applications\" \/>\n<meta property=\"og:description\" content=\"It is no doubt that the web is growing at an incredible pace. And as the most documents of the web consist of the text, the applications of text analytics or text mining are getting more use. In such applications the textual data are used for extracting intelligence from a large collection of documents. Here ... Read more\" \/>\n<meta property=\"og:url\" content=\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/\" \/>\n<meta property=\"og:site_name\" content=\"Machine Learning Applications\" \/>\n<meta property=\"article:published_time\" content=\"2016-03-11T02:37:47+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2016-03-24T02:11:53+00:00\" \/>\n<meta name=\"author\" content=\"owygs156\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"owygs156\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/\",\"url\":\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/\",\"name\":\"7 Ideas for Building Text Mining Application - Machine Learning Applications\",\"isPartOf\":{\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/#website\"},\"datePublished\":\"2016-03-11T02:37:47+00:00\",\"dateModified\":\"2016-03-24T02:11:53+00:00\",\"author\":{\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/#\/schema\/person\/7a886dc5eb9758369af2f6d2cb342478\"},\"breadcrumb\":{\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"http:\/\/intelligentonlinetools.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"7 Ideas for Building Text Mining Application\"}]},{\"@type\":\"WebSite\",\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/#website\",\"url\":\"http:\/\/intelligentonlinetools.com\/blog\/\",\"name\":\"Machine Learning Applications\",\"description\":\"Artificial intelligence, data mining and machine learning for building web based tools and services.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"http:\/\/intelligentonlinetools.com\/blog\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/#\/schema\/person\/7a886dc5eb9758369af2f6d2cb342478\",\"name\":\"owygs156\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"http:\/\/intelligentonlinetools.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"http:\/\/2.gravatar.com\/avatar\/b351def598609cb4c0b5bca26497c7e5?s=96&d=mm&r=g\",\"contentUrl\":\"http:\/\/2.gravatar.com\/avatar\/b351def598609cb4c0b5bca26497c7e5?s=96&d=mm&r=g\",\"caption\":\"owygs156\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"7 Ideas for Building Text Mining Application - Machine Learning Applications","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/","og_locale":"en_US","og_type":"article","og_title":"7 Ideas for Building Text Mining Application - Machine Learning Applications","og_description":"It is no doubt that the web is growing at an incredible pace. And as the most documents of the web consist of the text, the applications of text analytics or text mining are getting more use. In such applications the textual data are used for extracting intelligence from a large collection of documents. Here ... Read more","og_url":"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/","og_site_name":"Machine Learning Applications","article_published_time":"2016-03-11T02:37:47+00:00","article_modified_time":"2016-03-24T02:11:53+00:00","author":"owygs156","twitter_card":"summary_large_image","twitter_misc":{"Written by":"owygs156","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/","url":"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/","name":"7 Ideas for Building Text Mining Application - Machine Learning Applications","isPartOf":{"@id":"http:\/\/intelligentonlinetools.com\/blog\/#website"},"datePublished":"2016-03-11T02:37:47+00:00","dateModified":"2016-03-24T02:11:53+00:00","author":{"@id":"http:\/\/intelligentonlinetools.com\/blog\/#\/schema\/person\/7a886dc5eb9758369af2f6d2cb342478"},"breadcrumb":{"@id":"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/"]}]},{"@type":"BreadcrumbList","@id":"http:\/\/intelligentonlinetools.com\/blog\/2016\/03\/11\/133\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"http:\/\/intelligentonlinetools.com\/blog\/"},{"@type":"ListItem","position":2,"name":"7 Ideas for Building Text Mining Application"}]},{"@type":"WebSite","@id":"http:\/\/intelligentonlinetools.com\/blog\/#website","url":"http:\/\/intelligentonlinetools.com\/blog\/","name":"Machine Learning Applications","description":"Artificial intelligence, data mining and machine learning for building web based tools and services.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"http:\/\/intelligentonlinetools.com\/blog\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"http:\/\/intelligentonlinetools.com\/blog\/#\/schema\/person\/7a886dc5eb9758369af2f6d2cb342478","name":"owygs156","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"http:\/\/intelligentonlinetools.com\/blog\/#\/schema\/person\/image\/","url":"http:\/\/2.gravatar.com\/avatar\/b351def598609cb4c0b5bca26497c7e5?s=96&d=mm&r=g","contentUrl":"http:\/\/2.gravatar.com\/avatar\/b351def598609cb4c0b5bca26497c7e5?s=96&d=mm&r=g","caption":"owygs156"}}]}},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/s7h1IJ-133","jetpack-related-posts":[{"id":65,"url":"http:\/\/intelligentonlinetools.com\/blog\/2016\/02\/09\/cloud-computing\/","url_meta":{"origin":133,"position":0},"title":"Cloud Computing","date":"February 9, 2016","format":false,"excerpt":"Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over a network (typically the Internet). The name comes from the use of a cloud-shaped symbol as an abstraction for the complex infrastructure it contains in system diagrams. Cloud computing entrusts remote services\u2026","rel":"","context":"Similar post","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":227,"url":"http:\/\/intelligentonlinetools.com\/blog\/2016\/05\/28\/using-python-for-mining-data-from-twitter\/","url_meta":{"origin":133,"position":1},"title":"Using Python for Mining Data From Twitter","date":"May 28, 2016","format":false,"excerpt":"Twitter is increasingly being used for business or personal purposes. With Twitter API there is also an opportunity to do data mining of data (tweets) and find interesting information. In this post we will take a look how to get data from Twitter, prepare data for analysis and then do\u2026","rel":"","context":"In &quot;Artificial Intelligence&quot;","img":{"alt_text":"Frequency of Hashtags","src":"https:\/\/i0.wp.com\/intelligentonlinetools.com\/blog\/wp-content\/uploads\/2016\/05\/Frequency-of-Hashtags-300x171.png?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":521,"url":"http:\/\/intelligentonlinetools.com\/blog\/2016\/08\/26\/bio-inspired-optimization-for-text-mining-4\/","url_meta":{"origin":133,"position":2},"title":"Bio-Inspired Optimization for Text Mining-4","date":"August 26, 2016","format":false,"excerpt":"Clustering Text Data In previous post Bio-Inspired Optimization was applied for clustering of numerical data. In this post text data will be used for clustering. So python source code will be modified for clustering of text data. This data will be initialized in the beginning of this python script with\u2026","rel":"","context":"In &quot;Machine Learning&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":498,"url":"http:\/\/intelligentonlinetools.com\/blog\/2016\/08\/13\/bio-inspired-optimization-for-text-mining-3\/","url_meta":{"origin":133,"position":3},"title":"Bio-Inspired Optimization for Text Mining-3","date":"August 13, 2016","format":false,"excerpt":"Clustering Numerical Multidimensional Data In this post we will implement Bio Inspired Optimization for clustering multidimensional data. We will use two dimensional data array \"data\" however the code can be used for any reasonable size of array. To do this parameter num_dimensions should be set to data array dimension. We\u2026","rel":"","context":"In &quot;Data Mining&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":1446,"url":"http:\/\/intelligentonlinetools.com\/blog\/2017\/11\/06\/10-new-top-resources-on-machine-learning-from-around-the-web\/","url_meta":{"origin":133,"position":4},"title":"10 New Top Resources on Machine Learning from Around the Web","date":"November 6, 2017","format":false,"excerpt":"For this post I put new and most interesting machine learning resources that I recently found on the web. This is the list of useful resources in such areas like stock market forecasting, text mining, deep learning, neural networks and getting data from Twitter. Hope you enjoy the reading. 1.\u2026","rel":"","context":"In &quot;Machine Learning&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":450,"url":"http:\/\/intelligentonlinetools.com\/blog\/2016\/08\/03\/bio-inspired-optimization-for-text-mining-2\/","url_meta":{"origin":133,"position":5},"title":"Bio-Inspired Optimization for Text Mining-2","date":"August 3, 2016","format":false,"excerpt":"Numerical One Dimensional Example In the previous code Bio-Inspired Optimization for Text Mining-1 Motivation we implemented source code for optimization some function using bio-inspired algorithm. Now we need to put actual function for clustering. In clustering we want to group our clusters in such way that the distance from each\u2026","rel":"","context":"In &quot;Data Mining&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/posts\/133"}],"collection":[{"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/comments?post=133"}],"version-history":[{"count":23,"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/posts\/133\/revisions"}],"predecessor-version":[{"id":158,"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/posts\/133\/revisions\/158"}],"wp:attachment":[{"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/media?parent=133"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/categories?post=133"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/intelligentonlinetools.com\/blog\/wp-json\/wp\/v2\/tags?post=133"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}