Performance and Improvement of Linguistic Data Analysis on Different Languages Using Deep Learnedness Techniques

Authors

  • Venkateswaran Radhakrishnan Faculty-Information Technology Department, College of Computing and Information Sciences, University of Technology and Applied Sciences-Salalah, Oman Author
  • Asadi Srinivasulu Researcher, Global Canter for Environmental remediation, College of Engineering, Science and Environment, The University of Newcastle, Australia Author
  • Suresh Palarimath Faculty-Information Technology Department, College of Computing and Information Sciences, University of Technology and Applied Sciences-Salalah, Oman Author
  • Rogelio Gutierrez Faculty-Information Technology Department, College of Computing and Information Sciences, University of Technology and Applied Sciences-Salalah, Oman Author
  • Kumar C Praveen Faculty-Information Technology Department, College of Computing and Information Sciences, University of Technology and Applied Sciences-Salalah, Oman Author

Keywords:

Linguistics Data, Autoencoders, Deep learnedness, Prediction, CNN, RNN, ECNN, ERNN, Data minelaying, Feature Selection, Data Pre-processing

Abstract

India is an assorted country with various  kinds of societies in each edge of the country. This is an  organization of 30 states and 8 association regions with  various dialects and particular social legacy. Thus, there  have been many discussions on the beginning of the  constitution, and the worry of the public language. It is said  about the language of India, "The language expressed in  India changes very much like the flavour of water changes  in India each couple of kilometres". Nonetheless, India does  not have a public language as there is an impressive contrast  between an authority language and a public language.  infusion synopsis of message archives generally comprises  of positioning the record string of words and separating the  highest-level string of words subject to the rundown  dimension requirements. We investigate and commitment  of different directed acquisition calculations to the string of  words positioning assignment. That is the reason, we  present an original string of words positioning philosophy  in view of the similitude score between an up-and-comer  string of words and touchstone rundowns. The famous  direct relapse framework accomplished the high-grade  outcomes in undeniably assessed datasets. Moreover, the  direct relapse framework, which included Part-of-Speech  supported highlights, beat the same with factual elements as  it were. The proposed framework beat than the current  framework with boundaries/measurements as precision  (81.43%), blunder rate (0.13), val_loss (0.41), val_accuracy  (0.50), size of dataset utilized in research (1.30 GB), No. of  ages (50), Time-intricacy (O(n2)) and execution time (1012  ms). 

Downloads

Download data is not yet available.

References

Akaike, H. 1974. A New Look at the Statistical framework Identification. IEEE Transactions on Automatic Control 19 (6): 716-723.

Stranieri, A., Zeleznikow, J.: Knowledge Discovery from Legal Databases. In: Law and Philisohy Library. Springer, Heidelberg (2005).

Al-Hashemi, R. 2010. Text Summarization Extraction System (TSES) Using Extracted Keywords. International Arab Journal of e-Technology 1 (4): 164-168.

Tong, R., Appelbaum, L.: Machine learnedness for knowledge-supported document routing. In: Harman (ed.) Proceedings of the 2nd Text Retrieval Conference (1994).

Breiman, L., Friedman, J., Stone, C., and Olshen, R. 1984. Classification and regression trees. CRC press.

Sch¨utze, H., Hull, D., Pedersen, J.: A comparison of classifiers and document representations for the routing problem. In: SIGIR 1995, 18th ACM International Conference on Research and Developement in Information Retrieval, Seattle, US, pp. 229–237 (1995).

Fattah, M. A., and Ren, F. 2009. GA, MR, FFNN, PNN and GMM supported frameworks for automatic text summarization. Computer Speech & Language 23 (1): 126- 144.

Mladeni´c, D., Grobelnik, M.: Feature selection for unbalanced class distribution and na¨ıve Bayes. In: ICML 1999, 16th International Conference on Machine learnedness, pp. 258–267 (1999).

Galanis, D., Lampouras, G., and Androutsopoulos, I. 2012. infusion Multi-Document Summarization with Integer Linear Programming and Support Vector Regression. COLING 2012: Technical Papers. Mumbai,India. 911-926.

Joachims, T.: Transductive inference for text classification using support vector machines. In: ICML 1999, 16th International Conference on Machine learnedness (1999).

Gupta, V., and Lehal, G. 2010. A Survey of Text Summarization infusion Techniques. Journal of Emerging Technologies in Web Intelligence 2 (3): 258-268.

A.L. Berger and V.O. Mittal. 2000. OCELOT: A system for summarizing web pages. In Proceedings of the 23rd Anual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 144–151, Athens, Greece.

Ambria, Erik, Soujanya Poria, Devamanyu Hazarika, and Kenneth Kwok. (2018) “SenticNet 5: Discovering conceptual primitives for sentiment analysis by means of context embeddings.”In Proceedings of 32the AAAI Conference on Artificial Intelligence, pp. 1795–1802.

Boguraev and C. Kennedy. 1999. Salience-supported content characterisation of text documents. In Interjit Mani and T. Maybury, Mark, editors, Advances in AutomaticText Summarization,pages 99111. The MIT Press.

Ting, Mary, Rabiah Abdul Kadir, Azreen Azman, Tengku Mohd Tengku Sembok, and Sabir Ismail. (2019) “Named entity enrichment supported on sub-ject-object anaphora resolution.”In Intelligent Computing-Proceedings of the Computing Conference, pp. 873–884. https ://doi.org/10.1007/978-3-030-22868 -2_60

Cohen. 1995. Fast effective rule induction. In Machine learnedness: Proceedings of the Twelfth International Conference.

Stojanovski, Dario, and Alexander Fraser. (2019) “The Improving anaphora resolution in neural machine translation using cur-riculum learnedness.” In Proceedings of the machine translation summit XVII, pp. 140–150.

T.K. Ho. 1998. The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(8):832–844.

Vicedo, Jose L, and Antonio Ferrandez. (2020) “Importance of pronominal anaphora resolution in question answering systems. “In Proceedings of the 38th annual meeting on association for computational linguistics, association for computational linguistics, China. pp. 555–562.

J. Justeson and S. Katz. 1995. Technical terminology: Some linguistic properties and an algorithm for identification in text. Natural Language Engineering, (1):9–27.

Trabelsi, Fériel Ben Fraj, Chiraz Ben Othmane Zribi, and Saoussen Mathlouthi. (2016) “Arabic anaphora resolution using markov decision process.” In International Conference on Intelligent Text Processing and Computational Linguistics, Springer, pp. 520-532.

Downloads

Published

2023-11-30

How to Cite

Performance and Improvement of Linguistic Data Analysis on Different Languages Using Deep Learnedness Techniques . (2023). International Journal of Innovative Research in Computer Science & Technology, 11(6), 32–38. Retrieved from https://acspublisher.com/journals/index.php/ijircst/article/view/12122