short-paper

Towards Re-defining Relation Understanding in Financial Domain

Authors:
Chenguang Wang

IBM Research-Almaden

IBM Research-Almaden
View Profile

,
Doug Burdick

IBM Research-Almaden

IBM Research-Almaden
View Profile

,
Laura Chiticariu

IBM Research-Almaden

IBM Research-Almaden
View Profile

,
Rajasekar Krishnamurthy

IBM Research-Almaden

IBM Research-Almaden
View Profile

,
Yunyao Li

IBM Research-Almaden

IBM Research-Almaden
View Profile

,
Huaiyu Zhu

IBM Research-Almaden

IBM Research-Almaden
View Profile

DSMM'17: Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic DatasetsMay 2017Article No.: 8Pages 1–6https://doi.org/10.1145/3077240.3077254

Published:14 May 2017Publication History

DSMM'17: Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets

Pages 1–6

ABSTRACT

We describe our experiences in participating in the scored task for the 2017 FEIII Data Challenge. Our approach is to model the problem as a binary classification problem and train an ensemble model leveraging domain features that capture financial terminology. We share challenge results for our submission, which performed well achieving the highest score in four out of six evaluation criteria. We describe semantic complexities encountered with regards to the task definition and ambiguities in the labeled dataset. We present an alternative task formulation Relationship Validation that addresses some of these semantic complexities and demonstrate how our approach naturally extends to this simplified task definition.

References

Yi-Wei Chen and Chih-Jen Lin. 2006. Combining SVMs with various feature selection strategies. In Feature extraction. Springer Berlin Heidelberg, Berlin, Heidelberg, 315--324.Google Scholar
Laura Chiticariu, Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghavan, Frederick R. Reiss, and Shivakumar Vaithyanathan. 2010. SystemT: An Algebraic Approach to Declarative Information Extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL '10). Association for Computational Linguistics, Stroudsburg, PA, USA, 128--137. Google ScholarDigital Library
DSFIN. 2016. FEIII: Financial Entity Identification and Information Integration. (2016). https://ir.nist.gov/dsfin/Google Scholar
Yoav Freund and Robert E. Schapire. 1995. A Decision-theoretic Generalization of On-line Learning and an Application to Boosting. In Proceedings of the Second European Conference on Computational Learning Theory (EuroCOLT '95). Springer-Verlag, London, UK, UK, 23--37. Google ScholarDigital Library
Jerome Friedman, Trevor Hastie, and Rob Tibshirani. 2010. Regularization paths for generalized linear models via coordinate descent. Journal of statistical software 33, 1 (2010), 1.Google ScholarCross Ref
Louiqa Raschid, Doug Burdick, Mark Flood, John Grant, Joe Langsam, Ian Soboroff, and Elena Zotkina. 2017. Financial Entity Identification and Information Integration (FEIII) Challenge 2017: The Report of the Organizing Committee. In Proceedings of the Workshop on Data Science for Macro-Modeling (DSMM@SIGMOD). ACM, New York, NY, USA. Google ScholarDigital Library
Chenguang Wang, Yangqiu Song, Haoran Li, Ming Zhang, and Jiawei Han. 2016. Text Classification with Heterogeneous Information Network Kernels.. In AAAI. AAAI Press, 2130--2136. Google ScholarDigital Library
Ruihu Wang. 2012. AdaBoost for feature selection, classification and its relation with SVM, a review. Physics Procedia 25 (2012), 800--807.Google ScholarCross Ref
Ji Zhu, Hui Zou, Saharon Rosset, and Trevor Hastie. 2009. Multi-class adaboost. Statistics and its Interface 2, 3 (2009), 349--360.Google Scholar

Recommendations

Natural language question answering in the financial domain
CASCON '18: Proceedings of the 28th Annual International Conference on Computer Science and Software Engineering

This paper describes a natural language question answering system focused on answering financial domain questions using a daily updated corpus of financial reports. Financial entity types of interest included company stocks, country bonds, currencies, ...
Read More
An Ensemble Approach to Financial Entity Matching for the FEIII 2016 Challenge
DSMM'16: Proceedings of the Second International Workshop on Data Science for Macro-Modeling

Financial entities are often referred to with ambiguous descriptions and identifiers. To tackle this issue, the Financial Entity Identification and Information Integration1 (FEIII) Challenge requires participants to automatically reconcile financial ...
Read More
Chinese text classification by the Naïve Bayes Classifier and the associative classifier with multiple confidence threshold values

Each type of classifier has its own advantages as well as certain shortcomings. In this paper, we take the advantages of the associative classifier and the Naive Bayes Classifier to make up the shortcomings of each other, thus improving the accuracy of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

DSMM'17: Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets
May 2017
58 pages
ISBN:9781450350310
DOI:10.1145/3077240

Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 14 May 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
FEIII
Financial Domain
Information Extraction
Relation Understanding
Text Classification
Qualifiers
- short-paper
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate32of64submissions,50%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 79
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Towards Re-defining Relation Understanding in Financial Domain

DSMM'17: Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets

ABSTRACT

References

Cited By

Recommendations

Natural language question answering in the financial domain

An Ensemble Approach to Financial Entity Matching for the FEIII 2016 Challenge

Chinese text classification by the Naïve Bayes Classifier and the associative classifier with multiple confidence threshold values

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Towards Re-defining Relation Understanding in Financial Domain

DSMM'17: Proceedings of the 3rd International Workshop on Data Science for Macro--Modeling with Financial and Economic Datasets

ABSTRACT

References

Cited By

Recommendations

Natural language question answering in the financial domain

An Ensemble Approach to Financial Entity Matching for the FEIII 2016 Challenge

Chinese text classification by the Naïve Bayes Classifier and the associative classifier with multiple confidence threshold values

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media