article

Selection conditions in main memory

Author:
Kenneth A. Ross

Columbia University, New York, New York

Columbia University, New York, New York
View Profile

Authors Info & Claims

ACM Transactions on Database Systems Volume 29 Issue 1pp 132–161https://doi.org/10.1145/974750.974755

Published:01 March 2004Publication History

ACM Transactions on Database Systems

Abstract

We consider the fundamental operation of applying a compound filtering condition to a set of records. With large main memories available cheaply, systems may choose to keep the data entirely in main memory, in order to improve query and/or update performance.The design of a data-intensive algorithm in main memory needs to take into account the architectural characteristics of modern processors, just as a disk-based method needs to consider the physical characteristics of disk devices. An important architectural feature that influences the performance of main memory algorithms is the branch misprediction penalty. We demonstrate that branch misprediction has a substantial impact on the performance of an algorithm for applying selection conditions.We describe a space of "query plans" that are logically equivalent, but differ in terms of performance due to variations in their branch prediction behavior. We propose a cost model that takes branch prediction into account, and develop a query optimization algorithm that chooses a plan with optimal estimated cost for conjunctive conditions. We also develop an efficient heuristic optimization algorithm. We also show how records can be ordered to further reduce branch misprediction effects.We provide experimental results for a case study based on an event notification system. Our results show the effectiveness of the proposed optimization techniques. Our results also demonstrate that significant improvements in performance can be obtained by applying a methodology that takes branch misprediction latency into account.

References

Ailamaki, A., DeWitt, D., Hill, M., and Wood, D. 1999. DBMSs on a modern processor: Where does time go. In Proceedings of the VLDB Conference. 266--277. Google ScholarDigital Library
Ailamaki, A., DeWitt, D. J., Hill, M. D., and Skounakis, M. 2001. Weaving relations for cache performance. In Proceedings of the VLDB Conference. 169--180. Google ScholarDigital Library
Bhat, G. S. and Savage, C. D. 1996. Balanced Gray codes. Elect. J. Combin. 3, 1, R25.Google ScholarCross Ref
Boncz, P. A., Manegold, S., and Kersten, M. L. 1999. Database architecture optimized for the new bottleneck: Memory access. In Proceedings of the 25th VLDB Conference. 54--65. Google ScholarDigital Library
Cayley, A. 1859. On the theory of the analytical forms called trees II. Phil. Mag. 18, 374--378.Google ScholarCross Ref
Consel, C. and Noel, F. 1996. A general approach for run-time specialization and its application to C. In Proceedings of the Symposium on Principles of Programming Languages. 145--156. Google ScholarDigital Library
Fabret, F., Jacobsen, H.-A., LLirbat, F., Pereira, J., Ross, K. A., and Shasha, D. 2001. Filtering algorithms and implementation for very fast publish/subscribe. In Proceedings of the ACM SIGMOD Conference. ACM, New York. 115--126. Google ScholarDigital Library
Garcia-Molina, H. and Salem, K. 1992. Main memory database systems: An overview. IEEE Trans. Knowl. Data Eng. 4, 6, 509--516. Google ScholarDigital Library
Gray, J. and Shenoy, P. J. 2000. Rules of thumb in data engineering. In Proceedings of the International Conference on Data Engineering. 3--12. Google ScholarDigital Library
Heller, D. 2000. Rabbit: A performance counters library for intel/amd processors and linux. http://www.scl.ameslab.gov/Projects/Rabbit/.Google Scholar
Hellerstein, J. M. and Stonebraker, M. 1993. Predicate migration: Optimizing queries with expensive predicates. In Proceedings of the ACM SIGMOD Conference. ACM, New York. 267--276. Google ScholarDigital Library
Intel Corp. 1999. Intel Architecture Optimization: Reference Manual.Google Scholar
Intel Corp. 2000. Intel IA-64 Architecture Software Developer's Manual, Volume 1 Rev. 1.0. Available at http://developer.intel.com/design/ia-64/manuals/.Google Scholar
Lehman, T. J., Shekita, E. J., and Cabrera, L.-F. 1992. An evaluation of starburst's memory resident storage component. IEEE Trans. Knowl. Data Eng. 4, 6, 555--566. Google ScholarDigital Library
Manegold, S., Boncz, P. A., and Kersten, M. L. 2000. What happens during a join? Dissecting CPU and memory optimization effects. In Proceedings of the VLDB Conference. 339--350. Google ScholarDigital Library
Noel, F., Hornof, L., Consel, C., and Lawall, J. L. 1998. Automatic, template-based run-time specialization: Implementation and experimental study. In Proceedings of the International Conference on Computer Languages. 132--142. Google ScholarDigital Library
Pereira, J., Fabret, F., Llirbat, F., Preotiuc-Pietro, R., Ross, K. A., and Shasha, D. 2000. Publish/ subscribe on the web at extreme speed. In Proceedings of the VLDB Conference. 627--630. Google ScholarDigital Library
Pucheral, P., Thevenin, J.-M., and Valduriez, P. 1990. Efficient main memory data management using the DBGraph storage model. In Proceedings of the International Conference on Very Large Databases. 683--695. Google ScholarDigital Library
Rao, J. and Ross, K. A. 1999. Cache conscious indexing for decision-support in main memory. In Proceedings of the 25th VLDB Conference. 78--89. Google ScholarDigital Library
Rao, J. and Ross, K. A. 2000. Making B+-trees cache conscious in main memory. In Proceedings ACM SIGMOD Conference. ACM, New York, 475--486. Google ScholarDigital Library
Savage, C. D. 1997. A survey of combinatorial Gray codes. SIAM Review 39, 4, 605--629. Google ScholarDigital Library
Shatdal, A., Kant, C., and Naughton, J. F. 1994. Cache conscious algorithms for relational query processing. In Proceedings of the 20th VLDB Conference. 510--521. Google ScholarDigital Library
Sloane, N. J. A. 2000. The on-line encyclopedia of integer sequences. published electronically at http://www.research.att.com/&sim;njas/sequences. Google ScholarDigital Library
Vanderwiel, S. P. and Lilja, D. J. 2000. Data prefetch mechanisms. ACM Comput. Surv. 32, 2, 174--199. Google ScholarDigital Library
Whang, K.-Y. and Krishnamurthy, R. 1990. Query optimization in a memory-resident domain relational calculus database system. ACM Transactions on Database Systems 15, 1, 67--95. Google ScholarDigital Library
Wilf, H. S. 1990. Generatingfunctionology. Academic Press, New York. Google ScholarDigital Library
Yung, R. 1996. Design of the UltraSPARC instruction fetch unit. Tech. Rep. SMLI TR-96-59, Sun Microsystems Laboratories. Google ScholarDigital Library
Zhou, J. and Ross, K. A. 2002. Implementing database operations using SIMD instructions. In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, New York. 145--156. Google ScholarDigital Library

Index Terms

Selection conditions in main memory

Recommendations

Fast branch misprediction recovery in out-of-order superscalar processors
ICS '05: Proceedings of the 19th annual international conference on Supercomputing

Current trends in modern out-of-order processors involve implementing deeper pipelines and a large instruction window to achieve high performance. However, as pipeline depth increases, the branch misprediction penalty becomes a critical factor in ...
Read More
The Misprediction Recovery Cache

In modern processors, deep pipelines couple with superscalar techniques to allow each pipe stage to process multiple instructions. When such a pipe must be flushed and refilled, as when predicted program flow beyond a branch is subsequently recognized ...
Read More
Ginger: control independence using tag rewriting
ISCA '07: Proceedings of the 34th annual international symposium on Computer architecture

The negative performance impact of branch mis-predictions can be reduced by exploiting control independence (CI). When a branch mis-predicts, the wrong-path instructions up to the point where control converges with the correct path are selectively ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Database Systems Volume 29, Issue 1
March 2004
232 pages
ISSN:0362-5915
EISSN:1557-4644
DOI:10.1145/974750
Issue’s Table of Contents

Copyright © 2004 ACM
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 March 2004
Published in tods Volume 29, Issue 1

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Branch misprediction
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 35
  Total Citations
  View Citations
- 1,301
  Total Downloads
- Downloads (Last 12 months)37
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Selection conditions in main memory

ACM Transactions on Database Systems

Abstract

References

Cited By

Index Terms

Recommendations

Fast branch misprediction recovery in out-of-order superscalar processors

The Misprediction Recovery Cache

Ginger: control independence using tag rewriting

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Selection conditions in main memory

ACM Transactions on Database Systems

Abstract

References

Cited By

Index Terms

Recommendations

Fast branch misprediction recovery in out-of-order superscalar processors

The Misprediction Recovery Cache

Ginger: control independence using tag rewriting

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media