The Algorithm Design Manual

Second Edition

Steven S. Skiena

Department of Computer Science
State University of New York at Stony Brook
New York, USA
[email protected]

ISBN: 978-1-84800-069-8
e-ISBN: 978-1-84800-070-4
DOI: 10.1007/978-1-84800-070-4

British Library Cataloguing in Publication Data

A catalogue record for this book is available from the British Library.

Library of Congress Control Number: 2008931136

© Springer-Verlag London Limited 2008, Corrected printing 2012

Apart from any fair dealing for the purposes of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act 1988, this publication may only be reproduced, stored or transmitted, in any form or by any means, with the prior permission in writing of the publishers, or in the case of reprographic reproduction in accordance with the terms of licenses issued by the Copyright Licensing Agency.

Enquiries concerning reproduction outside those terms should be sent to the publishers.

The use of registered names, trademarks, etc., in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant laws and regulations and therefore free for general use.

The publisher makes no representation, express or implied, with regard to the accuracy of the information contained in this book and cannot accept any legal responsibility or liability for any errors or omissions that may be made.

Printed on acid-free paper

Springer Science+Business Media
springer.com

Preface

Most professional programmers that I’ve encountered are not well prepared to tackle algorithm design problems. This is a pity, because the techniques of algorithm design form one of the core practical technologies of computer science. Designing correct, efficient, and implementable algorithms for real-world problems requires access to two distinct bodies of knowledge:

Techniques – Good algorithm designers understand several fundamental algorithm design techniques, including data structures, dynamic programming, depth-first search, backtracking, and heuristics. Perhaps the single most important design technique is modeling, the art of abstracting a messy real-world application into a clean problem suitable for algorithmic attack.

Resources – Good algorithm designers stand on the shoulders of giants. Rather than laboring from scratch to produce a new algorithm for every task, they can figure out what is known about a particular problem. Rather than re-implementing popular algorithms from scratch, they seek existing implementations to serve as a starting point. They are familiar with many classic algorithmic problems, which provide sufficient source material to model most any application.

This book is intended as a manual on algorithm design, providing access to combinatorial algorithm technology for both students and computer professionals. It is divided into two parts: Techniques and Resources. The former is a general guide to techniques for the design and analysis of computer algorithms. The Resources section is intended for browsing and reference, and comprises the catalog of algorithmic resources, implementations, and an extensive bibliography.


To the Reader

I have been gratified by the warm reception the first edition of The Algorithm Design Manual has received since its initial publication in 1997. It has been recognized as a unique guide to using algorithmic techniques to solve problems that often arise in practice. But much has changed in the world since The Algorithm Design Manual was first published over ten years ago. Indeed, if we date the origins of modern algorithm design and analysis to about 1970, then roughly 30% of modern algorithmic history has happened since the first coming of The Algorithm Design Manual.

Three aspects of The Algorithm Design Manual have been particularly beloved: (1) the catalog of algorithmic problems, (2) the war stories, and (3) the electronic component of the book. These features have been preserved and strengthened in this edition:

The Catalog of Algorithmic Problems – Since finding out what is known about an algorithmic problem can be a difficult task, I provide a catalog of the 75 most important problems arising in practice. By browsing through this catalog, the student or practitioner can quickly identify what their problem is called, what is known about it, and how they should proceed to solve it. To aid in problem identification, we include a pair of “before” and “after” pictures for each problem, illustrating the required input and output specifications. One perceptive reviewer called my book “The Hitchhiker’s Guide to Algorithms” on the strength of this catalog.

The catalog is the most important part of this book. To update the catalog for this edition, I have solicited feedback from the world’s leading experts on each associated problem. Particular attention has been paid to updating the discussion of available software implementations for each problem.

War Stories – In practice, algorithm problems do not arise at the beginning of a large project. Rather, they typically arise as subproblems when it becomes clear that the programmer does not know how to proceed or that the current solution is inadequate.

To provide a better perspective on how algorithm problems arise in the real world, we include a collection of “war stories,” or tales from our experience with real problems. The moral of these stories is that algorithm design and analysis is not just theory, but an important tool to be pulled out and used as needed.

This edition retains all the original war stories (with updates as appropriate) plus additional new war stories covering external sorting, graph algorithms, simulated annealing, and other topics.

Electronic Component – Since the practical person is usually looking for a program more than an algorithm, we provide pointers to solid implementations whenever they are available. We have collected these implementations at one central website (http://www.cs.sunysb.edu/∼algorith) for easy retrieval. We have been the number one “Algorithm” site on Google pretty much since the initial publication of the book.

Further, we provide recommendations to make it easier to identify the correct code for the job. With these implementations available, the critical issue in algorithm design becomes properly modeling your application, more so than becoming intimate with the details of the actual algorithm. This focus permeates the entire book.

Equally important is what we do not do in this book. We do not stress the mathematical analysis of algorithms, leaving most of the analysis as informal arguments. You will not find a single theorem anywhere in this book. When more details are needed, the reader should study the cited programs or references. The goal of this manual is to get you going in the right direction as quickly as possible.

To the Instructor

This book covers enough material for a standard Introduction to Algorithms course. We assume the reader has completed the equivalent of a second programming course, typically titled Data Structures or Computer Science II.

A full set of lecture slides for teaching this course is available online at http://www.algorist.com. Further, I make available online audio and video lectures using these slides to teach a full-semester algorithm course. Let me help teach your course, by the magic of the Internet!

This book stresses design over analysis. It is suitable for both traditional lecture courses and the new “active learning” method, where the professor does not lecture but instead guides student groups to solve real problems. The “war stories” provide an appropriate introduction to the active learning method.

I have made several pedagogical improvements throughout the book. Textbook-oriented features include:

More Leisurely Discussion – The tutorial material in the first part of the book has been doubled over the previous edition. The pages have been devoted to more thorough and careful exposition of fundamental material, instead of adding more specialized topics.

False Starts – Algorithms textbooks generally present important algorithms as a fait accompli, obscuring the ideas involved in designing them and the subtle reasons why other approaches fail. The war stories illustrate such development on certain applied problems, but I have expanded such coverage into classical algorithm design material as well.

Stop and Think – Here I illustrate my thought process as I solve a topic-specific homework problem—false starts and all. I have interspersed such problem blocks throughout the text to increase the problem-solving activity of my readers. Answers appear immediately following each problem.

More and Improved Homework Problems – This edition of The Algorithm Design Manual has twice as many homework exercises as the previous one. Exercises that proved confusing or ambiguous have been improved or replaced. Degree of difficulty ratings (from 1 to 10) have been assigned to all problems.

Self-Motivating Exam Design – In my algorithms courses, I promise the students that all midterm and final exam questions will be taken directly from homework problems in this book. This provides a “student-motivated exam,” so students know exactly how to study to do well on the exam. I have carefully picked the quantity, variety, and difficulty of homework exercises to make this work, ensuring there are neither too few nor too many candidate problems.

Take-Home Lessons – Highlighted “take-home” lesson boxes scattered throughout the text emphasize the big-picture concepts to be gained from the chapter.

Links to Programming Challenge Problems – Each chapter’s exercises will contain links to 3-5 relevant “Programming Challenge” problems from http://www.programming-challenges.com. These can be used to add a programming component to paper-and-pencil algorithms courses.

More Code, Less Pseudo-code – More algorithms in this book appear as code (written in C) instead of pseudo-code. I believe the concreteness and reliability of actual tested implementations provides a big win over less formal presentations for simple algorithms. Full implementations are available for study at http://www.algorist.com.

Chapter Notes – Each tutorial chapter concludes with a brief notes section, pointing readers to primary sources and additional references.

Acknowledgments

Updating a book dedication after ten years focuses attention on the effects of time. Since the first edition, Renee has become my wife and then the mother of our two children, Bonnie and Abby. My father has left this world, but Mom and my brothers Len and Rob remain a vital presence in my life. I dedicate this book to my family, new and old, here and departed.

I would like to thank several people for their concrete contributions to this new edition. Andrew Gaun and Betson Thomas helped in many ways, particularly dealing with a variety of manuscript preparation issues. David Gries offered valuable feedback well beyond the call of duty. Himanshu Gupta and Bin Tang bravely helped in building the infrastructure for the new http://www.cs.sunysb.edu/∼algorith and taught courses using a manuscript version of this edition. Thanks also to my Springer-Verlag editors, Wayne Wheeler and Allan Wylde.

A select group of algorithmic sages reviewed sections of the Hitchhiker’s guide, sharing their wisdom and alerting me to new developments. Thanks to:

Ami Amir, Herve Bronnimann, Bernard Chazelle, Chris Chu, Scott Cotton, Yefim Dinitz, Komei Fukuda, Michael Goodrich, Lenny Heath, Cihat Imamoglu, Tao Jiang, David Karger, Giuseppe Liotta, Albert Mao, Silvano Martello, Catherine McGeoch, Kurt Mehlhorn, Scott A. Mitchell, Naceur Meskini, Gene Myers, Gonzalo Navarro, Stephen North, Joe O’Rourke, Mike Paterson, Theo Pavlidis, Seth Pettie, Michel Pocchiola, Bart Preneel, Tomasz Radzik, Edward Reingold, Frank Ruskey, Peter Sanders, Joao Setubal, Jonathan Shewchuk, Robert Skeel, Jens Stoye, Torsten Suel, Bruce Watson, and Uri Zwick.

Several exercises were originated by colleagues or inspired by other texts. Reconstructing the original sources years later can be challenging, but credits for each problem (to the best of my recollection) appear on the website.

It would be rude not to thank important contributors to the original edition. Ricky Bradley and Dario Vlah built up the substantial infrastructure required for the WWW site in a logical and extensible manner. Zhong Li drew most of the catalog figures using xfig. Richard Crandall, Ron Danielson, Takis Metaxas, Dave Miller, Giri Narasimhan, and Joe Zachary all reviewed preliminary versions of the first edition; their thoughtful feedback helped to shape what you see here.

Much of what I know about algorithms I learned along with my graduate students. Several of them (Yaw-Ling Lin, Sundaram Gopalakrishnan, Ting Chen, Francine Evans, Harald Rau, Ricky Bradley, and Dimitris Margaritis) are the real heroes of the war stories related within. My Stony Brook friends and algorithm colleagues Estie Arkin, Michael Bender, Jie Gao, and Joe Mitchell have always been a pleasure to work and be with. Finally, thanks to Michael Brochstein and the rest of the city contingent for revealing a proper life well beyond Stony Brook.

Caveat

It is traditional for the author to magnanimously accept the blame for whatever deficiencies remain. I don’t. Any errors, deficiencies, or problems in this book are somebody else’s fault, but I would appreciate knowing about them so as to deter- mine who is to blame.

Steven S. Skiena
Department of Computer Science
Stony Brook University
Stony Brook, NY 11794-4400
http://www.cs.sunysb.edu/∼skiena
April 2008


Contents

I Practical Algorithm Design 1

1 Introduction to Algorithm Design 3

1.1 Robot Tour Optimization . . . 5

1.2 Selecting the Right Jobs . . . 9

1.3 Reasoning about Correctness . . . 11

1.4 Modeling the Problem . . . 19

1.5 About the War Stories . . . 22

1.6 War Story: Psychic Modeling . . . 23

1.7 Exercises . . . 27

2 Algorithm Analysis 31

2.1 The RAM Model of Computation . . . 31

2.2 The Big Oh Notation . . . 34

2.3 Growth Rates and Dominance Relations . . . 37

2.4 Working with the Big Oh . . . 40

2.5 Reasoning About Efficiency . . . 41

2.6 Logarithms and Their Applications . . . 46

2.7 Properties of Logarithms . . . 50

2.8 War Story: Mystery of the Pyramids . . . 51

2.9 Advanced Analysis (*) . . . 54

2.10 Exercises . . . 57

3 Data Structures 65

3.1 Contiguous vs. Linked Data Structures . . . 66


3.2 Stacks and Queues . . . 71

3.3 Dictionaries . . . 72

3.4 Binary Search Trees . . . 77

3.5 Priority Queues . . . 83

3.6 War Story: Stripping Triangulations . . . 85

3.7 Hashing and Strings . . . 89

3.8 Specialized Data Structures . . . 93

3.9 War Story: String ’em Up . . . 94

3.10 Exercises . . . 98

4 Sorting and Searching 103

4.1 Applications of Sorting . . . 104

4.2 Pragmatics of Sorting . . . 107

4.3 Heapsort: Fast Sorting via Data Structures . . . 108

4.4 War Story: Give me a Ticket on an Airplane . . . 118

4.5 Mergesort: Sorting by Divide-and-Conquer . . . 120

4.6 Quicksort: Sorting by Randomization . . . 123

4.7 Distribution Sort: Sorting via Bucketing . . . 129

4.8 War Story: Skiena for the Defense . . . 131

4.9 Binary Search and Related Algorithms . . . 132

4.10 Divide-and-Conquer . . . 135

4.11 Exercises . . . 139

5 Graph Traversal 145

5.1 Flavors of Graphs . . . 146

5.2 Data Structures for Graphs . . . 151

5.3 War Story: I was a Victim of Moore’s Law . . . 155

5.4 War Story: Getting the Graph . . . 158

5.5 Traversing a Graph . . . 161

5.6 Breadth-First Search . . . 162

5.7 Applications of Breadth-First Search . . . 166

5.8 Depth-First Search . . . 169

5.9 Applications of Depth-First Search . . . 172

5.10 Depth-First Search on Directed Graphs . . . 178

5.11 Exercises . . . 184

6 Weighted Graph Algorithms 191

6.1 Minimum Spanning Trees . . . 192

6.2 War Story: Nothing but Nets . . . 202

6.3 Shortest Paths . . . 205

6.4 War Story: Dialing for Documents . . . 212

6.5 Network Flows and Bipartite Matching . . . 217

6.6 Design Graphs, Not Algorithms . . . 222

6.7 Exercises . . . 225


7 Combinatorial Search and Heuristic Methods 230

7.1 Backtracking . . . 231

7.2 Search Pruning . . . 238

7.3 Sudoku . . . 239

7.4 War Story: Covering Chessboards . . . 244

7.5 Heuristic Search Methods . . . 247

7.6 War Story: Only it is Not a Radio . . . 260

7.7 War Story: Annealing Arrays . . . 263

7.8 Other Heuristic Search Methods . . . 266

7.9 Parallel Algorithms . . . 267

7.10 War Story: Going Nowhere Fast . . . 268

7.11 Exercises . . . 270

8 Dynamic Programming 273

8.1 Caching vs. Computation . . . 274

8.2 Approximate String Matching . . . 280

8.3 Longest Increasing Sequence . . . 289

8.4 War Story: Evolution of the Lobster . . . 291

8.5 The Partition Problem . . . 294

8.6 Parsing Context-Free Grammars . . . 298

8.7 Limitations of Dynamic Programming: TSP . . . 301

8.8 War Story: What’s Past is Prolog . . . 304

8.9 War Story: Text Compression for Bar Codes . . . 307

8.10 Exercises . . . 310

9 Intractable Problems and Approximation Algorithms 316

9.1 Problems and Reductions . . . 317

9.2 Reductions for Algorithms . . . 319

9.3 Elementary Hardness Reductions . . . 323

9.4 Satisfiability . . . 328

9.5 Creative Reductions . . . 330

9.6 The Art of Proving Hardness . . . 334

9.7 War Story: Hard Against the Clock . . . 337

9.8 War Story: And Then I Failed . . . 339

9.9 P vs. NP . . . 341

9.10 Dealing with NP-complete Problems . . . 344

9.11 Exercises . . . 350

10 How to Design Algorithms 356

II The Hitchhiker’s Guide to Algorithms 361

11 A Catalog of Algorithmic Problems 363


12 Data Structures 366

12.1 Dictionaries . . . 367

12.2 Priority Queues . . . 373

12.3 Suffix Trees and Arrays . . . 377

12.4 Graph Data Structures . . . 381

12.5 Set Data Structures . . . 385

12.6 Kd-Trees . . . 389

13 Numerical Problems 393

13.1 Solving Linear Equations . . . 395

13.2 Bandwidth Reduction . . . 398

13.3 Matrix Multiplication . . . 401

13.4 Determinants and Permanents . . . 404

13.5 Constrained and Unconstrained Optimization . . . 407

13.6 Linear Programming . . . 411

13.7 Random Number Generation . . . 415

13.8 Factoring and Primality Testing . . . 420

13.9 Arbitrary-Precision Arithmetic . . . 423

13.10 Knapsack Problem . . . 427

13.11 Discrete Fourier Transform . . . 431

14 Combinatorial Problems 434

14.1 Sorting . . . 436

14.2 Searching . . . 441

14.3 Median and Selection . . . 445

14.4 Generating Permutations . . . 448

14.5 Generating Subsets . . . 452

14.6 Generating Partitions . . . 456

14.7 Generating Graphs . . . 460

14.8 Calendrical Calculations . . . 465

14.9 Job Scheduling . . . 468

14.10 Satisfiability . . . 472

15 Graph Problems: Polynomial-Time 475

15.1 Connected Components . . . 477

15.2 Topological Sorting . . . 481

15.3 Minimum Spanning Tree . . . 484

15.4 Shortest Path . . . 489

15.5 Transitive Closure and Reduction . . . 495

15.6 Matching . . . 498

15.7 Eulerian Cycle/Chinese Postman . . . 502

15.8 Edge and Vertex Connectivity . . . 505

15.9 Network Flow . . . 509

15.10 Drawing Graphs Nicely . . . 513


15.11 Drawing Trees . . . 517

15.12 Planarity Detection and Embedding . . . 520

16 Graph Problems: Hard Problems 523

16.1 Clique . . . 525

16.2 Independent Set . . . 528

16.3 Vertex Cover . . . 530

16.4 Traveling Salesman Problem . . . 533

16.5 Hamiltonian Cycle . . . 538

16.6 Graph Partition . . . 541

16.7 Vertex Coloring . . . 544

16.8 Edge Coloring . . . 548

16.9 Graph Isomorphism . . . 550

16.10 Steiner Tree . . . 555

16.11 Feedback Edge/Vertex Set . . . 559

17 Computational Geometry 562

17.1 Robust Geometric Primitives . . . 564

17.2 Convex Hull . . . 568

17.3 Triangulation . . . 572

17.4 Voronoi Diagrams . . . 576

17.5 Nearest Neighbor Search . . . 580

17.6 Range Search . . . 584

17.7 Point Location . . . 587

17.8 Intersection Detection . . . 591

17.9 Bin Packing . . . 595

17.10 Medial-Axis Transform . . . 598

17.11 Polygon Partitioning . . . 601

17.12 Simplifying Polygons . . . 604

17.13 Shape Similarity . . . 607

17.14 Motion Planning . . . 610

17.15 Maintaining Line Arrangements . . . 614

17.16 Minkowski Sum . . . 617

18 Set and String Problems 620

18.1 Set Cover . . . 621

18.2 Set Packing . . . 625

18.3 String Matching . . . 628

18.4 Approximate String Matching . . . 631

18.5 Text Compression . . . 637

18.6 Cryptography . . . 641

18.7 Finite State Machine Minimization . . . 646

18.8 Longest Common Substring/Subsequence . . . 650

18.9 Shortest Common Superstring . . . 654


19 Algorithmic Resources 657

19.1 Software Systems . . . 657

19.2 Data Sources . . . 663

19.3 Online Bibliographic Resources . . . 663

19.4 Professional Consulting Services . . . 664

Bibliography 665

Index 709


Part I

Practical Algorithm Design


1 Introduction to Algorithm Design

What is an algorithm? An algorithm is a procedure to accomplish a specific task. An algorithm is the idea behind any reasonable computer program.

To be interesting, an algorithm must solve a general, well-specified problem. An algorithmic problem is specified by describing the complete set of instances it must work on and of its output after running on one of these instances. This distinction, between a problem and an instance of a problem, is fundamental. For example, the algorithmic problem known as sorting is defined as follows:

Problem: Sorting

Input: A sequence of n keys a_1, . . . , a_n.

Output: The permutation (reordering) of the input sequence such that a_1 ≤ a_2 ≤ · · · ≤ a_{n−1} ≤ a_n.

An instance of sorting might be an array of names, like {Mike, Bob, Sally, Jill, Jan}, or a list of numbers like {154, 245, 568, 324, 654, 324}. Determining that you are dealing with a general problem is your first step towards solving it.

An algorithm is a procedure that takes any of the possible input instances and transforms it to the desired output. There are many different algorithms for solving the problem of sorting. For example, insertion sort is a method for sorting that starts with a single element (thus forming a trivially sorted list) and then incrementally inserts the remaining elements so that the list stays sorted. This algorithm, implemented in C, is described below:



Figure 1.1: Animation of insertion sort in action (time flows down)

insertion_sort(item s[], int n)
{
        int i, j;                  /* counters */

        for (i = 1; i < n; i++) {
                j = i;
                while ((j > 0) && (s[j] < s[j-1])) {
                        swap(&s[j], &s[j-1]);
                        j = j - 1;
                }
        }
}

An animation of the logical flow of this algorithm on a particular instance (the letters in the word “INSERTIONSORT”) is given in Figure 1.1.

Note the generality of this algorithm. It works just as well on names as it does on numbers, given the appropriate comparison operation (<) to test which of the two keys should appear first in sorted order. It can be readily verified that this algorithm correctly orders every possible input instance according to our definition of the sorting problem.
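To make this generality concrete, here is a sketch of how the same algorithm could be parameterized by a qsort-style comparison callback, so it sorts any element type. The function names and the fixed-size element buffer are our own assumptions, not code from the book:

#include <string.h>

/* Generic insertion sort: sorts n elements of 'width' bytes each, ordered
   by the user-supplied comparison function cmp (as in qsort). Assumes
   elements are at most 64 bytes wide. */
void insertion_sort_generic(void *base, int n, size_t width,
                            int (*cmp)(const void *, const void *))
{
    char *a = (char *) base;
    char tmp[64];                /* scratch space for one element */
    int i, j;

    for (i = 1; i < n; i++) {
        j = i;
        while ((j > 0) && (cmp(a + j*width, a + (j-1)*width) < 0)) {
            memcpy(tmp, a + j*width, width);      /* swap s[j] and s[j-1] */
            memcpy(a + j*width, a + (j-1)*width, width);
            memcpy(a + (j-1)*width, tmp, width);
            j = j - 1;
        }
    }
}

Sorting an array of names then requires only a comparison callback that dereferences its char * arguments and calls strcmp.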

There are three desirable properties for a good algorithm. We seek algorithms that are correct and efficient, while being easy to implement. These goals may not be simultaneously achievable. In industrial settings, any program that seems to give good enough answers without slowing the application down is often acceptable, regardless of whether a better algorithm exists. The issue of finding the best possible answer or achieving maximum efficiency usually arises in industry only after serious performance or legal troubles.

In this chapter, we will focus on the issues of algorithm correctness, and defer a discussion of efficiency concerns to Chapter 2. It is seldom obvious whether a given algorithm correctly solves a given problem. Correct algorithms usually come with a proof of correctness, which is an explanation of why we know that the algorithm must take every instance of the problem to the desired result. However, before we go further we demonstrate why “it’s obvious” never suffices as a proof of correctness, and is usually flat-out wrong.

Figure 1.2: A good instance for the nearest-neighbor heuristic

1.1 Robot Tour Optimization

Let’s consider a problem that arises often in manufacturing, transportation, and testing applications. Suppose we are given a robot arm equipped with a tool, say a soldering iron. In manufacturing circuit boards, all the chips and other components must be fastened onto the substrate. More specifically, each chip has a set of contact points (or wires) that must be soldered to the board. To program the robot arm for this job, we must first construct an ordering of the contact points so the robot visits (and solders) the first contact point, then the second point, third, and so forth until the job is done. The robot arm then proceeds back to the first contact point to prepare for the next board, thus turning the tool-path into a closed tour, or cycle.

Robots are expensive devices, so we want the tour that minimizes the time it takes to assemble the circuit board. A reasonable assumption is that the robot arm moves with fixed speed, so the time to travel between two points is proportional to their distance. In short, we must solve the following algorithm problem:

Problem: Robot Tour Optimization

Input: A set S of n points in the plane.

Output: What is the shortest cycle tour that visits each point in the set S?

You are given the job of programming the robot arm. Stop right now and think up an algorithm to solve this problem. I’ll be happy to wait until you find one. . .


Several algorithms might come to mind to solve this problem. Perhaps the most popular idea is the nearest-neighbor heuristic. Starting from some point p_0, we walk first to its nearest neighbor p_1. From p_1, we walk to its nearest unvisited neighbor, thus excluding only p_0 as a candidate. We now repeat this process until we run out of unvisited points, after which we return to p_0 to close off the tour. Written in pseudo-code, the nearest-neighbor heuristic looks like this:

NearestNeighbor(P)
        Pick and visit an initial point p_0 from P
        p = p_0
        i = 0
        While there are still unvisited points
                i = i + 1
                Select p_i to be the closest unvisited point to p_{i−1}
                Visit p_i
        Return to p_0 from p_{n−1}
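Rendered as real C (a sketch under our own assumptions: the point type, the function names, and the quadratic scan below are ours, not the book's), the heuristic might look like this:

#include <stdbool.h>
#include <stdlib.h>

typedef struct {
    double x, y;                 /* coordinates of a point in the plane */
} point;

/* Squared Euclidean distance: monotone in the true distance, so it is
   safe for nearest-neighbor comparisons and avoids the square root. */
static double dist2(point a, point b)
{
    double dx = a.x - b.x, dy = a.y - b.y;
    return dx*dx + dy*dy;
}

/* Nearest-neighbor heuristic: writes a visiting order into tour[0..n-1],
   starting from p[0]; the tour implicitly closes by returning to p[0].
   Runs in O(n^2) time. */
void nearest_neighbor_tour(point p[], int n, int tour[])
{
    bool *visited = calloc(n, sizeof(bool));
    int i, j, cur = 0;

    tour[0] = 0;
    visited[0] = true;
    for (i = 1; i < n; i++) {
        int best = -1;
        for (j = 0; j < n; j++)          /* find closest unvisited point */
            if (!visited[j] && (best < 0 ||
                dist2(p[cur], p[j]) < dist2(p[cur], p[best])))
                best = j;
        tour[i] = best;
        visited[best] = true;
        cur = best;
    }
    free(visited);
}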

This algorithm has a lot to recommend it. It is simple to understand and implement. It makes sense to visit nearby points before we visit faraway points to reduce the total travel time. The algorithm works perfectly on the example in Figure 1.2.

The nearest-neighbor rule is reasonably efficient, for it looks at each pair of points (p_i, p_j) at most twice: once when adding p_i to the tour, the other when adding p_j. Against all these positives there is only one problem. This algorithm is completely wrong.

Wrong? How can it be wrong? The algorithm always finds a tour, but it doesn’t necessarily find the shortest possible tour. It doesn’t necessarily even come close.

Consider the set of points in Figure 1.3, all of which lie spaced along a line. The numbers describe the distance that each point lies to the left or right of the point labeled ‘0’. When we start from the point ‘0’ and repeatedly walk to the nearest unvisited neighbor, we might keep jumping left-right-left-right over ‘0’ as the algorithm offers no advice on how to break ties. A much better (indeed optimal) tour for these points starts from the leftmost point and visits each point as we walk right before returning at the rightmost point.

Try now to imagine your boss’s delight as she watches a demo of your robot arm hopscotching left-right-left-right during the assembly of such a simple board.

“But wait,” you might be saying. “The problem was in starting at point ‘0’. Instead, why don’t we start the nearest-neighbor rule using the leftmost point as the initial point p_0? By doing this, we will find the optimal solution on this instance.”

That is 100% true, at least until we rotate our example 90 degrees. Now all points are equally leftmost. If the point ‘0’ were moved just slightly to the left, it would be picked as the starting point. Now the robot arm will hopscotch up-down-up-down instead of left-right-left-right, but the travel time will be just as bad as before. No matter what you do to pick the first point, the nearest-neighbor rule is doomed to work incorrectly on certain point sets.


Figure 1.3: A bad instance for the nearest-neighbor heuristic, with the optimal solution

Maybe what we need is a different approach. Always walking to the closest point is too restrictive, since it seems to trap us into making moves we didn’t want. A different idea might be to repeatedly connect the closest pair of endpoints whose connection will not create a problem, such as premature termination of the cycle. Each vertex begins as its own single vertex chain. After merging everything together, we will end up with a single chain containing all the points in it. Connecting the final two endpoints gives us a cycle. At any step during the execution of this closest-pair heuristic, we will have a set of single vertices and vertex-disjoint chains available to merge. In pseudocode:

ClosestPair(P)
        Let n be the number of points in set P.
        For i = 1 to n − 1 do
                d = ∞
                For each pair of endpoints (s, t) from distinct vertex chains
                        if dist(s, t) ≤ d then s_m = s, t_m = t, and d = dist(s, t)
                Connect (s_m, t_m) by an edge
        Connect the two endpoints by an edge

This closest-pair rule does the right thing in the example in Figure 1.3. It starts by connecting ‘0’ to its immediate neighbors, the points 1 and −1. Subsequently, the next closest pair will alternate left-right, growing the central path by one link at a time. The closest-pair heuristic is somewhat more complicated and less efficient than the previous one, but at least it gives the right answer in this example.

But this is not true in all examples. Consider what this algorithm does on the point set in Figure 1.4(l). It consists of two rows of equally spaced points, with the rows slightly closer together (distance 1 − e) than the neighboring points are spaced within each row (distance 1 + e). Thus the closest pairs of points stretch across the gap, not around the boundary. After we pair off these points, the closest remaining pairs will connect these pairs alternately around the boundary. The total path length of the closest-pair tour is 3(1 − e) + 2(1 + e) + √((1 − e)² + (2 + 2e)²). Compared to the tour shown in Figure 1.4(r), we travel over 20% farther than necessary when e → 0. Examples exist where the penalty is considerably worse than this.

Figure 1.4: A bad instance for the closest-pair heuristic, with the optimal solution

Thus this second algorithm is also wrong. Which one of these algorithms performs better? You can’t tell just by looking at them. Clearly, both heuristics can end up with very bad tours on very innocent-looking input.

At this point, you might wonder what a correct algorithm for our problem looks like. Well, we could try enumerating all possible orderings of the set of points, and then select the ordering that minimizes the total length:

OptimalTSP(P)
        d = ∞
        For each of the n! permutations P_i of point set P
                If (cost(P_i) ≤ d) then d = cost(P_i) and P_min = P_i
        Return P_min

Since all possible orderings are considered, we are guaranteed to end up with the shortest possible tour. This algorithm is correct, since we pick the best of all the possibilities. But it is also extremely slow. The fastest computer in the world couldn’t hope to enumerate all the 20! = 2,432,902,008,176,640,000 orderings of 20 points within a day. For real circuit boards, where n ≈ 1,000, forget about it. All of the world’s computers working full time wouldn’t come close to finishing the problem before the end of the universe, at which point it presumably becomes moot.
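To put a number on this, suppose (a rough, hypothetical figure) that a computer could evaluate one billion tours per second. Checking all 20! ≈ 2.43 × 10^18 orderings would then take about 2.43 × 10^9 seconds, which is more than 77 years for a single 20-point board.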

The quest for an efficient algorithm to solve this problem, called the traveling salesman problem (TSP), will take us through much of this book. If you need to know how the story ends, check out the catalog entry for the traveling salesman problem in Section 16.4 (page 533).

(The projects shown: The President’s Algorist, Halting State, “Discrete” Mathematics, Calculated Bets, Programming Challenges, Steiner’s Tree, Process Terminated, Tarjan of the Jungle, and The Four Volume Problem.)

Figure 1.5: An instance of the non-overlapping movie scheduling problem

Take-Home Lesson: There is a fundamental difference between algorithms, which always produce a correct result, and heuristics, which may usually do a good job but without providing any guarantee.

1.2 Selecting the Right Jobs

Now consider the following scheduling problem. Imagine you are a highly-in-demand actor, who has been presented with offers to star in n different movie projects under development. Each offer comes specified with the first and last day of filming. To take the job, you must commit to being available throughout this entire period. Thus you cannot simultaneously accept two jobs whose intervals overlap.

For an artist such as yourself, the criteria for job acceptance is clear: you want to make as much money as possible. Because each of these films pays the same fee per film, this implies you seek the largest possible set of jobs (intervals) such that no two of them conflict with each other.

For example, consider the available projects in Figure 1.5. We can star in at most four films, namely “Discrete” Mathematics, Programming Challenges, Calculated Bets, and one of either Halting State or Steiner’s Tree.

You (or your agent) must solve the following algorithmic scheduling problem:

Problem: Movie Scheduling Problem

Input: A set I of n intervals on the line.

Output: What is the largest subset of mutually non-overlapping intervals which can be selected from I?

You are given the job of developing a scheduling algorithm for this task. Stop right now and try to find one. Again, I’ll be happy to wait.

There are several ideas that may come to mind. One is based on the notion that it is best to work whenever work is available. This implies that you should start with the job with the earliest start date – after all, there is no other job you can work on, at least during the beginning of this period.


Figure 1.6: Bad instances for the (l) earliest job first and (r) shortest job first heuristics.

EarliestJobFirst(I)
        Accept the earliest starting job j from I which does not overlap any
        previously accepted job, and repeat until no more such jobs remain.

This idea makes sense, at least until we realize that accepting the earliest job might block us from taking many other jobs if that first job is long. Check out Figure 1.6(l), where the epic “War and Peace” is both the first job available and long enough to kill off all other prospects.

This bad example naturally suggests another idea. The problem with “War and Peace” is that it is too long. Perhaps we should start by taking the shortest job, and keep seeking the shortest available job at every turn. Maximizing the number of jobs we do in a given period is clearly connected to banging them out as quickly as possible. This yields the heuristic:

ShortestJobFirst(I)
        While (I ≠ ∅) do
                Accept the shortest possible job j from I.
                Delete j, and any interval which intersects j, from I.

Again this idea makes sense, at least until we realize that accepting the shortest job might block us from taking two other jobs, as shown in Figure 1.6(r). While the potential loss here seems smaller than with the previous heuristic, it can readily limit us to half the optimal payoff.

At this point, an algorithm where we try all possibilities may start to look good, because we can be certain it is correct. If we ignore the details of testing whether a set of intervals are in fact disjoint, it looks something like this:

ExhaustiveScheduling(I)
        j = 0
        S_max = ∅
        For each of the 2^n subsets S_i of intervals I
                If (S_i is mutually non-overlapping) and (size(S_i) > j)
                        then j = size(S_i) and S_max = S_i
        Return S_max

But how slow is it? The key limitation is enumerating the 2^n subsets of n things. The good news is that this is much better than enumerating all n! orders of n things, as proposed for the robot tour optimization problem. There are only about one million subsets when n = 20, which could be exhaustively counted within seconds on a decent computer. However, when fed n = 100 movies, 2^100 is much much greater than the 20! which made our robot cry “uncle” in the previous problem.
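To see what this enumeration looks like in real code, here is a C sketch of ExhaustiveScheduling that encodes each subset as the bits of an integer mask. The interval type and names are our own assumptions, and the approach is only workable for small n (say n ≤ 20):

#include <stdbool.h>

typedef struct {
    int start, end;              /* first and last day of filming */
} interval;

/* Exhaustive movie scheduling: tries all 2^n subsets, encoded as the bits
   of the mask s, and returns the largest mutually non-overlapping subset
   (as a bitmask). Runs in O(2^n * n^2) time. */
unsigned int exhaustive_schedule(interval movie[], int n)
{
    unsigned int s, best = 0;
    int i, j, count, bestcount = 0;

    for (s = 0; s < (1u << n); s++) {
        bool ok = true;
        count = 0;
        for (i = 0; i < n && ok; i++) {
            if (!(s & (1u << i))) continue;      /* movie i not in subset */
            count++;
            for (j = i + 1; j < n && ok; j++)    /* pairwise overlap test */
                if ((s & (1u << j)) &&
                    movie[i].start <= movie[j].end &&
                    movie[j].start <= movie[i].end)
                    ok = false;
        }
        if (ok && count > bestcount) {
            bestcount = count;
            best = s;
        }
    }
    return best;
}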

The difference between our scheduling and robotics problems is that there is an algorithm which solves movie scheduling both correctly and efficiently. Think about the first job to terminate—i.e., the interval x which contains the rightmost point which is leftmost among all intervals. This role is played by “Discrete” Mathematics in Figure 1.5. Other jobs may well have started before x, but all of these must at least partially overlap each other, so we can select at most one from the group. The first of these jobs to terminate is x, so any of the overlapping jobs potentially block out other opportunities to the right of it. Clearly we can never lose by picking x. This suggests the following correct, efficient algorithm:

OptimalScheduling(I)
        While (I ≠ ∅) do
                Accept the job j from I with the earliest completion date.
                Delete j, and any interval which intersects j, from I.
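The earliest-completion rule is also easy to implement. One natural approach (a sketch of ours, not the book's implementation) is to sort the intervals by completion date first, after which a single left-to-right sweep suffices:

#include <stdlib.h>

typedef struct {
    int start, end;              /* first and last day of filming */
} interval;

/* qsort comparator: order jobs by completion date. */
static int by_end(const void *a, const void *b)
{
    return ((const interval *) a)->end - ((const interval *) b)->end;
}

/* Optimal movie scheduling: repeatedly accept the unconflicted job with
   the earliest completion date. After sorting, each candidate needs only
   an O(1) comparison against the last accepted job, so the total time is
   O(n log n). Accepted jobs are copied into chosen[]; returns how many. */
int optimal_schedule(interval job[], int n, interval chosen[])
{
    int i, k = 0;

    qsort(job, n, sizeof(interval), by_end);
    for (i = 0; i < n; i++)
        if (k == 0 || job[i].start > chosen[k-1].end)   /* no overlap */
            chosen[k++] = job[i];
    return k;
}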

Ensuring the optimal answer over all possible inputs is a difficult but often achievable goal. Seeking counterexamples that break pretender algorithms is an important part of the algorithm design process. Efficient algorithms are often lurking out there; this book seeks to develop your skills to help you find them.

Take-Home Lesson: Reasonable-looking algorithms can easily be incorrect. Algorithm correctness is a property that must be carefully demonstrated.

1.3 Reasoning about Correctness

Hopefully, the previous examples have opened your eyes to the subtleties of algorithm correctness. We need tools to distinguish correct algorithms from incorrect ones, the primary one of which is called a proof.

A proper mathematical proof consists of several parts. First, there is a clear, precise statement of what you are trying to prove. Second, there is a set of assumptions of things which are taken to be true and hence used as part of the proof. Third, there is a chain of reasoning which takes you from these assumptions to the statement you are trying to prove. Finally, there is a little square (□) or QED at the bottom to denote that you have finished, representing the Latin phrase for “thus it is demonstrated.”

This book is not going to emphasize formal proofs of correctness, because they are very difficult to do right and quite misleading when you do them wrong. A proof is indeed a demonstration. Proofs are useful only when they are honest; crisp arguments explaining why an algorithm satisfies a nontrivial correctness property.

Correct algorithms require careful exposition, and efforts to show both correctness and not incorrectness. We develop tools for doing so in the subsections below.

1.3.1 Expressing Algorithms

Reasoning about an algorithm is impossible without a careful description of the sequence of steps to be performed. The three most common forms of algorithmic notation are (1) English, (2) pseudocode, or (3) a real programming language.

We will use all three in this book. Pseudocode is perhaps the most mysterious of the bunch, but it is best defined as a programming language that never complains about syntax errors. All three methods are useful because there is a natural tradeoff between greater ease of expression and precision. English is the most natural but least precise programming language, while Java and C/C++ are precise but difficult to write and understand. Pseudocode is generally useful because it represents a happy medium.

The choice of which notation is best depends upon which method you are most comfortable with. I usually prefer to describe the ideas of an algorithm in English, moving to a more formal, programming-language-like pseudocode or even real code to clarify sufficiently tricky details.

A common mistake my students make is to use pseudocode to dress up an ill-defined idea so that it looks more formal. Clarity should be the goal. For example, the ExhaustiveScheduling algorithm on page 10 could have been better written in English as:

ExhaustiveScheduling(I)
        Test all 2^n subsets of intervals from I, and return the largest
        subset consisting of mutually non-overlapping intervals.

Take-Home Lesson: The heart of any algorithm is an idea. If your idea is not clearly revealed when you express an algorithm, then you are using too low-level a notation to describe it.

1.3.2 Problems and Properties

We need more than just an algorithm description in order to demonstrate correctness. We also need a careful description of the problem that it is intended to solve.

Problem specifications have two parts: (1) the set of allowed input instances, and (2) the required properties of the algorithm’s output. It is impossible to prove the correctness of an algorithm for a fuzzily-stated problem. Put another way, ask the wrong problem and you will get the wrong answer.

Some problem specifications allow too broad a class of input instances. Suppose we had allowed film projects in our movie scheduling problem to have gaps in production (i.e., filming in September and November but a hiatus in October). Then the schedule associated with any particular film would consist of a given set of intervals. Our star would be free to take on two interleaving but not overlapping projects (such as the film above nested with one filming in August and October). The earliest completion algorithm would not work for such a generalized scheduling problem. Indeed, no efficient algorithm exists for this generalized problem.

Take-Home Lesson: An important and honorable technique in algorithm design is to narrow the set of allowable instances until there is a correct and efficient algorithm. For example, we can restrict a graph problem from general graphs down to trees, or a geometric problem from two dimensions down to one.

There are two common traps in specifying the output requirements of a problem. One is asking an ill-defined question. Asking for the best route between two places on a map is a silly question unless you define what best means. Do you mean the shortest route in total distance, or the fastest route, or the one minimizing the number of turns?

The second trap is creating compound goals. The three path-planning criteria mentioned above are all well-defined goals that lead to correct, efficient optimization algorithms. However, you must pick a single criterion. A goal like “Find the shortest path from a to b that doesn’t use more than twice as many turns as necessary” is perfectly well defined, but complicated to reason about and solve.

I encourage you to check out the problem statements for each of the 75 catalog problems in the second part of this book. Finding the right formulation for your problem is an important part of solving it. And studying the definition of all these classic algorithm problems will help you recognize when someone else has thought about similar problems before you.

1.3.3 Demonstrating Incorrectness

The best way to prove that an algorithm is incorrect is to produce an instance in which it yields an incorrect answer. Such instances are called counter-examples.

No rational person will ever leap to the defense of an algorithm after a counter-example has been identified. Very simple instances can instantly kill reasonable-looking heuristics with a quick touché. Good counter-examples have two important properties:

Verifiability – To demonstrate that a particular instance is a counter-example to a particular algorithm, you must be able to (1) calculate what answer your algorithm will give in this instance, and (2) display a better answer so as to prove the algorithm didn’t find it.

Since you must hold the given instance in your head to reason about it, an important part of verifiability is. . .


Simplicity – Good counter-examples have all unnecessary details boiled away. They make clear exactly why the proposed algorithm fails. Once a counter-example has been found, it is worth simplifying it down to its essence. For example, the counter-example of Figure 1.6(l) could be made simpler and better by reducing the number of overlapped segments from five to two.

Hunting for counter-examples is a skill worth developing. It bears some similarity to the task of developing test sets for computer programs, but relies more on inspiration than exhaustion. Here are some techniques to aid your quest:

Think small – Note that the robot tour counter-examples I presented boiled down to six points or less, and the scheduling counter-examples to only three intervals. This is indicative of the fact that when algorithms fail, there is usually a very simple example on which they fail. Amateur algorists tend to draw a big messy instance and then stare at it helplessly. The pros look carefully at several small examples, because they are easier to verify and reason about.

Think exhaustively – There are only a small number of possibilities for the smallest nontrivial value of n. For example, there are only three interesting ways two intervals on the line can occur: (1) as disjoint intervals, (2) as overlapping intervals, and (3) as properly nesting intervals, one within the other. All cases of three intervals (including counter-examples to both movie heuristics) can be systematically constructed by adding a third segment in each possible way to these three instances.

Hunt for the weakness – If a proposed algorithm is of the form “always take the biggest” (better known as the greedy algorithm), think about why that might prove to be the wrong thing to do. In particular, . . .

Go for a tie – A devious way to break a greedy heuristic is to provide instances where everything is the same size. Suddenly the heuristic has nothing to base its decision on, and perhaps has the freedom to return something suboptimal as the answer.

Seek extremes – Many counter-examples are mixtures of huge and tiny, left and right, few and many, near and far. It is usually easier to verify or reason about extreme examples than more muddled ones. Consider two tightly bunched clouds of points separated by a much larger distance d. The optimal TSP tour will be essentially 2d regardless of the number of points, because what happens within each cloud doesn’t really matter.

Take-Home Lesson:Searching for counterexamples is the best way to disprove the correctness of a heuristic.


1.3.4 Induction and Recursion

Failure to find a counterexample to a given algorithm does not mean “it is obvious” that the algorithm is correct. A proof or demonstration of correctness is needed. Often mathematical induction is the method of choice.

When I first learned about mathematical induction it seemed like complete magic. You proved a formula like ∑_{i=1}^{n} i = n(n+1)/2 for some basis case like 1 or 2, then assumed it was true all the way to n−1 before proving it was true for general n using the assumption. That was a proof? Ridiculous!

When I first learned the programming technique of recursion it also seemed like complete magic. The program tested whether the input argument was some basis case like 1 or 2. If not, you solved the bigger case by breaking it into pieces and calling the subprogram itself to solve these pieces. That was a program? Ridiculous!

The reason both seemed like magic is because recursion is mathematical induction. In both, we have general and boundary conditions, with the general condition breaking the problem into smaller and smaller pieces. The initial or boundary condition terminates the recursion. Once you understand either recursion or induction, you should be able to see why the other one also works.

I’ve heard it said that a computer scientist is a mathematician who only knows how to prove things by induction. This is partially true because computer scientists are lousy at proving things, but primarily because so many of the algorithms we study are either recursive or incremental.

Consider the correctness of insertion sort, which we introduced at the beginning of this chapter. The reason it is correct can be shown inductively:

The basis case consists of a single element, and by definition a one-element array is completely sorted.

In general, we can assume that the first n−1 elements of array A are completely sorted after n−1 iterations of insertion sort.

To insert one last element x to A, we find where it goes, namely the unique spot between the biggest element less than or equal to x and the smallest element greater than x. This is done by moving all the greater elements back by one position, creating room for x in the desired location.

One must be suspicious of inductive proofs, however, because very subtle reasoning errors can creep in. The first are boundary errors. For example, our insertion sort correctness proof above boldly stated that there was a unique place to insert x between two elements, when our basis case was a single-element array. Greater care is needed to properly deal with the special cases of inserting the minimum or maximum elements.

The second and more common class of inductive proof errors concerns cavalier extension claims. Adding one extra item to a given problem instance might cause the entire optimal solution to change. This was the case in our scheduling problem (see Figure 1.7). The optimal schedule after inserting a new segment may contain none of the segments of any particular optimal solution prior to insertion. Boldly ignoring such difficulties can lead to very convincing inductive proofs of incorrect algorithms.

Figure 1.7: Large-scale changes in the optimal solution (boxes) after inserting a single interval (dashed) into the instance

Take-Home Lesson: Mathematical induction is usually the right way to verify the correctness of a recursive or incremental insertion algorithm.

Stop and Think: Incremental Correctness

Problem: Prove the correctness of the following recursive algorithm for incrementing natural numbers, i.e., y → y + 1:

Increment(y)
        if y = 0 then return(1)
        else if (y mod 2) = 1 then
                return(2 · Increment(⌊y/2⌋))
        else return(y + 1)

Solution: The correctness of this algorithm is certainly not obvious to me. But as it is recursive and I am a computer scientist, my natural instinct is to try to prove it by induction.

The basis case of y = 0 is obviously correctly handled. Clearly the value 1 is returned, and 0 + 1 = 1.

Now assume the function works correctly for the general case of y = n − 1. Given this, we must demonstrate the truth for the case of y = n. Half of the cases are easy, namely the even numbers (for which (y mod 2) = 0), since y + 1 is explicitly returned.

For the odd numbers, the answer depends upon what is returned by Increment(⌊y/2⌋). Here we want to use our inductive assumption, but it isn’t quite right. We have assumed that Increment worked correctly for y = n − 1, but not for a value which is about half of it. We can fix this problem by strengthening our assumption to declare that the general case holds for all y ≤ n − 1. This costs us nothing in principle, but is necessary to establish the correctness of the algorithm.

Now, the case of odd y (i.e., y = 2m + 1 for some integer m) can be dealt with as:

        2 · Increment(⌊(2m+1)/2⌋) = 2 · Increment(⌊m + 1/2⌋)
                                  = 2 · Increment(m)
                                  = 2(m + 1)
                                  = 2m + 2 = y + 1

and the general case is resolved.
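For readers who would rather trace code than pseudocode, here is a direct C transcription of Increment (our own sketch; the floor is implicit in C's integer division):

/* Recursively compute y + 1 for a natural number y. When y is even, its
   low bit is 0, so y + 1 just sets that bit; when y is odd, we recurse on
   floor(y/2) and double the result. */
unsigned long increment(unsigned long y)
{
    if (y == 0)
        return 1;
    if (y % 2 == 1)
        return 2 * increment(y / 2);   /* odd: 2 * ((y div 2) + 1) */
    return y + 1;                      /* even: set the low bit */
}

For example, increment(7) yields 2 · increment(3) = 4 · increment(1) = 8 · increment(0) = 8, which is indeed 7 + 1.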

1.3.5 Summations

Mathematical summation formulae arise often in algorithm analysis, which we will study in Chapter 2. Further, proving the correctness of summation formulae is a classic application of induction. Several exercises on inductive proofs of summations appear as exercises at the end of this chapter. To make these more accessible, I review the basics of summations here.

Summation formulae are concise expressions describing the addition of an arbitrarily large set of numbers, in particular the formula

        ∑_{i=1}^{n} f(i) = f(1) + f(2) + . . . + f(n)

There are simple closed forms for summations of many algebraic functions. For example, since n ones is n,

        ∑_{i=1}^{n} 1 = n

The sum of the first n integers can be seen by pairing up the ith and (n−i+1)th integers:

        ∑_{i=1}^{n} i = ∑_{i=1}^{n/2} (i + (n−i+1)) = n(n+1)/2

Recognizing two basic classes of summation formulae will get you a long way in algorithm analysis:

Arithmetic progressions – We will encounter the arithmetic progression S(n) = ∑_{i=1}^{n} i = n(n+1)/2 in the analysis of selection sort. From the big picture perspective, the important thing is that the sum is quadratic, not that the constant is 1/2. In general,

        S(n, p) = ∑_{i=1}^{n} i^p = Θ(n^{p+1})

for p ≥ 1. Thus the sum of squares is cubic, and the sum of cubes is quartic (if you use such a word). The “big Theta” notation (Θ(x)) will be properly explained in Section 2.2.

For p < −1, this sum always converges to a constant, even as n → ∞. The interesting case is in between, resulting in. . .

Geometric series – In geometric progressions, the index of the loop affects the exponent, i.e.,

        G(n, a) = ∑_{i=0}^{n} a^i = (a^{n+1} − 1)/(a − 1)

How we interpret this sum depends upon the base of the progression, i.e., a. When a < 1, this converges to a constant even as n → ∞.

This series convergence proves to be the great “free lunch” of algorithm analysis. It means that the sum of a linear number of things can be constant, not linear. For example, 1 + 1/2 + 1/4 + 1/8 + . . . ≤ 2 no matter how many terms we add up.

When a > 1, the sum grows rapidly with each new term, as in 1 + 2 + 4 + 8 + 16 + 32 = 63. Indeed, G(n, a) = Θ(a^{n+1}) for a > 1.

Stop and Think: Factorial Formulae

Problem: Prove that ∑_{i=1}^{n} i × i! = (n+1)! − 1 by induction.

Solution: The inductive paradigm is straightforward. First verify the basis case (here we do n = 1, although n = 0 would be even more general):

        ∑_{i=1}^{1} i × i! = 1 = (1+1)! − 1 = 2 − 1 = 1

Now assume the statement is true up to n. To prove the general case of n+1, observe that rolling out the largest term

        ∑_{i=1}^{n+1} i × i! = (n+1) × (n+1)! + ∑_{i=1}^{n} i × i!

reveals the left side of our inductive assumption. Substituting the right side gives us

        ∑_{i=1}^{n+1} i × i! = (n+1) × (n+1)! + (n+1)! − 1
                             = (n+1)! × ((n+1) + 1) − 1
                             = (n+2)! − 1

This general trick of separating out the largest term from the summation to reveal an instance of the inductive assumption lies at the heart of all such proofs.
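Such identities are also easy to spot-check numerically before attempting a proof. The short C program below is our own sanity check, not part of the book; it verifies the identity for n up to 15, where the factorials still fit in 64 bits:

#include <stdio.h>

/* Check sum_{i=1}^{n} i * i! = (n+1)! - 1 for n = 1..15.
   16! still fits comfortably in an unsigned 64-bit integer. */
int main(void)
{
    unsigned long long factorial = 1, sum = 0;
    int i;

    for (i = 1; i <= 15; i++) {
        factorial *= i;                              /* now holds i!  */
        sum += (unsigned long long) i * factorial;   /* add i * i!    */
        printf("n=%2d  lhs=%llu  rhs=%llu\n",
               i, sum, factorial * (i + 1) - 1);     /* rhs = (i+1)!-1 */
    }
    return 0;
}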

1.4 Modeling the Problem

Modeling is the art of formulating your application in terms of precisely described, well-understood problems. Proper modeling is the key to applying algorithmic design techniques to real-world problems. Indeed, proper modeling can eliminate the need to design or even implement algorithms, by relating your application to what has been done before. Proper modeling is the key to effectively using the “Hitchhiker’s Guide” in Part II of this book.

Real-world applications involve real-world objects. You might be working on a system to route traffic in a network, to find the best way to schedule classrooms in a university, or to search for patterns in a corporate database. Most algorithms, however, are designed to work on rigorously defined abstract structures such as permutations, graphs, and sets. To exploit the algorithms literature, you must learn to describe your problem abstractly, in terms of procedures on fundamental structures.

1.4.1 Combinatorial Objects

Odds are very good that others have stumbled upon your algorithmic problem before you, perhaps in substantially different contexts. But to find out what is known about your particular “widget optimization problem,” you can’t hope to look in a book under widget. You must formulate widget optimization in terms of computing properties of common structures such as:

Permutations – which are arrangements, or orderings, of items. For example, {1,4,3,2} and {4,3,2,1} are two distinct permutations of the same set of four integers. We have already seen permutations in the robot optimization problem, and in sorting. Permutations are likely the object in question whenever your problem seeks an “arrangement,” “tour,” “ordering,” or “sequence.”

Subsets – which represent selections from a set of items. For example, {1,3,4} and {2} are two distinct subsets of the first four integers. Order does not matter in subsets the way it does with permutations, so the subsets {1,3,4} and {4,3,1} would be considered identical. We saw subsets arise in the movie scheduling problem. Subsets are likely the object in question whenever your problem seeks a “cluster,” “collection,” “committee,” “group,” “packaging,” or “selection.”


Figure 1.8: Modeling real-world structures with trees and graphs

Trees – which represent hierarchical relationships between items. Figure 1.8(a) shows part of the family tree of the Skiena clan. Trees are likely the object in question whenever your problem seeks a “hierarchy,” “dominance relationship,” “ancestor/descendant relationship,” or “taxonomy.”

Graphs – which represent relationships between arbitrary pairs of objects. Figure 1.8(b) models a network of roads as a graph, where the vertices are cities and the edges are roads connecting pairs of cities. Graphs are likely the object in question whenever you seek a “network,” “circuit,” “web,” or “relationship.”

Points – which represent locations in some geometric space. For example, the locations of McDonald’s restaurants can be described by points on a map/plane. Points are likely the object in question whenever your problems work on “sites,” “positions,” “data records,” or “locations.”

Polygons – which represent regions in some geometric spaces. For example, the borders of a country can be described by a polygon on a map/plane. Polygons and polyhedra are likely the object in question whenever you are working on “shapes,” “regions,” “configurations,” or “boundaries.”

Strings – which represent sequences of characters or patterns. For example, the names of students in a class can be represented by strings. Strings are likely the object in question whenever you are dealing with “text,” “characters,” “patterns,” or “labels.”

These fundamental structures all have associated algorithm problems, which are presented in the catalog of Part II. Familiarity with these problems is important, because they provide the language we use to model applications. To become fluent in this vocabulary, browse through the catalog and study the input and output pictures for each problem. Understanding these problems, even at a cartoon/definition level, will enable you to know where to look later when the problem arises in your application.


Figure 1.9: Recursive decompositions of combinatorial objects. (left column) Permutations, subsets, trees, and graphs. (right column) Point sets, polygons, and strings

Examples of successful application modeling will be presented in the war stories spaced throughout this book. However, some words of caution are in order. The act of modeling reduces your application to one of a small number of existing problems and structures. Such a process is inherently constraining, and certain details might not fit easily into the given target problem. Also, certain problems can be modeled in several different ways, some much better than others.

Modeling is only the first step in designing an algorithm for a problem. Be alert for how the details of your applications differ from a candidate model, but don’t be too quick to say that your problem is unique and special. Temporarily ignoring details that don’t fit can free the mind to ask whether they really were fundamental in the first place.

Take-Home Lesson:Modeling your application in terms of well-defined struc- tures and algorithms is the most important single step towards a solution.

1.4.2 Recursive Objects

Learning to think recursively is learning to look for big things that are made from smaller things of exactly the same type as the big thing. If you think of houses as sets of rooms, then adding or deleting a room still leaves a house behind.

Recursive structures occur everywhere in the algorithmic world. Indeed, each of the abstract structures described above can be thought about recursively. You just have to see how you can break them down, as shown in Figure 1.9:

Permutations – Delete the first element of a permutation of {1, . . . , n} things and you get a permutation of the remaining n−1 things. Permutations are recursive objects.
