You are currently browsing the archives for the Business Intelligence category.
| M | T | W | T | F | S | S |
|---|---|---|---|---|---|---|
| « Mar | ||||||
| 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| 8 | 9 | 10 | 11 | 12 | 13 | 14 |
| 15 | 16 | 17 | 18 | 19 | 20 | 21 |
| 22 | 23 | 24 | 25 | 26 | 27 | 28 |
| 29 | 30 | 31 | ||||
- Business Intelligence (5)
- Education (1)
- INFORMS (1)
- Intechné (4)
- Methodology (4)
- Operations Research (13)
- optimization (1)
- Probability Management (1)
- Risk (5)
- software (1)
- Teaching (1)
- Vision (4)
- 8 March 2009: Searching for Answers to Life’s Persistent Questions
- 6 March 2009: INFORMS 1.5
- 27 February 2009: Orthogonal Skills
- 18 February 2009: The Science of Better Search
- 13 February 2009: Living in Interesting Times
- 4 February 2009: Remembering the Master of All Trades
- 30 January 2009: Less is More
- 16 January 2009: Certifiably Analytic
- 9 January 2009: Whom the Gods Wish to Destroy, they First Call Risk-Protected
- 2 January 2009: Those Who Can, Teach
Archive for the Business Intelligence Category
Searching for Answers to Life’s Persistent Questions
8 March 2009 by Sanjay Saigal.
I recently mentioned LaserSearch, an engine for user-guided web search. Now comes the news that Steve Wolfram, among other things the inventor of Mathematica and author of A New Kind of Science (NKS), has also devised something new: a computation-driven semantically-aware knowledge search engine called Wolfram Alpha. (Was Alpha Wolfram taken?)
Here is Wolfram’s description of his approach:
… what about all the actual knowledge that we as humans have accumulated?
A lot of it is now on the web—in billions of pages of text. And with search engines, we can very efficiently search for specific terms and phrases in that text.
But we can’t compute from that. And in effect, we can only answer questions that have been literally asked before. We can look things up, but we can’t figure anything new out.
So how can we deal with that? Well, some people have thought the way forward must be to somehow automatically understand the natural language that exists on the web. Perhaps getting the web semantically tagged to make that easier.
But armed with Mathematica and NKS I realized there’s another way: explicitly implement methods and models, as algorithms, and explicitly curate all data so that it is immediately computable.
It’s not easy to do this. Every different kind of method and model—and data—has its own special features and character. But with a mixture of Mathematica and NKS automation, and a lot of human experts, I’m happy to say that we’ve gotten a very long way.
As with everything Wolfram touches, there is much that intrigues. Let’s suppose that Wolfram Alpha can answer a question of the following form: in which US counties did the Libertarian Party get more than 1% of the vote in the 2008 election cycle? If so, can it also compute for you the covariance of the Libertarian party’s electoral success (so to speak) with the prevalence of advanced degrees? And if it can do that, can it automatically estimate Libertarian votes in 2012 based on country demographic forecasts? Etc.
Taking the above process to its limit, can Wolfram Alpha – say, sometime in the distant future – become an all-purpose web-based Analytics engine?
Posted in Business Intelligence | 1 Comment »
Those Who Can, Teach
2 January 2009 by Sanjay Saigal.
Weighing the opportunity to teach Analytic Decision-Making to executive MBA students, I realized that the value in such a project is essentially internal. The financial compensation, to not put too fine a point on it, was low. But the chance to explore the tools of my profession with highly motivated and skeptical, yet captive, mid-career professionals was too attractive to pass up.
I’m teaching something called a “breadth” course. Not quite a part of the core curriculum, but necessary to fulfill requirements of the specialization, e.g., Technology Management or Business Analytics & Technologies. The catalog describes breadth courses as specially focused on relevance. That led to my first surprise: the course description includes Optimization and Decision Analysis (DA), but no mention of Simulation or Queuing! (Forecasting is covered in another breadth course, and further in an elective. Data Mining and Business Intelligence are covered in an entirely separate data management elective, as if they were not part of the same Analytics continuum.)
The absence of Simulation in the syllabus initially struck me as astonishing. In the ill-structured fuzz of real-world business decision-making, Monte Carlo Simulation is the inescapable Swiss Army knife. However, as I began evaluating alternative texts, I had an epiphany! Textbooks purporting to teach quantitative decision-making to business students are typically adaptations of texts originally designed to teach Operations Research (OR) to engineering and other technical majors. For instance, a widely-used text has an entire chapter dedicated to Minimum Spanning Trees, an artifact I have not had to explain to a non-technical client in seventeen years! Authors also persist in continuing to sacrifice thickets, if not entire forests, explaining theoretical notions such as duality and sensitivity to students who would much rather be off discussing consumer product strategies!
After rapidly filtering through a number of OR-centric treatments, my choice reduced to two. The text I did not choose, though only by a hair, is Data, Models, and Decisions, by Bertsemas and Freund (both at MIT). I was nearly swayed by the authors’ smarts in simplifying very complex material, and a near-perfect pedagogic progression – starting with DA, proceeding through probability and statistics to Simulation and Regression, and finally to Optimization. The exposition is case-oriented and jargon-avoiding. However, its immersive use of mathematics raises the possibility that a treatment designed for the very technical Sloan School students might not work as well for more diverse audiences.
The text I chose is Decision Making with Insight, by Sam Savage. For business school use, this text has three key advantages: First, its DNA is that of practical business decision-making, not OR technology. Its spare style makes it practically useful as a cookbook (after the class is over) in the vein of the Numerical Recipes series. Second, Savage – a thought leader in spreadsheet-based Analytics – almost exclusively uses Excel as his computational sandbox. He reasons that while spreadsheets have easily-reached limitations in analytic use, they are the business analyst’s tool of choice, which guarantees a faster uptake than an unnatural specialized environment like, say, Lindo. Finally, the exposition is playful and exploratory. It elevates insight derived by computational experimentation over traditional mathematical development. The business school classroom requires business engineering, not business mathematics.
The main weakness of Savage’s book relates to its Excel-centrism. The book is designed to turn the reader into a self-motivated desktop analyst. But Savage doesn’t even hint at the vast and valuable world of specialized analytical tools and applications that power large-scale systems such as airline revenue management to power plant operations. Since there are few pointers on the possibilities and limits of Analytics, the book elides the equally important role of the reader as a potential consumer of hardcore Enterprise-level OR technologies. (Bertsimas and Freund do a better job in this respect.) Fortunately, given my background in large-scale system development, supplementing the text should be straightforward. So this lacuna does not concern me.
Tune in again after Q1 of 2009 for a postmortem on my choice!
Posted in Teaching, Business Intelligence, Operations Research | 1 Comment »
Competing for Analytics
26 December 2008 by Sanjay Saigal.
The BBC recently reported on the long-simmering power struggle in Christiandom’s holiest site: the Church of the Holy Sepulchre in Jerusalem. This site – by tradition the place of Jesus Christ’s burial and resurrection – is contested by six denominations that occupy tiny bits of it: Roman Catholic, Greek Orthodox, Armenian Orthodox, Syrian Orthodox, Egyptian Copt and Ethiopian Orthodox. Each faction constantly maneuvers to improve its territory, their shenanigans (real and perceived) often boiling over in eruptions of violence.
The church itself is in a state of near-collapse. But since its ownership is in dispute, all attempts at repair are stymied. The situation is dismally Pareto optimal: any effort at improving matters makes at least one other party unhappy. The following heartrending yet funny anecdote captures the hopelessness of the situation:
The intractable nature of the territorial arguments over the site are epitomised by the short wooden ladder that rests on a ledge above the church’s main entrance.
It has been there since the 19th Century because rival groups cannot agree who has the right to take it down.
Under the Status Quo agreement, rights to the windows reached by the ladder belong to the Armenians, but the ledge below is controlled by the Greeks.
So the messiness persists, despite its ongoing cost to the Christian community. (Not to mention the potential catastrophic downside – complete collapse of the complex’s badly deteriorating roof.)
I was reminded of the Church of the Holy Sepulchre while navigating the recent brouhaha in the Analytics blogonook created by a post in the Intelligent Enterprise blog. Doug Henschen quizzes IBM’s Ambuj Goyal on Big Blue’s “analytics strategy” following its recent acquisition of Ilog, a leader in decision technology components (and my past employer). Henschen’s didactic frame is the notional question: “Will IBM Add Analytics to its Toolbelt?”
Henschen summarizes the interview’s takeaway in his lede as “[IBM contends that] predictive and statistical modeling — key offerings for the likes of SAS and SPSS — are overrated. IBM has what Goyal describes as better, cheaper alternatives in a mix of techniques developed for industry- and domain-specific challenges.” This startling conclusion has been met with (I’ll use a polite word) skepticism by Analytics-oriented blogs. James Taylor at the EDM blog smells the ghost of sales campaigns past:
Sadly this reminded me of the old days of IBM - when FUD (fear, uncertainty and doubt) was IBM’s reponse (sic!) to anything they did not do well. Predictive analytics are not overrated, at least not by anyone who understands them. It is true that predictive analytics, like all good technologies, are sometimes overused by over-enthusiastic supporters and that they can’t do everything. IBM’s lack of this technology is a mistake as without it their solution set is incomplete and no amount of FUD will change that.
Anne Milley at SAS’s company blog SAScom is predictably indignant in light of Goyal’s presumed attack on SAS’s knitting. Milley labels Goyal’s comments “dizzying spin” and suggests that IBM execs “deride the value [of Analytics] because they haven’t been able to monetize the analytics in their research labs even as others achieve significant returns”.
The BBC report came to mind because, as in Jerusalem, the disagreement occasioned by Goyal’s interview is fundamentally doctrinal. It centers on the existential (and ungrammatical) question – what is Analytics?
I have previously mentioned Tom Davenport’s HBR article called “Competing on Analytics”. (Davenport has also published a subsequent book with this title.) Davenport never quite defines Analytics, but his view is obviously expansive; his keystone success story is Marriott Corporation, whose Total Hotel Optimization system relies, at its core on Linear Programming (i.e., Optimization). Davenport sows confusion by using interchangeably using Analytics and labels like “statistical masters”. Analytics is not Statistics. Statistics is a tool in the arsenal of the Analytics professional. But it doesn’t describe the category.
The SAS Institute has adopted the branding framework of Analytics to compete in the Decision Management space. Historically the purveyor of a statistical toolkit (which later morphed into a Data Warehouse platform,) SAS had weak or non-existent offerings in Optimization and Inferencing/Rules, and came late to the Business Intelligence and Predictive Analytics (BI/PA) wave. So its stance of statistical methods as somehow defining Analytics is natural.
Conversely, prior to the Cognos acquisition, IBM had negligible footprint in the BI/PA space. In terms of Optimization, while boasting incredibly qualified R&D and Consulting groups, it has been unable to develop a profitable software offering. (In the ‘80s IBM attempted to market an optimization toolkit called OSL, made many sub-optimal decisions along the way, and finally killed the product in the late ‘90s. More recently, IBM has tried to leverage an open source toolkit called COIN-OR as a brand-builder, in my opinion essentially futilely.) The acquisition of Ilog provides IBM immediately with best-of-breed offerings in Optimization and Inferencing (or Business Rules). However, it still lacks anything like SAS’s statistical platform. Goyal’s pitch suggests that they know it. And they are trying to advance the notion that what they have is what the market needs.
As is not uncommon, differences born of necessity are being painted in Marketing’s primary colors to manipulate each vendor’s “rightness”. While this makes perfect sense to advance each party’s immediate business imperatives, it does not help the cause of Analytics in the Enterprise. James Taylor correctly observes that:
decisionBusiness rules, optimization, data mining, predictive analytics and adaptive control [are all] necessary ingredients for Enterprise Decision Management and business success.
Indeed. Let me illustrate with an Analytics success story. In a highly successful Space and Assortment Planning project at a greeting card manufacturer, most of the techniques in Taylor’s list were invoked. SAS was used to derive the space elasticity curves that drove revenue forecasts, CPLEX was used to optimized a complex sequence of layout models, and JRules was used to manage scenarios, preferences and exceptions. As the then Ilog project lead, I cannot recall a discussion where we, or any technology partisan on the large project team, attempted to push a technology agenda. The team uniformly focused on creating the fastest-performing, most accurate and most efficiently usable application.
Along similarly cooperative lines, instead of advocating the relative importance of one’s own bag of tricks and denigrating the competition’s, actors in the Analytics ecosystem need to adopt Davenport’s catholic approach. Let’s focus on expanding the impact of the field, on making it as pervasive a function in business as Accounting or Human Resources. To mash up the rallying cry of Bill Clinton’s 1992 campaign with the verbal stylings of the logorrheic Sarah Palin, “It’s all of the above, stupid”!
Posted in Business Intelligence, Operations Research, Vision, Intechné | 1 Comment »
Probability Management
12 December 2008 by Sanjay Saigal.
In recent weeks I have been working with Sam Savage, well-known OR personality and a consulting professor at Stanford. We’re focusing on developing a practice framework for Probability Management. Whazzat, you ask? In sum, Probability Management is all about robust decision-making in the presence of uncertainty. (Pretty much the vision for Intechné!)
Since real world decision problems are almost always ill-structured and fuzzy, our tools of choice belong to the worlds of simulation, and statistical visualization. Stochastic optimization plays a role too, but in a very different form than typically understood, say, in Operations Research circles. In general, we are not interested in creating IT systems that generate “best possible” recommendations. Rather, we enable managers to interactively explore the decision space of good solutions, using something similar to a business intelligence (BI) approach. The key difference between BI and Probability Management is that while BI is essentially descriptive (identifying multi-factorial relationships, typically for historical data) Probability Management is prescriptive: our clients learn what to do better.
I intend to write further on this topic, but for now let me point interested readers to our newly redesigned web site. The organization is a loose consortium of academic and commercial folks involved in the field, as vendors, users, and advisors. Check out the Interact! tab. It contains illustrative Excel models that describe the relevant concepts far better than long-winded descriptions. If you find it interesting and wish to discuss further, contact me.
Posted in Probability Management, Business Intelligence, Intechné, Risk | 1 Comment »
Business Intelligence and Operations Research
6 May 2008 by Sanjay Saigal.
Mike Trick writes about mutually assured incomprehension between the Business Intelligence (BI) and Operations Research (OR) communities. The two provide similar sounding approaches to intelligent decision-making, but appear to exist in parallel universes in terms of practice and research. He ends by expressing the hope that:
…OR people should see the BI community as a great source of problems and inspiration (and should make an effort to learn their language). But BI will inevitably use non-OR methods for some of their issues, so is rightly not “the same as” OR. But we as a field should know more about what they are doing if we are going to be part of this business direction.
Can BI and OR cooperate for mutual benefit? I look at the issue from the perspective of two technology experiences.
A traveling salesman story
The baffling non-communication between BI and OR immediately reminds me of Traveling Salesman Problem (TSP) research back in the late 80s. The OR (Groestchel, Cook, and others using polyhedral combinatorics), Computer Science (Bell Labs folks using Lin-Kernighan type heuristics) and Artificial Intelligence (using ANN, Simulated Annealing and related “bio-physical” approaches) communities were all working on the same problem, with little cross-pollination. I was fortunate to attend a conference at Rice University that invited leading lights from camp to develop a holistic vision of TSP research.
As a graduate student, I recall sharp surprise by how the limitations and strengths of each school drove its research. The AI folks armed with elegant learning-oriented methods, focused on small – say a few dozen cities – instances, often drawn directly from the real world. Their algorithms would solve series of iteratively changing instances, reliably and fast. Though there was little emphasis on mathematical optimality, their results were accessible to non-expert readers.
The CS folks were at the other end of the size spectrum. They were interested in instances drawn from telecom networks, with millions of cities, which needed to be solved in seconds. Their algorithms were essentially heuristics, so their results focused on empirical performance measures – solve time and distance from optimality. The application areas often involved real-time setting, so narrowing the range of outcome variability was the key research driver.
The OR approach – based largely on cutting plane methods for integer programming – implicitly assumed that provable optimality was the goal. The focus was on solving larger and larger problems to optimality ever faster. At that time, if I recall correctly, the largest solved involved a few thousand cities. However, the methods were extremely non-robust. There were many small “deviant” instances, for which known cutting planes were insufficient. Even worse, within any instance class and size, there could be 10x (or more) variability in solution times.
I recall that by the end of the conference, each research school far better appreciated the others’ methods. For example, in my own research (on a different but related problem) I discovered that a hybrid technique combining a randomized Lin-Kernighan type “kick” heuristic with a cutting plane method worked best. In the broader research community however, the TSP dialog did not lead to a common “business direction”. Perhaps this was due to the mechanics of the research enterprise, which can be quite territorial.
However, it is possible for commonality of goals to induce a commonality of trajectory. This is evident from the CPLEX experience at Ilog.
CPLEX et Ilog
In the late 90s, Ilog was an emerging vendor of optimization tools to the business community. Faced with the challenge of expanding beyond its European base, it found that its bread-and-butter CS-based optimization technology – Constraint Programming – was little known and unappreciated in the US. There, OR techniques such as linear and integer programming were pervasive. A small company called CPLEX Optimization was the leading US vendor. Though the initial idea was for Ilog to license and embed CPLEX technology, Ilog ended up acquiring the company.
The CPLEX acquisition created a host of integration issues for Ilog. Most of them are irrelevant to our story, but a key problem was the mix of mutual incomprehension and suspicion between the two technical groups. The Ilog folks, hailing from a CS background and flush with a string of European successes based on Constraint Programming (CP), viewed OR methods as non-robust and old school. (Constraint Programming gained prominence in the 1980s in the AI community. The simplex method in OR dates all the way back to 1948!) CPLEX folks had a hard time moving beyond the fact that CP offered neither provable optimality nor guaranteed solve times.
As CPLEX rose to become Ilog’s most successful product over the years, an interesting convergence occurred. CPLEX increasingly incorporated CP-like heuristics into its core engine. And CP product development increasingly became driven by CPLEX’s strengths: fast performance and large-scale numerical robustness. Ilog also developed products incorporating CPLEX and CP, creating offerings still without competition in the marketplace. Today, tensions between the two product lines continue to generate tectonic rumbles. But ten years down the road, ILOG has surmounted the original mutual incomprehension between the two teams and grown to the premier player in its space.
BI and OR: Quo Vadis?
In TSP research, though all parties shared goals, convergence did not occur in any meaningful sense. Within Ilog, CP and CPLEX were able to cross-pollinate and cooperate to mutual benefit. What are we to make of these two cases?
One difference, perhaps the key difference, is that the cooperative motivations for TSP researchers were weak. Research is a Darwinian enterprise; schools of knowledge play in a zero-sum sandbox. At Ilog, it was clear that CPLEX needed to succeed for Ilog to grow. Shared purpose – helped along by managerial directives – drove the cooperation of two technologies.
On the face of it, the BI/OR situation is more akin to the TSP case: there is (in Al Gore’s evocative phrase) no controlling authority forcing cooperation. However, from the OR perspective, the situation is far more fraught. The OR profession is in a period of existential self-reflection. What is the proper domain of OR? Is OR mainly a research area or is it a profession (or both)? If it’s a profession, should it include the usual attributes such as licensure, professional closure, and a professional code of conduct?
In contrast, BI has captured much of the “intelligent decision-making” mind-share in the business world in a very short period of time. And all that, on the basis of market-ready IT-aware technologies, with a less established research base. The strengths and weaknesses of the two communities are starkly complementary. And as the need for smart IT expands, the multiplicative impact of cooperation (in whatever form) has inescapable logic.
Posted in Business Intelligence, Operations Research | 1 Comment »