Marketing Deal Offers

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Tuesday, 27 November 2007

Just How Scalable Is QlikTech?

Posted on 14:07 by Unknown
A few days ago, I replied to a question regarding QlikTech scalability. (See What Makes QlikTech So Good?, August 3, 2007) I asked QlikTech itself for more information on the topic but haven’t learned anything new. So let me simply discuss this based on my own experience (and, once again, remind readers that while my firm is a QlikTech reseller, comments in this blog are strictly my own.)

The first thing I want to make clear is that QlikView is a wonderful product, so it would be a great pity if this discussion were to be taken as a criticism. Like any product, QlikView works within limits that must be understood to use it appropriately. No one benefits from unrealistic expectations, even if fans like me sometimes create them.

That said, let’s talk about what QlikTech is good at. I find two fundamental benefits from the product. The first is flexibility: it lets you analyze data in pretty much any way you want, without first building a data structure to accommodate your queries. By contrast, most business intelligence tools must pre-aggregate large data sets to deliver fast response. Often, users can’t even formulate a particular query if the dimensions or calculated measures were not specified in advance. Much of the development time and cost of conventional solutions, whether based in standard relational databases or specialized analytical structures, is spent on this sort of work. Avoiding it is the main reason QlikTech is able to deliver applications so quickly.

The other big benefit of QlikTech is scalability. I can work with millions of records on my desktop with the 32-bit version of the system (maximum memory 4 GB if your hardware allows it) and still get subsecond response. This is much more power than I’ve ever had before. A 64-bit server can work with tens or hundreds of millions of rows: the current limit for a single data set is apparently 2 billion rows, although I don’t know how close anyone has come to that in the field. I have personally worked with tables larger than 60 million rows, and QlikTech literature mentions an installation of 300 million rows. I strongly suspect that larger ones exist.

So far so good. But here’s the rub: there is a trade-off in QlikView between really big files and really great flexibility. The specific reason is that the more interesting types of flexibility often involve on-the-fly calculations, and those calculations require resources that slow down response. This is more a law of nature (there’s no free lunch) than a weakness in the product, but it does exist.

Let me give an example. One of the most powerful features of QlikView is a “calculated dimension”. This lets reports construct aggregates by grouping records according to ad hoc formulas. You might want to define ranges for a value such as age, income or unit price, or create categories using if/then/else statements. These formulas can get very complex, which is generally a good thing. But each formula must be calculated for each record every time it is used in a report. On a few thousand rows, this can happen in an instant, but on tens of millions of rows, it can take several minutes (or much longer if the formula is very demanding, such as on-the-fly ranking). At some point, the wait becomes unacceptable, particularly for users who have become accustomed to QlikView’s typically-immediate response.

As problems go, this isn’t a bad one because it often has a simple solution: instead of on-the-fly calculations, precalculate the required values in QlikView scripts and store the results on each record. There’s little or no performance cost to this strategy since expanding the record size doesn’t seem to slow things down. The calculations do add time to the data load, but that happens only once, typically in an unattended batch process. (Another option is to increase the number and/or speed of processors on the server. QlikTech makes excellent use of multiple processors.)

The really good news is you can still get the best of both worlds: work out design details with ad hoc reports on small data sets; then, once the design is stabilized, add precalculations to handle large data volumes. This is vastly quicker than prebuilding everything before you can see even a sample. It’s also something that’s done by business analysts with a bit of QlikView training, not database administrators or architects.

Other aspects of formulas and database design also more important in QlikView as data volumes grow larger. The general solution is the same: make the application more efficient through tighter database and report design. So even though it’s true that you can often just load data into QlikView and work with it immediately, it’s equally true that very large or sophisticated applications may take some tuning to work effectively. In other words, QlikView is not pure magic (any result you want for absolutely no work), but it does deliver much more value for a given amount of work than conventional business intelligence systems. That’s more than enough to justify the system.

Interestingly, I haven’t found that the complexity or over-all size of a particular data set impacts QlikView performance. That is, removing tables which are not used in a particular query doesn’t seem to speed up that query, nor does removing fields from tables within the query. This probably has to do with QlikTech’s “associative” database design, which treats each field independently and connects related fields directly to each other. But whatever the reason, most of the performance slow-downs I’ve encountered seem related to processing requirements.

And, yes, there are some upper limits to the absolute size of a QlikView implementation. Two billions rows is one, although my impression (I could be wrong) is that could be expanded if necessary. The need to load data into memory is another limit: even though the 64-bit address space is effectively infinite, there are physical limits to the amount of memory that can be attached to Windows servers. (A quick scan of the Dell site finds a maximum of 128 GB.) This could translate into more input data, since QlikView does some compression. At very large scales, processing speed will also impose a limit . Whatever the exact upper boundary, it’s clear that no one will be loading dozens of terabytes into QlikView any time soon. It can certainly be attached a multi-terabyte warehouse, but would have to work with multi-gigabyte extracts. For most purposes, that’s plenty.

While I’m on the topic of scalability, let me repeat a couple of points I made in the comments on the August post. One addresses the notion that QlikTech can replace a data warehouse. This is true in the sense that QlikView can indeed load and join data directly from operational systems. But a data warehouse is usually more than a federated view of current operational tables. Most warehouses include data integration to link otherwise-disconnected operational data. For example, customer records from different systems often can only be linked through complex matching techniques because there is no shared key such as a universal customer ID. QlikView doesn’t offer that kind of matching. You might be able to build some of it using QlikView scripts, but you’d get better results at a lower cost from software designed for the purpose.

In addition, most warehouses store historical information that is not retained in operational systems. A typical example is end-of-month account balance. Some of these values can be recreated from transaction details but it’s usually much easier just to take and store a snapshot. Other data may simply be removed from operational systems after a relatively brief period. QlikView can act as a repository for such data: in fact, it’s quite well suited for this. Yet in such cases, it’s probably more accurate to say that QlikView is acting as the data warehouse than to say a warehouse is not required.

I hope this clarifies matters without discouraging anyone from considering QlikTech. Yes QlikView is a fabulous product. No it won’t replace your multi-terabyte data warehouse. Yes it will complement that warehouse, or possibly substitute for a much smaller one, by providing a tremendously flexible and efficient business intelligence system. No it won’t run itself: you’ll still need some technical skills to do complicated things on large data volumes. But for a combination of speed, power, flexibility and cost, QlikTech can’t be beat.
Email ThisBlogThis!Share to XShare to Facebook
Posted in analysis systems, analytics tools, business intelligence, database technology, qliktech, qlikview | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • eBay Offers $2.4 Billion for GSI Commerce: More Support for Marketing Automation
    eBay ’s $2.4 billion offer for e-commerce services giant GSI Commerce has been described largely in terms of helping eBay to compete with ...
  • Selligent Brings a New B2C Marketing Automation Option to the U.S.
    I’m writing this post on my old DOS-based WordPerfect software, to get in the proper mood for discussing business-to-consumer marketing auto...
  • Infer Keeps It Simple: B2B Lead Scores and Nothing Else
    I’ve nearly finished gathering information from vendors for my new study on Customer Data Platform systems and have started to look for patt...
  • 4 Marketing Tech Trends To Watch in 2014
    I'm not a big fan of year-end summaries and forecasts, mostly because I produce summaries and forecasts all year round.  But I pulled to...
  • NICE Buys Causata to Extend Its Customer Experience Management Position
    So, there I was around 7:30 Eastern time this morning, sending out reminder notices to vendors I need to interview for an upcoming report on...
  • thinkAnalytics Helps Marketers Optimize Customer Treatments
    Summary: thinkAnalytics provides a robust decision engine to help make optimal recommendations across channels. Too bad more people don...
  • Vocus Marketing Suite: Still Mostly Social But Marketing Automation is On the Way
    If you’ve heard of Vocus at all, it’s probably as vendor serving public relations professionals. Its core offerings include a huge databas...
  • So Many Measures, So Little Time
    I’ve been collating lists of marketing performance metrics from different sources, which is exactly as much fun as it sounds. One result th...
  • Marketo Raises Another $50 Million: Where Does the Money Go?
    Marketo this morning announced a new $50 million funding round, almost exactly one year to the day after raising $25 million in November 2...
  • Advanced Analytics and Still More Reasons I Love QlikView
    I’m at the National Center for Database Marketing Conference this week. NCDM is always a good place to get a feel for what’s on people’s mi...

Categories

  • [x+1]
  • 1010Data
  • 2009 trends
  • 2010 predictions
  • 2011 predictions
  • 2013 marketing automation revenues
  • 2014 predictions
  • account data in marketing systems
  • acquisitions
  • acquistions
  • act-on software
  • active conversion
  • activeconversion
  • acxiom
  • ad agencies
  • ad servers
  • adam needles
  • adobe
  • adometry
  • advertising effectiveness
  • advocate management
  • affiliate marketing
  • agilone
  • aida model
  • aimatch
  • algorithmic attribution
  • alterian
  • analysis systems
  • analytical database
  • analytical databases
  • analytical systems
  • analytics tools
  • app exchange
  • app marketplace
  • application design
  • aprimo
  • are
  • artificial intelligence
  • ascend2
  • asset management
  • assetlink
  • atg
  • attribution analysis
  • attribution models
  • automated decisions
  • automated dialog
  • automated modeling
  • autonomy
  • b2b demand generation
  • b2b demand generation systems
  • b2b email marketing benchmarks
  • b2b lead scoring
  • b2b marketing
  • b2b marketing automation
  • b2b marketing automation industry consolidation
  • b2b marketing automation industry growth rate
  • b2b marketing automation revenues
  • b2b marketing automation systems
  • b2b marketing automation vendor rankings
  • b2b marketing data
  • b2b marketing industry consolidation
  • b2b marketing strategy
  • b2b marketing system comparison
  • b2c marketing automation
  • b2c marketing automation vendors
  • balanced scorecard
  • balihoo
  • barriers to marketing success
  • barry devlin
  • beanstalk data
  • behavior detection
  • behavior identification
  • behavior targeting
  • behavioral data
  • behavioral targeting
  • big data
  • birst
  • bislr
  • blogging software
  • brand experience
  • brand marketing
  • business intelligence
  • business intelligence software
  • business intelligence systems
  • business marketing
  • businses case
  • callidus
  • campaign flow
  • campaign management
  • campaign management software
  • causata
  • cdi
  • cdp
  • channel management
  • channel marketing
  • channel partner management
  • chordiant
  • cio priorities
  • clickdimensions
  • clicksquared
  • clientxclient
  • cloud computing
  • cmo surveys
  • cms
  • collaboration software
  • column data store
  • column-oriented database
  • columnar database
  • community management
  • compare marketing automation vendors
  • compiled data
  • complex event processing
  • consumer marketing
  • contact center systems
  • content aggregation
  • content distribution
  • content grazing
  • content management
  • content marketing
  • content matrix
  • content recommendations
  • content selections
  • content syndication
  • context automation
  • conversen
  • coremetrics
  • crm
  • crm integration
  • CRM lead scores
  • crm software
  • crm systems
  • crmevolution
  • cross-channel marketing
  • crowd sourcing
  • custom content
  • custom media
  • customer database
  • customer analysis
  • customer data
  • customer data integration
  • customer data management
  • customer data platform
  • customer data platforms
  • customer data quality
  • customer data warehouse
  • customer database
  • customer experience
  • customer experience management
  • customer experience matrix
  • customer information
  • customer management
  • customer management software
  • customer management systems
  • customer metrics
  • customer relationship management
  • customer satisfaction
  • customer success
  • customer support
  • cxc matrix
  • dashboards
  • data analysis
  • data cleaning
  • data cleansing
  • data enhancement
  • data integration
  • data loading
  • data mining
  • data mining and terrorism
  • data quality
  • data transformation tools
  • data visualization
  • data warehouse
  • database management
  • database marketing
  • database marketing systems
  • database technology
  • dataflux
  • datallegro
  • datamentors
  • david raab
  • david raab webinar
  • david raab whitepaper
  • day software
  • decision engiens
  • decision engines
  • decision management
  • decision science
  • dell
  • demand generation
  • demand generation implementation
  • demand generation industry
  • demand generation industry growth rate
  • demand generation industry size
  • demand generation industry trends
  • demand generation marketbright
  • demand generation marketing automation
  • demand generation software
  • demand generation software revenues
  • demand generation systems
  • demand generation vendors
  • demandforce
  • digiday
  • digital marketing
  • digital marketing systems
  • digital messaging
  • distributed marketing
  • dmp
  • dreamforce
  • dreamforce 2012
  • dynamic content
  • ease of use
  • ebay
  • eglue
  • eloqua
  • eloqua10
  • elqoua ipo
  • email
  • email marketing
  • email service providers
  • engagement engine
  • enteprise marketing management
  • enterprise decision management
  • enterprise marketing management
  • enterprise software
  • entiera
  • epiphany
  • ETL
  • eTrigue
  • event detection
  • event stream processing
  • event-based marketing
  • exacttarget
  • facebook
  • feature checklists
  • flow charts
  • fractional attribution
  • freemium
  • future of marketing automation
  • g2crowd
  • gainsight
  • Genius.com
  • genoo
  • geotargeting
  • gleanster
  • governance
  • grosocial
  • gsi commerce
  • high performance analytics
  • hiring consultants
  • hosted software
  • hosted systems
  • hubspot
  • ibm
  • impact of internet on selling
  • importance of sales execution
  • in-memory database
  • in-site search
  • inbound marketing
  • industry consolidation
  • industry growth rate
  • industry size
  • industry trends
  • influitive
  • infor
  • information cards
  • infusioncon 2013
  • infusionsoft
  • innovation
  • integrated customer management
  • integrated marketing management
  • integrated marketing management systems
  • integrated marketing systems
  • integrated systems
  • intent measurement
  • interaction advisor
  • interaction management
  • interestbase
  • interwoven
  • intuit
  • IP address lookup
  • jbara
  • jesubi
  • king fish media
  • kwanzoo
  • kxen
  • kynetx
  • large company marketing automation
  • last click attribution
  • lead capture
  • lead generation
  • lead management
  • lead management software
  • lead management systems
  • lead managment
  • lead ranking
  • lead scoring
  • lead scoring models
  • leadforce1
  • leadformix
  • leading marketing automation systems
  • leadlander
  • leadlife
  • leadmd
  • leftbrain dga
  • lifecycle analysis
  • lifecycle reporting
  • lifetime value
  • lifetime value model
  • local marketing automation
  • loopfuse
  • low cost marketing software
  • low-cost marketing software
  • loyalty systems
  • lyzasoft
  • makesbridge
  • manticore technology
  • mapreduce
  • market consolidation
  • market software
  • market2lead
  • marketbight
  • marketbright
  • marketgenius
  • marketing analysis
  • marketing analytics
  • marketing and sales integration
  • marketing automation
  • marketing automation adoption
  • marketing automation benefits
  • marketing automation consolidation
  • marketing automation cost
  • marketing automation deployment
  • marketing automation features
  • marketing automation industry
  • marketing automation industry growth rate
  • marketing automation industry trends
  • marketing automation market share
  • marketing automation market size
  • marketing automation maturity model
  • marketing automation net promoter score. marketing automation effectiveness
  • marketing automation pricing
  • marketing automation software
  • marketing automation software evaluation
  • marketing automation success factors
  • marketing automation system deployment
  • marketing automation system evaluation
  • marketing automation system features
  • marketing automation system selection
  • marketing automation system usage
  • marketing automation systems
  • marketing automation trends
  • marketing automation user satisfaction
  • marketing automation vendor financials
  • marketing automation vendor selection
  • marketing automation vendor strategies
  • marketing automion
  • marketing best practices
  • marketing cloud
  • marketing content
  • marketing data
  • marketing data management
  • marketing database
  • marketing database management
  • marketing education
  • marketing execution
  • marketing funnel
  • marketing integration
  • marketing lead stages
  • marketing management
  • marketing measurement
  • marketing mix models
  • marketing operating system
  • marketing operations
  • marketing optimization
  • marketing performance
  • marketing performance measurement
  • marketing platforms
  • marketing priorities
  • marketing process
  • marketing process optimization
  • marketing resource management
  • marketing return on investment
  • marketing ROI
  • marketing sales alignment
  • marketing service providers
  • marketing services
  • marketing services providers
  • marketing skills gap
  • marketing software
  • marketing software evaluation
  • marketing software industry trends
  • marketing software product reviews
  • marketing software selection
  • marketing software trends
  • marketing softwware
  • marketing suites
  • marketing system architecture
  • marketing system evaluation
  • marketing system ROI
  • marketing system selection
  • marketing systems
  • marketing technology
  • marketing tests
  • marketing tips
  • marketing to sales alignment
  • marketing training
  • marketing trends
  • marketing-sales integration
  • marketingpilot
  • marketo
  • marketo funding
  • marketo ipo
  • master data management
  • matching
  • maturity model
  • meaning based marketing
  • media mix models
  • message customization
  • metrics
  • micro-business marketing software
  • microsoft
  • microsoft dynamics crm
  • mid-tier marketing systems
  • mindmatrix
  • mintigo
  • mma
  • mobile marketing
  • mpm toolkit
  • multi-channel marketing
  • multi-language marketing
  • multivariate testing
  • natural language processing
  • neolane
  • net promoter score
  • network link analysis
  • next best action
  • nice systems
  • nimble crm
  • number of clients
  • nurture programs
  • officeautopilot
  • omnichannel marketing
  • omniture
  • on-demand
  • on-demand business intelligence
  • on-demand software
  • on-premise software
  • online advertising
  • online advertising optimization
  • online analytics
  • online marketing
  • open source bi
  • open source software
  • optimization
  • optimove
  • oracle
  • paraccel
  • pardot
  • pardot acquisition
  • partner relationship management
  • pay per click
  • pay per response
  • pedowitz group
  • pegasystems
  • performable
  • performance marketing
  • personalization
  • pitney bowes
  • portrait software
  • predictive analytics
  • predictive lead scoring
  • predictive modeling
  • privacy
  • prospect database
  • prospecting
  • qliktech
  • qlikview
  • qlikview price
  • raab guide
  • raab report
  • raab survey
  • Raab VEST
  • Raab VEST report
  • raab webinar
  • reachedge
  • reachforce
  • real time decision management
  • real time interaction management
  • real-time decisions
  • real-time interaction management
  • realtime decisions
  • recommendation engines
  • relationship analysis
  • reporting software
  • request for proposal
  • reseller marketing automation
  • response attribution
  • revenue attribution
  • revenue generation
  • revenue performance management
  • rfm scores
  • rightnow
  • rightwave
  • roi reporting
  • role of experts
  • rule-based systems
  • saas software
  • saffron technology
  • sales automation
  • sales best practices
  • sales enablement
  • sales force automation
  • sales funnel
  • sales lead management association
  • sales leads
  • sales process
  • sales prospecting
  • salesforce acquires exacttarget
  • salesforce.com
  • salesgenius
  • sap
  • sas
  • score cards
  • search engine optimization
  • search engines
  • self-optimizing systems
  • selligent
  • semantic analysis
  • semantic analytics
  • sentiment analysis
  • service oriented architecture
  • setlogik
  • setlogik acquisition
  • silverpop
  • silverpop engage
  • silverpop engage b2b
  • simulation
  • sisense prismcubed
  • sitecore
  • small business marketing
  • small business software
  • smarter commerce
  • smartfocus
  • soa
  • social campaign management
  • social crm
  • social marketing
  • social marketing automation
  • social marketing management
  • social media
  • social media marketing
  • social media measurement
  • social media monitoring
  • social media roi
  • social network data
  • software as a service
  • software costs
  • software deployment
  • software evaluation
  • software satisfaction
  • software selection
  • software usability
  • software usability measurement
  • Spredfast
  • stage-based measurement
  • state-based systems
  • surveillance technology
  • sweet suite
  • swyft
  • sybase iq
  • system deployment
  • system design
  • system implementation
  • system requirements
  • system selection
  • tableau software
  • technology infrastructure
  • techrigy
  • Tenbase
  • teradata
  • test design
  • text analysis
  • training
  • treehouse international
  • trigger marketing
  • twitter
  • unica
  • universal behaviors
  • unstructured data
  • usability assessment
  • user interface
  • vendor comparison
  • vendor evaluation
  • vendor evaluation comparison
  • vendor rankings
  • vendor selection
  • vendor services
  • venntive
  • vertica
  • visualiq
  • vocus
  • vtrenz
  • web analytics
  • web contact management
  • Web content management
  • web data analysis
  • web marketing
  • web personalization
  • Web site design
  • whatsnexx
  • woopra
  • youcalc
  • zoho
  • zoomix

Blog Archive

  • ►  2013 (55)
    • ►  December (4)
    • ►  November (5)
    • ►  October (4)
    • ►  September (3)
    • ►  August (5)
    • ►  July (5)
    • ►  June (5)
    • ►  May (6)
    • ►  April (6)
    • ►  March (1)
    • ►  February (6)
    • ►  January (5)
  • ►  2012 (56)
    • ►  December (4)
    • ►  November (3)
    • ►  October (6)
    • ►  September (4)
    • ►  August (7)
    • ►  July (3)
    • ►  June (4)
    • ►  May (5)
    • ►  April (3)
    • ►  March (4)
    • ►  February (8)
    • ►  January (5)
  • ►  2011 (74)
    • ►  December (9)
    • ►  November (8)
    • ►  October (6)
    • ►  September (5)
    • ►  August (5)
    • ►  July (3)
    • ►  June (6)
    • ►  May (5)
    • ►  April (6)
    • ►  March (8)
    • ►  February (7)
    • ►  January (6)
  • ►  2010 (75)
    • ►  December (9)
    • ►  November (9)
    • ►  October (5)
    • ►  September (6)
    • ►  August (7)
    • ►  July (3)
    • ►  June (6)
    • ►  May (9)
    • ►  April (4)
    • ►  March (6)
    • ►  February (6)
    • ►  January (5)
  • ►  2009 (96)
    • ►  December (2)
    • ►  November (4)
    • ►  October (5)
    • ►  September (9)
    • ►  August (7)
    • ►  July (16)
    • ►  June (9)
    • ►  May (5)
    • ►  April (11)
    • ►  March (11)
    • ►  February (11)
    • ►  January (6)
  • ►  2008 (59)
    • ►  December (6)
    • ►  November (3)
    • ►  October (8)
    • ►  September (1)
    • ►  August (5)
    • ►  July (8)
    • ►  June (5)
    • ►  May (5)
    • ►  April (6)
    • ►  March (3)
    • ►  February (3)
    • ►  January (6)
  • ▼  2007 (84)
    • ►  December (4)
    • ▼  November (6)
      • Low Cost CDI from Infosolve, Pentaho and StrikeIron
      • Just How Scalable Is QlikTech?
      • SAS Adds Real Time Decisioning to Its Marketing Sy...
      • BridgeTrack Integrates Some Online Channels
      • Datran Media Sells Email Like Web Ads
      • The Next Big Leap for Marketing Software
    • ►  October (6)
    • ►  September (1)
    • ►  August (4)
    • ►  July (7)
    • ►  June (16)
    • ►  May (20)
    • ►  April (20)
Powered by Blogger.

About Me

Unknown
View my complete profile