<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Architecture</title>
	<atom:link href="http://www.ericjlawson.co.uk/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ericjlawson.co.uk</link>
	<description>... Business Intelligence &#38; ETL ... advice &#38; opinion ...</description>
	<lastBuildDate>Mon, 29 Apr 2013 09:30:34 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Should you bite the hand that feeds you?</title>
		<link>http://www.ericjlawson.co.uk/2013/04/29/should-you-bite-the-hand-that-feeds-you/</link>
		<comments>http://www.ericjlawson.co.uk/2013/04/29/should-you-bite-the-hand-that-feeds-you/#comments</comments>
		<pubDate>Mon, 29 Apr 2013 09:30:34 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[BI]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Management]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=391</guid>
		<description><![CDATA[Jamie Thompson is an MVP, but more to the point he is a very experienced SSIS developer and a prolific contributor to the SQL Server community. Surely this is why he was awarded an MVP by Microsoft. His recent blog post on GeoFlow (an add-in for Excel that delivers 3D visualisation and story telling) has [...]]]></description>
				<content:encoded><![CDATA[<p>Jamie Thompson is an MVP, but more to the point he is a very experienced SSIS developer and a prolific contributor to the SQL Server community. Surely this is why he was awarded an MVP by Microsoft.</p>
<p>His <a title="Jamiet GeoFlow blog post" href="http://sqlblog.com/blogs/jamie_thomson/archive/2013/04/14/geoflow-is-that-it-microsoft.aspx?CommentPosted=true#commentmessage" target="_blank">recent blog post on GeoFlow</a> (an add-in for Excel that delivers 3D visualisation and story telling) has raised the issue of whether MVP&#8217;s should criticise Microsoft.</p>
<p>I think its fair to say that the consensus from respondents in the blog comments, is yes criticise if its warranted, but don&#8217;t get into the habit of always moaning. Positive feedback is as crucial as negative feedback. You could also argue negative feedback can have a positive effect if handled appropriately (both the giving and receiving).</p>
<p>I should also point out that the suggestion that MVP&#8217;s should not criticise Microsoft  (that I saw) came from another MVP (not an employee of MS, I assume). Please read the blog post and comments if you want to be sure as there are many and I skimmed some of the entries.</p>
<p>Many of the comments were from IT professionals trying to deliver mobile BI solutions to their organisation or clients and were expressing their frustration at the lack of Microsoft mobile BI support. Jamie&#8217;s blog discusses the lack of integration of the GeoFlow project offerings into the core MS BI delivery platform.</p>
<p>I expect every MVP at times, will question if the role is good for them. I guess it is very beneficial, for mostly intangible reasons. I imagine being a good MVP takes a huge amount of time and personal cost to attend conferences, user groups etc. and reasons to accept or decline the role would take into account many more things than the freedom to criticise Microsoft.</p>
<p>What about biting the hand of your employer or line manager, which brings me to the other side of the coin, perhaps kissing the hand that feeds you?</p>
<p>Within an IT team or department, the trouble with always moaning is you can become a miserable git, stop enjoying your work and more importantly, be too busy whinging and miss opportunities to deal with the initial cause of the moaning. This is crudely referred to as &#8216;having your head up your &#8230;&#8221;. No-one will want to talk to you, work with you and you can forget anyone listening to you. Even when you are right. Think Peter and the Wolf.</p>
<p>But the opposite can be just as unproductive.</p>
<p>Beginning of the week &#8211; &#8220;Any issues with meeting the deadline for your [<em>insert project deliverable</em>] ?&#8221; asked the project manager.</p>
<p>&#8220;No its all good&#8221; answered the programmer.</p>
<p>End of the month &#8220;why is your [<em>project deliverable</em>] not ready?&#8221; asked the project manager.</p>
<p>&#8220;Because [<em>insert excuse</em>]&#8221; said the programmer.</p>
<p>The excuses are mostly irrelevant at this stage for the project manager. Project managers [should] want to intervene and resolve issues before deadlines are missed. Firefighting when its too late is okay every now and then but it becomes soul destroying when its endemic. I bet it&#8217;s often at the heart of good IT professionals moving onto to greener pastures.</p>
<p>Never speaking out could be considered cruising along on Prozac or sailing with the &#8220;it&#8217;s all good&#8221; and &#8220;have a great day&#8221; crowd. Or maybe its a stubborn streak, preventing you from admitting that something is defeating you. Perhaps you are in danger of being labelled &#8220;doesn&#8217;t play well with others&#8221;?  Maybe you are not at your best at the moment and need a break?</p>
<p>Or maybe you dont think its your place to question the decisions made by your boss?</p>
<p>I strongly believe that if you think a directive handed down by your team lead, project manager or IT boss is wrong (<em>good luck if all three of these lines can directly request you to do tasks!!!</em>), or misses something, you should challenge it in a respectful way at the right time. Its also the best way to learn. If you happen to be wrong or perhaps didn&#8217;t fully understand the requirement etc. a good team lead or architect will take the time to explain and get you up to speed. If you aren&#8217;t brave enough to speak up and admit you don&#8217;t understand or question a proposal or task, you wont learn.</p>
<p style="text-align: left;">Maybe you aren&#8217;t wrong. Often there is no black or white answer and every idea should get consideration (maybe 30 seconds perhaps 30 minutes). Managers and design leads / architects aren&#8217;t perfect. They (should) make considered use of the team they are working with to collaborate and produce robust solutions. The end result of the combined thinking of everybody who can contribute will generally be better than the work of an individual.</p>
<p>However, it is their job to lead and propose solutions. But as I said they aren&#8217;t perfect. So by sitting on your hands and never challenging those above you in hierarchy, experience, competence or pecking order, you are doing yourself, them and your employer a dis-service.</p>
<p>Sadly, some managers lack the experience or humility to permit &#8220;underlings&#8221; to contribute to the overall decision making process of a department. These types can be identified by the habits of not conducting meetings with more than 1 person (maybe 2 if pushed), having favourites amongst their line managers (who often turn out to be &#8216;yes&#8217; people or are playing the long game and dont rock the boat). They usually don&#8217;t respect lines of authority (ie. go around your team leader with requests) and they are often micro-managers and cant delegate.</p>
<p>It is generally very difficult to find opportunties to offer constructive criticism in these types of environments. Maybe you will be lucky to  be working within a tight team and your team lead or line manager shields you from these issues.</p>
<p>So if you have a supportive workplace, where top down, bottom up collaboration is encouraged, then work with it and find solutions to vendor shortcomings. Get hold of the GeoFlow add-in and innovate. I am sure thats what many will get on and do.</p>
<p>However, if the traits I described ring bells for you and you are feeling frustrated or stifled then you need to weigh up your options. Maybe you are considering going around your direct report to offer constructive criticism up the line. If so, its probably already too late and you should be planning your next career move. Your job might offer security and stability, but being a down trodden miserable or stressed IT professional isn&#8217;t going to make you happy and wont produce good code in the long run.</p>
<p>Above all, don&#8217;t sit on your hands and moan in the tea room (or even worse, to your significant other).</p>
<p>Get involved and contribute.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2013/04/29/should-you-bite-the-hand-that-feeds-you/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SQL Santa London</title>
		<link>http://www.ericjlawson.co.uk/2012/12/14/sql-santa-london/</link>
		<comments>http://www.ericjlawson.co.uk/2012/12/14/sql-santa-london/#comments</comments>
		<pubDate>Fri, 14 Dec 2012 15:15:18 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[BI]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[ETL]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[ashton tate]]></category>
		<category><![CDATA[Bill Gates]]></category>
		<category><![CDATA[Bill Inmon]]></category>
		<category><![CDATA[business intelligence]]></category>
		<category><![CDATA[Dimensional modelling]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[in-memory]]></category>
		<category><![CDATA[jimmy treybig]]></category>
		<category><![CDATA[MEGO]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[nonstop]]></category>
		<category><![CDATA[OMR]]></category>
		<category><![CDATA[ralph kimball]]></category>
		<category><![CDATA[sql server]]></category>
		<category><![CDATA[Sqlfaq]]></category>
		<category><![CDATA[sqlsanta]]></category>
		<category><![CDATA[star schema]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=363</guid>
		<description><![CDATA[Today I had the very big privilege of delivering my Star Schema Database Primer presentation at the SQL Santa 2012 event, held today at Microsoft in London. The free event, put on by the SQLFAQ group, sold out two weeks prior to the date. A sure sign of the increasing popularity and usage of Microsoft [...]]]></description>
				<content:encoded><![CDATA[<p>Today I had the very big privilege of delivering my Star Schema Database Primer presentation at the <a title="SQL Santa London 2012" href="http://sqlserverfaq.com/events/452/SQL-Santa-2012.aspx" target="_blank">SQL Santa 2012</a> event, held today at Microsoft in London.</p>
<p>The free event, put on by the SQLFAQ group, sold out two weeks prior to the date. A sure sign of the increasing popularity and usage of Microsoft SQL Server.</p>
<p>Here is the <a title="Star Schema Database Primer" href="http://www.ericjlawson.co.uk/StarSchemaPrimerV2/index" target="_blank">slide deck</a> from my presentation. Please feel free to contact me if you have any questions.</p>
<p>If you weren&#8217;t able to attend, it covers the evolution of the relational database model and the major players, what a normalised data model looks like and how a star schema database deliberately moves away from that OLTP optimsed design, to deliver the basis for high performance decision support solutions.</p>
<p>I was completely blown away by the packed room of approx 125 attendees. Thanks for listening. I really enjoyed myself and I hope you found the time you gave up worthwhile and a big thank you to Tony Rogerson and the rest of the SQLFAQ team for the opportunity.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/12/14/sql-santa-london/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Scientists are GO</title>
		<link>http://www.ericjlawson.co.uk/2012/11/22/data-scientists-are-go/</link>
		<comments>http://www.ericjlawson.co.uk/2012/11/22/data-scientists-are-go/#comments</comments>
		<pubDate>Thu, 22 Nov 2012 15:34:28 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[BI]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[bigdata]]></category>
		<category><![CDATA[data scientist]]></category>
		<category><![CDATA[decision support]]></category>
		<category><![CDATA[EDW]]></category>
		<category><![CDATA[KPI]]></category>
		<category><![CDATA[thunderbirds]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=246</guid>
		<description><![CDATA[Was Brains of the Thunderbirds a “data scientist” as well as a genius engineer? If not, which member of the Tracey family designed the world wide data capture system for monitoring disasters &#38; crimes and the necessary analytics system for selecting from that feed the mission for the next episode? International Rescue might have started [...]]]></description>
				<content:encoded><![CDATA[<p>Was Brains of the Thunderbirds a “data scientist” as well as a genius engineer? If not, which member of the Tracey family designed the world wide data capture system for monitoring disasters &amp; crimes and the necessary analytics system for selecting from that feed the mission for the next episode?</p>
<p>International Rescue might have started out as a small family business with ‘Father’ orchestrating operations using instinct &amp; raw intelligence, but it would soon have developed into a considerable enterprise with R&amp;D, product development, branding and a network of contacts across the globe.</p>
<p>Stupid fantasy question? Perhaps it’s not so different from many of the growing enterprises across the globe. I think many businesses struggle to transition from small business to a corporate enterprise and the transition to more formal decision making processes is key challenge. Typically, directors are likely to still be using experience, instinct and if they have come to grips with delegation, a core of able lieutenants keeping operations and logistics finely tuned.</p>
<p>When it comes to ‘business intelligence’, they are likely to be struggling to shift from traditional reporting deployments (providing data on what happened) to fact based decision support solutions, driving real time KPI monitoring. Forget analytics driven insight. It’s tough enough responding to market conditions and I bet many financial controllers are still chasing that dream of daily profitability KPI’s.</p>
<p>And it not just a problem for new growing companies. Older, established enterprises with legacy computer systems also face similar challenges. Many production capture systems are still data silos that impede decision making because of the legacy host. Lets not forget many of the features of relational &amp; and other computation theory have only recently become commercially viable due to the advances in solid state memory.</p>
<p>Therefore, I think ‘Big data’ is so far down the road for most companies its got that ‘oasis’ heat shimmer thing going on. The hype that is currently being generated makes for a lot of FOMO (fear of missing out). This makes these companies a prime target for ‘marketing’ coyotes &amp; foxes and they are in plague proportions. Dont get me wrong. Big data is real, its just not the first step on the journey of analytics driven decision making.</p>
<p>Companies such as Google, Facebook and Twitter (to name a few) are inherently ‘big data’ concerns. It doesn&#8217;t follow that every other company would benefit from ‘getting on board’ this revolution. However, I firmly believe that all businesses would benefit from improved decision support solutions. Its a matter of choosing something appropriate that can scale as you grow.</p>
<p>So if you have been “tuning in” to this new IT drama series on the web, called “BIG data” and are getting a bit tired of it, you will be pleased to know that there is now a spin off series called “the data scientist”.</p>
<p>So in addition to our daily staple diet of yet another marketing led big data offering, we will very soon see a flood of recruitment agents touting a stable of “data scientists” on their books or similarly a swathe of professionals changing their job titles on LinkedIn.</p>
<p>But hang on a minute, didn’t Howard Dresner recently tell us that <em>the capabilities of a data scientist are extremely rare</em>. Those capabilities apparently being <em>“someone who provides the capability to increase the value of an organization’s information assets”</em>.</p>
<p>Too right he did. However, if the Twitter comments of some fellow SQL professionals who have been recruiting for skilled T-SQL, DBA &amp; ETL talent recently are indicative, then there will be plenty seeking work out there, who are prepared to bluff their way into this role. Many agents will struggle to filter them out as the IT industry itself comes to grips with understanding this new analytical direction.</p>
<p>Anyway Howard hosts a weekly BI twitter session and I might start joining in. Please check out his <a title="Howard Dresner" href="http://sandhill.com/article/dresners-point-data-scientists-have-keys-to-the-kingdom/" target="_blank">original article</a> that I am quoting to get the full story.</p>
<p>The Register.co.uk reported a few weeks ago about predictions of a <a title="The Register article" href="http://www.theregister.co.uk/2012/11/02/data_science_goes_to_university/" target="_blank">global skills shortages</a> citing the McKinsey Global Institute <a title="McKinsey report" href="http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/big_data_the_next_frontier_for_innovation" target="_blank">report</a>. The author, Mark Whitehorn, suggests <em>a data scientist would excel at analysing data, particularly large amounts of data that does not fit easily into tabular structures, so called “big data”</em>. Sounds quite reasonable to me. I am a long time reader of the Register and recommend it highly to anyone in the tech industry. They give &#8216;Fleet Street&#8217; a run for their money on inventive headlines.</p>
<p>In another <a title="Smart Data Collective" href="http://smartdatacollective.com/michael-koploy/83086/advice-aspiring-data-scientist" target="_blank">report</a>, Bruno Aziza says he has found <em>the most successful data scientists balance brains and brawn–they can manipulate databases like its nothing, work well with other coworkers and present findings to the executive board. “Think of a data scientist more like the business analyst-plus,”</em> says Aziza.</p>
<p>In the same report Jeff Hammerbacher, who previously led the data science team at Facebook, recounted that his team would use Python, R, and Hadoop, and then have to relay the findings to a non-technical team on any given day. The more you know, the better you’ll be prepared to solve the day’s problems.</p>
<p>I do wonder what Jeff meant by &#8220;non-technical&#8221;. Given the specialisation that goes on in IT these days, that could mean the front line data analysts that support the business report writers. Or it could be the analyst in Marketing who is planning the next seasons promotions. Perhaps its director level strategic planning sessions. It does highlight that converting data into information is still a classical task that requires directed effort that is linked all the way from senior decision makers through to operational users. There is no straightforward hierarchy in this process as data and information flows are very intertwined, now more than ever.</p>
<p>I have only recently subscribed to the Smart Data Collective and follow them on Twitter (<a title="@SmartDataCo" href="https://twitter.com/SmartDataCo" target="_blank">@SmartDataCo</a>). There are some very clever, well informed contributors on this site, as well as curation of high quality vendor sponsored whitepapers.</p>
<p>A recent Gartner <a title="Gartner research note" href="http://www.gartner.com/it/page.jsp?id=2211115" target="_blank">research note</a> has injected a healthy dose of “a warp drive plasma” equivalent into the &#8220;big data&#8221; mix. The headline quote for big data (but by no means the only interesting one), is <em>&#8220;By 2015, big data demand will reach 4.4 million jobs globally, but only one-third of those jobs will be filled.&#8221;</em> Another significant trend was that <em>CMO’s (chief marketing officer) will begin to take over the budget that CIO’s are used to controlling. </em>Now that would be interesting.</p>
<p>So, clearly interesting times are ahead. Gartner think Marketing will take control of a huge chunk of what was traditional IT budget. Cloud based PaaS, SaaS &amp; IaaS are making it even easier for business units to go around IT department backlogs and unrealistic timelines. There are some very innovative cloud solutions out there and it certainly seems the way of future IT, but its not necessarily right for your business, or that specific requirement, now or ever. Gartner also think there wont be enough talent to meet demand.</p>
<p>I advocate loads of caution and pragmatism when getting out your cheque book for a third party service or solution and also if you decide to embark upon this in house. If you wish to better understand the roots of ‘big data’ and get a handle on what its really about, then I would advise starting with the book “<a title="Amazon link" href="http://www.amazon.co.uk/Analytics-Work-Smarter-Decisions-ebook/dp/B004OC07BW/ref=tmm_kin_title_0" target="_blank">Analytics at Work, Smarter Decisions Better Results</a>”.</p>
<p>In the opening paragraphs this line struck a chord with me &#8211; <em>“Rigorous analytical testing can establish whether your intervention [ … a customer communication, product discount, sales promotion, product placement, loyalty scheme … ] is causing desired changes in your business, or whether its simply the result of random statistical fluctuations”.</em></p>
<p>If you are part of a company that is facing the challenges that I have referred to, then this book is an excellent starting point. I think its an especially useful tool for understanding &amp; communicating core issues and concepts to those within an organisation not familiar with analytics. However, its primarily presented as a methodology and this is where I found it most useful. There are no shortage of excellent tech manuals on how to get the most of database engines and how to build efficient ETL pipelines, which coupled with experience, whether personal or shared within a team, can deliver top quality <strong>decision support solutions</strong>.</p>
<p>However, a well conceived and robust analytics strategy can focus a business on what is really important right now, and ensure that tactical endeavours have the greatest chance of success. The right answers to the wrong questions, delivered in near realtime don&#8217;t help. I think this is why many of us IT techies have an affinity with the answer &#8220;42&#8243;, as we cynically appreciate the importance of asking the right question. The task of formulating the right questions is a business wide task requiring strong leadership, delegation and empowerment. This is also why I strongly believe that steering of BI / decision support solutions must be lifted out of IT, as IT types aren&#8217;t traditionally the best communicators and generally don&#8217;t have a business wide perspective or remit.</p>
<p>An alternative to being drawn into the &#8220;BIG data&#8221; hype, is to start a business intelligence unit, with a reporting line and budget separate from or in tandem with the IT department (you cant do it without them).</p>
<p>I would begin with a trinity of one finance domain expert, one marketing domain expert (or supply chain or analyst representing your core business activity) and a good data analyst from IT (or failing that an Excel expert from within the business). With a remit from and reporting to the relevant departmental directors (or board), define some useful, tangible questions &amp; outcomes, with a short timeframe (weeks) and get started. Treat it like an old fashioned science project, predict the outcomes and learn from the results, expected or otherwise.</p>
<p>But don’t mistake this for a purely technical excise. Creation of an independent BI unit will also potentially shift the control of information within the business and as Francis Bacon said, &#8220;Knowledge is power&#8221;. The role of those steering the BI unit is crucial and its never easy to implement change within an organisation. Another challenge for those in charge of a business is to accept the reality of solid analysis, especially when it flies in the face of what the collective opinion currently is.</p>
<p>Seek out expertise in-house if you have it. Otherwise, hire an experienced BI consultant or data architect who has a track record and can demonstrate competency. Hiring a full time ‘data scientist’ or even a rockstar ‘data analyst’ and setting them loose from the outset, is effectively getting your tools out before you have a design or plan. Also, if this is new ground for your company, then you may not have the qualifications to hire the right skills and know when they are needed.</p>
<p>This <a title="The End of the Expert: Why No One in Marketing Knows What They’re Doing" href="http://www.forbes.com/sites/dorieclark/2012/11/11/the-end-of-the-expert-why-no-one-in-marketing-knows-what-theyre-doing/?utm_campaign=techtwittersf&amp;utm_source=twitter&amp;utm_medium=social" target="_blank">article</a> arrived as I was working on the final draft and is very timely. With a title of “The End of the Expert: Why No One in Marketing Knows What They’re Doing” its quite controversial, but makes some excellent points. This change of landscape could happen in any area of your business. Dont leave it too late to change direction. Robust BI solutions can provide very early warnings and analytics will help your company work out where to head next.</p>
<p>The <a title="latest debate" href="http://www.informationweek.com/big-data/news/big-data-analytics/big-data-debate-end-near-for-data-wareh/240142290" target="_blank">latest debate</a> is that Hadoop and other no SQL solutions will bring about the death of RDBMS and the EDW. OODBMS was going to kill RDBMS ten years ago. Didn’t happen and it won’t happen this time.</p>
<p>Purposeful analytics solutions are shaping up to be very powerful tools, but I think right now, a CIO would be crazy to embark upon a program of shifting the corporate data asset out of proven RDBMS technology platforms, underpinned by an experienced and populous talent base, into the brave new world overseen by this new breed of data scientist. It needs time to mature and even then I am not so sure it is a one size fits all solution. EDW methodology is as old as relational theory and is still relevant.</p>
<p>What also concerns me is that government &amp; retail marketing seem to be at the head of this near hysterical big data push. This combination is worrying and I am concerned about the potential standards and ethics behind some of the data mining initiatives being conceived and implemented. This &#8216;BIG data&#8217; push is coming at a time when existing protection instruments are coming apart at the seams, due to being conceived so long ago and no longer being fit for purpose.</p>
<p>Finally, I recall reading an article in one of the Computer magazines over a decade ago, which advised being wary of ’silver bullet&#8221; solutions that promise the world. Usually to be delivered in a guaranteed time frame and they will solve all your problems (usually sold by someone in a teflon coated suit). BE WARY. There are plenty of foxes &amp; coyotes out there. I think the recent bad news for HP highlight this is still very relevant today.</p>
<p>FAB Virgil</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/11/22/data-scientists-are-go/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Star Schema Primer</title>
		<link>http://www.ericjlawson.co.uk/2012/10/03/star-schema-primer/</link>
		<comments>http://www.ericjlawson.co.uk/2012/10/03/star-schema-primer/#comments</comments>
		<pubDate>Wed, 03 Oct 2012 18:00:31 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[BI]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[ETL]]></category>
		<category><![CDATA[ashton tate]]></category>
		<category><![CDATA[dbase]]></category>
		<category><![CDATA[ingres]]></category>
		<category><![CDATA[jimmy treybig]]></category>
		<category><![CDATA[kimball]]></category>
		<category><![CDATA[nonstop]]></category>
		<category><![CDATA[oracle]]></category>
		<category><![CDATA[sql server]]></category>
		<category><![CDATA[star schema]]></category>
		<category><![CDATA[sybase]]></category>
		<category><![CDATA[system / r]]></category>
		<category><![CDATA[tandem]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=229</guid>
		<description><![CDATA[Tonight I was privileged to be able to make my first presentation within the SQL Server community at the monthly SQL Server Southampton user group. The tagline of the presentation was &#8220;the core principles of the Star Schema database, used for decision support solutions and as the bedrock for Analysis Services cubes and the new [...]]]></description>
				<content:encoded><![CDATA[<p>Tonight I was privileged to be able to make my first presentation within the SQL Server community at the monthly <a title="SQL Server Southampton user group" href="http://sqlsoton.blogspot.co.uk" target="_blank">SQL Server Southampton user group</a>.</p>
<p>The tagline of the presentation was &#8220;the core principles of the Star Schema database, used for decision support solutions and as the bedrock for Analysis Services cubes and the new Tabular model&#8221;.</p>
<p>It covers the evolution of the relational database model and the major players, what a normalised data model looks like and how a star schema database deliberately moves away from that OLTP optimsed design, to deliver the basis for high performance decision support solutions.</p>
<p>I cover the basic features of the star schema database, dimension and fact table examples and considerations for best use of these database designs.</p>
<p>If you are starting out in this field then I hope this presentation provides some context and useful tips. I extensively used Wikipedia for my research and have provided loads of links. So whenever you want to find out when Larry Ellison spotted an opportunity waiting to be exploited, which RDBMS system spawned many others and who is Jimmy Treybig, this primer brings many of these trivial pursuit answers for data professional together in one place.</p>
<p>Click to view the <a title="Star Schema Primer" href="http://www.ericjlawson.co.uk/StarSchemaPrimer/index" target="_blank">Apple Keynote presentation</a> in your browser.</p>
<p>PS. if you spot any errors, or think I have done someone or some company a disfavour by omission, please let me know.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/10/03/star-schema-primer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Developing SSIS solutions on OS X : Part2 &#8211; Equipment and software list</title>
		<link>http://www.ericjlawson.co.uk/2012/09/08/developing-ssis-solutions-on-os-x-part2-equipment-and-software-list/</link>
		<comments>http://www.ericjlawson.co.uk/2012/09/08/developing-ssis-solutions-on-os-x-part2-equipment-and-software-list/#comments</comments>
		<pubDate>Sat, 08 Sep 2012 11:51:18 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[ETL]]></category>
		<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=225</guid>
		<description><![CDATA[(Part 1 &#8211; Introduction) Here is my ideal kit list; MacBook Pro 15&#8243; SSD or Momentus hybrid drive Optional conversion of DVD bay to a second HD bay Apple Thunderbolt Display (or similar external monitor) The software that you will need does depend on whether you are operating as an independent consultant or whether you [...]]]></description>
				<content:encoded><![CDATA[<p>(<a href="http://www.ericjlawson.co.uk/2012/04/20/developing-ssis-solutions-on-os-x-part1-introduction/" target="_blank">Part 1 &#8211; Introduction</a>)</p>
<p>Here is my ideal kit list;</p>
<ul>
<li>MacBook Pro 15&#8243;</li>
<li>SSD or Momentus hybrid drive</li>
<li>Optional conversion of DVD bay to a second HD bay</li>
<li>Apple Thunderbolt Display (or similar external monitor)</li>
</ul>
<p>The software that you will need does depend on whether you are operating as an independent consultant or whether you are working within a coproate domain environment. The main difference stemming from this will be the software you personally need to deploy and it may not differ.</p>
<p>To run Windows you will require a copy of VMWare Fusion 4 (version 5 is now out). This runs as a native 64bit OS X application. I have tried Parallels (an early version and version 6) and I found it very poor for setup and the support was sub standard.</p>
<p>You absolutley need loads of RAM and fast disk storage. I think a minimum of 8gb of RAM is essential if you are too get the best of both operating sytems. These MBP’s run 16gb.</p>
<p>I think a mobile computing device is essential these days and the MacBook Air is not really designed for user upgrading, so the MacBook Pro is the obvious choice. However, I started writing this piece before the mid 2012 hardware refresh and there are now some alternatives.</p>
<p>The Macbook Air, configured with 8gb of RAM and 512gb of SSD is sufficiently configured to run a Windows VM with plenty of on board storage. The 2.0ghz i7 CPU packs plenty of power. The Macbook Pro 13“ isnt much different from the Macbook Air other than a faster CPU and the option for a traditional HD. However the 15” MacBook Pro has the separate graphics card (now from NVIDIA). This could make a difference for graphics intensive work, potentially making a difference with SSIS, however I can be certain of that. I do know that software like Pro Tools (music creation) suffers from the lack of a distinct graphics card, where heavy compute for maths operations plus simultaenous screen painting strains the system.</p>
<p>In the end I still recommend the Macbook Pro 15&#8243; (non retina simply due to the excessive price and lack of user upgradeable options).</p>
<p>The best new feature with the mid 2012 refresh is the inclusion of USB 3. This is a super fast interface, much better than Firewire 800 and not far off Thunderbolt. Also, you can buy a very decent 320gb drive for under £60. I did a copy using CCC (Carbon Copy Cloner) from my SSD drive to the USB 3 drive of a 40gb Windows VM file. The first gigabyte was done in 20 seconds. it completed the 40gb in 10 minutes.</p>
<p>I have also tested running the Windows 7 VM machine straight off the USB 3 drive and it started up in an adequate time. Certainly not as fast as my onboard OCZ Vertex 4 SSD but very good when compared with a stock onboard 7200rpm HD. This is certainly and excellent option if you dont want to do any DIY work replacing your DVD drive with a HD / SSD caddy.</p>
<p>The biggest risk using Windows is Windows OS corruption. This is even more likely when using virtual machines. I have had instances where the VM has become corrupted and I have had to use a backup. However, beware if your VM is attached to a Windows domain as if the backup is too old, you can get a ‘untrusted’ error message and recovery from that wasn’t possible.</p>
<p>So I have two uses for Carbon Copy Cloner and external drives. I am now using a G-Tech Firewire 800 1tb external drive to contain a clone of my entire system drive. This is scheduled to run every night in the wee hours. So if I am at a customer site and I get the spinning wheel of death (the OS X equivalent of the Windows BSOD) then I can boot the system from the external drive. This means no loss of time diagnosing problems and rebuilding the system. This drive usually stays in my bag as this type of issue is a rare occurence.</p>
<p><strong>Next installment – Integration and software options</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/09/08/developing-ssis-solutions-on-os-x-part2-equipment-and-software-list/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Super(hung out to)Dry</title>
		<link>http://www.ericjlawson.co.uk/2012/06/11/superhung-out-todry/</link>
		<comments>http://www.ericjlawson.co.uk/2012/06/11/superhung-out-todry/#comments</comments>
		<pubDate>Mon, 11 Jun 2012 15:00:07 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[BI]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Management]]></category>
		<category><![CDATA[Attunity]]></category>
		<category><![CDATA[Bill Inmon]]></category>
		<category><![CDATA[CFO]]></category>
		<category><![CDATA[CIO]]></category>
		<category><![CDATA[column based indexing]]></category>
		<category><![CDATA[COO]]></category>
		<category><![CDATA[FT]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[Matt Benati]]></category>
		<category><![CDATA[ralph kimball]]></category>
		<category><![CDATA[Reporting]]></category>
		<category><![CDATA[SuperDry]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=212</guid>
		<description><![CDATA[The admission in April 2012 by SuperGroup (the parent company of fashion brand SuperDry), that the entry of a &#8216;+&#8217; sign instead of a &#8216;-&#8217; sign on some forecasting figures, led to its 3rd &#8216;profit&#8217; warning being issued in 6 months. This led to a revised annual profit shortfall of £8m. Robust business intelligence (BI) [...]]]></description>
				<content:encoded><![CDATA[<p>The admission in April 2012 by SuperGroup (the parent company of fashion brand SuperDry), that the entry of a &#8216;+&#8217; sign instead of a &#8216;-&#8217; sign on some forecasting figures, led to its 3rd &#8216;profit&#8217; warning being issued in 6 months. This led to a revised annual profit shortfall of £8m. Robust business intelligence (BI) processes might have prevented this data error.</p>
<p>At the time a city analyst declared the group&#8217;s finance department &#8216;not fit for purpose&#8217;. Its shares tumbled 38% on this news. <a title="FT article reporting on SuperGroup recruitment" href="http://www.ft.com/cms/s/0/1af3b680-9a88-11e1-9c98-00144feabdc0.html#axzz1xPhCSu30" target="_blank">A month ago</a>, the group announced the recruitment of a new COO (new role), a new CFO and a new Head of Retail in an effort to boost the management team and increase the professionalism of the group.</p>
<p>In general terms IT provisioning for a business such as SuperGroup falls into two broad categories. The first are the type of solutions that support the procurement and sale of goods, wholesale or retail. This is traditional IT and it has been around for a long time, delivering improvements and cost savings in areas like HR, Supply Chain, Marketing etc. New sales channels and models have developed over time (eg. the web, eBay etc), but the focus is still on response times, availability and scalability. It&#8217;s all about getting the job done, modeling business processes and providing automation and capture systems as appropriate.</p>
<p>The second category of IT provisioning is centred on decision making. Business relies very heavily on information but not all are fortunate enough to have timely and reliable data to work with. Manual number crunching and intuition (&#8220;gut feel&#8221;) have served many a business well but it&#8217;s not enough anymore, especially as a business grows beyond the point where everybody knows each other by first name.</p>
<p>Traditionally, information for decision making in larger businesses was delivered either via the core &#8216;online&#8217; system or through nightly or cyclic report production runs. Information delivered via online systems generally took the form of single record data items, such as information about the customer, supplier or order that a clerk / call centre operator might be queried about. Reports were run to arrive at the summary totals on the last page, to assess the outcome of the previous day&#8217;s business, for order picking, invoicing etc.</p>
<p>Over the last 10-15 years operational data stores and data marts came into vogue with the purpose of feeding derived business intelligence back into the core online systems. Combined with the transition from dumb terminals to PC based client interfaces, IT was able to arm the customer facing areas of the business (and other areas too) with as much relevant information about the &#8220;customer&#8221; as possible.</p>
<p>Decision support systems began to get airplay in IT departments. Data Services teams were formed to handle the growing number of multi purpose databases and data oriented solutions. The methodologies of data warehousing advocates <a title="Ralph Kimball" href="http://www.kimballgroup.com/" target="_blank">Ralph Kimball</a> and <a title="Bill Inmon" href="http://www.inmoncif.com/about/" target="_blank">Bill Inmon</a>, gained significant traction. Programs specific for moving data between disparate systems and databases, got the new moniker Extract, Transform &amp; Load (ETL).</p>
<p>However, still the primary focus of many IT departments was on the core online systems, with a centralised infrastructure and traditional project methodologies. And fair enough too as there has been plenty to contend with. This would be especially true for small, rapidly expanding businesses. However, I am sure there a many exceptions to this generalisation of focus and perhaps some of them are now Fortune 500 companies (and the equivalent across the world markets).</p>
<p>Solutions deployed were quite often &#8216;brittle&#8217; (a term I heard Matt Benati of <a title="Attunity" href="http://www.attunity.com" target="_blank">Attunity</a> use recently and I think it&#8217;s quite apt). That is, not quick to adapt to changes in the business and prone to breaking. An architect I have had the fortune of working with lately has been advocating significant levels of isolation to shield downstream systems (such as a data warehouse) from the impact of change. However, it&#8217;s not always easy or practical to do this.</p>
<p>Another new term has been cropping up recently (Attunity again), in the big data discourse. <a title="Extract, Expedite &amp; Load" href="http://resources.attunity.com/the-new-world-of-data-integration---augmenting-etl-with-eel/" target="_blank">Extract, Expedite and Load (EEL)</a>. It&#8217;s interesting because the premise here is on getting data out of capture systems in real time and consuming it as fast as possible in the target system. This concept is very compelling. I personally think that it will gain momentum.</p>
<p>The SuperGroup example is a timely reminder that robust BI processes are critical to any organisation operating in todays business environment. A rapidly growing business should treat BI with equal or more importance than core IT systems. Aggressive business expansion without commensurate capability to measure those business activities themselves, is a recipe for disaster. &#8220;Reporting&#8221; as BI is often lazily categorised, should not be 2nd string. Failure may not be as immediate and obvious as an &#8220;offline&#8221; order taking system, but it can be just as damaging to the bottom line (or more so).</p>
<p>Larger organisations needs to get tough with in place IT power bases. Core IT systems require stability combined with continuous improvement. BI solutions are a completely different proposition, requiring many traditional IT skills &amp; features. When combined with some of the new amazing new database solutions and storage advances (column based indexing, SSD&#8217;s, cloud deployments, CDC software etc), significant improvements can be made.</p>
<p>I am not talking about traditional operational reporting (which can be a deliverable of the BI team). That&#8217;s the straightforward, routine, but bottom line piece of the puzzle. BI solutions must also take the tactical and operational goals of the business (derivable from the strategic vision and plans) and provide an integrated view, all the way from executive dashboards, through tactical performance analysis down to operational workbenches and shop floor devices (and all the way back to the top).</p>
<p>However, the often overlooked challenges for BI are the human ones, the process based issues. This links back to how IT systems development needs to embrace and inform business process. Unfortunately this is when resistance to change is encountered, throughout all levels of the business. This was a factor in the 80s and 90s and clearly remains relevant today. Back then the change mainly impacted the front line and more blue collar oriented workers. Now, transitional BI impacts the strategic &amp; tactical decision makers.</p>
<p>There needs to be acknowledgement of the direct and crucial relationship between steering a business through a rapidly changing landscape and the continuous need for timely and accurate data to support the process. If a business is forced to make rapid changes, the supporting data needs to be delivered into the BI system <strong><em>now</em></strong>, sometimes next week, but definitely not next quarter or next year.</p>
<p>This is essential to create the integrated view of business performance as well as ensuring the impact of tactical decisions are measurable in a timely and accurate manner. In other words, a BI decision support system that is directly coupled to the operational systems.</p>
<p>To give a rapidly growing business its best shot at transitioning from &#8220;reporting&#8221; into a true BI culture, the BI team needs a degree of independence from mainstream IT activities. It may require its own budget &amp; project planning and should report directly into the CxO and / or board of directors. Its role is to serve the entire business management structure directly involved in critical decision making. It is not just another IT system.</p>
<p>So whilst all the buzz is about big data and the cloud, it&#8217;s important to keep our eyes on the main goal. Delivery of timely and accurate data into the warehouse, to support business decision making. The new options can enable benefits not possible a couple of years ago, if they are adopted with a coherent business strategy.</p>
<p>I was fortunate to be indoctrinated into the &#8216;data centric&#8217; view of IT very early in my career. But it has been a backseat for a long time. More of an &#8216;engine room&#8217; gig. I think in time, the legacy that the current focus on big data and cloud infrastructure movement will deliver, is the re-focusing of the industry onto that most valuable of business assets. Its &#8216;data&#8217;.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/06/11/superhung-out-todry/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>RTFM and the Software Dummy</title>
		<link>http://www.ericjlawson.co.uk/2012/06/05/rtfm-and-the-software-dummy/</link>
		<comments>http://www.ericjlawson.co.uk/2012/06/05/rtfm-and-the-software-dummy/#comments</comments>
		<pubDate>Tue, 05 Jun 2012 14:00:23 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[ETL]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Coding]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[RTFM]]></category>
		<category><![CDATA[the software dummy]]></category>
		<category><![CDATA[Y2K]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=175</guid>
		<description><![CDATA[Developing software is a rewarding career, but at times it can be tedious and very frustrating. The ability to debug your own code and that written by others is crucial. As those of us working in IT know, from the outside we are all bunched together, but within the industry the skillsets today are extremely [...]]]></description>
				<content:encoded><![CDATA[<p>Developing software is a rewarding career, but at times it can be tedious and very frustrating. The ability to debug your own code and that written by others is crucial.</p>
<p>As those of us working in IT know, from the outside we are all bunched together, but within the industry the skillsets today are extremely diverse. The pace of change is incredible and organisational change is a given, not something that occurs every now and then.</p>
<p>One thing that remains the same is coding syntax. It seems every year another language is deployed. It all conforms to logic, but blimey it can do your head in.</p>
<p>In the deep past of my career I had experienced significant protectionism of skills outside of work teams but today sharing and mentoring seem to be mainstream.</p>
<p>The IT Crowd, Episode 1, Series 1, featured Moss wearing a custom tshirt with the acronym RTFM. I first heard this one uttered in 1991 by Graham K in response to me asking one to many questions. It expands to Read The F#*k!ng Manual if you weren’t sure.</p>
<p>The second half of my post title, the Software Dummy, came to my attention whilst working on the Y2K bug (yes it really happened and was not a conspiracy). I was part of a team of 4 Tandem SQL / COBOL analysts developing a large star schema decision support system and we were coding for weeks on end. Frequently one of us would sidle over to an associates desk and start describing an intractable problem. So in this scenario you have the coder with the issue they are struggling with and the listener, the “software dummy” (or peer, mentor, sounding board etc).</p>
<p>Now many of you will know that resolving your own software bugs is a tricky business. Helping someone else resolve their software bugs is altogether another thing.</p>
<p>Firstly, you are most probably very busy with your own work. Secondly, even though it’s likely you have similar skills you probably aren’t familiar with the specific business logic or requirement. Thirdly, there is the matter of sustainability; do you catch them the fish or show them how to make their own rod (see RTFM).</p>
<p>What we observed during that Y2K project was the software dummy effect. It’s just human nature to ask someone else for help rather than picking up the manual and working it out yourself. And frankly the manual is often no use when you have a team of similarly experienced programmers and it’s a bug and not a matter of technique.</p>
<p>So what tended to happen is the software dummy would acknowledge the petitioner (the frustrated coder with the software bug) but continue to focus on the monitor in front of them. The coder would continue describing the bug in detail, relaying the events and conditions that have led to the current impasse. The software dummy might occasionally ask a question but usually it was enough to make appropriate noises giving the impression that we were actually listening to the problem description. Invariably we were not. We were trying to hold onto the thread of what we were immersed in.</p>
<p>However, in most cases the coder would have a eureka moment and exclaim “that’s it, I haven’t initialised the host variables for the …” or some other mind numbingly boring detail and return to their desk, grateful for the assistance.</p>
<p>We discussed this as a team and quickly realised we were all guilty of not really listening. However the results spoke for themselves. Clearly the coder with the bug was working out the problem through the process of describing it to someone else. The software dummy was born.</p>
<p>Many years later on and I have to admit I have tried this at home on my wife. I resorted to describing a coding problem to her (she is not an IT person) and she made all the right noises. If I had checked I am sure I would have seen her eyes glazing over, but I was on the hunt for that elusive bug. Eureka, problem solved. Careful with this though. It could shorten your prospects or those of your genetic line.</p>
<p>Back to the 21st century and I found myself software debugging once again. Recently I have begun coding ETL processes using SSIS and adapting Andy Leonard’s (<a href="https://twitter.com/#!/AndyLeonard" target="_blank">@andyleonard</a>) <a title="Download the SSIS framework" href="http://sqlblog.com/blogs/andy_leonard/archive/2012/03/07/updated-ssis-framework-now-includes-reports.aspx" target="_blank">SSIS framework</a> for deployment at a client site. I was grappling with the subtleties of defining flexible connection managers using expressions &amp; variables and design time versus batch execution time.</p>
<p>Consequently, I had loads of questions. I read the blog posts that Jamie Thompson (<a href="https://twitter.com/#!/jamiet" target="_blank">@jamiet</a>) and others had made on the topic of parent child operations. However I was really stuck on grabbing parent variable values from the child package. One minute I had it working and the next nada. The cause turned out to be simple but I burnt many hours getting there.</p>
<p>As I was working alone on this and often remotely from my home office, I had no one to bounce my issues off. Enter the internet, forums and bloggers. The support network available now has become distributed across the planet, rather than a bay of office cubicles. It’s very powerful, but at the same time there are new challenges.</p>
<p>Everyone still has their own workloads, but now its very likely your potential software dummys have different employers or work for themselves. @andyleonard &amp; @jamiet were extremely helpful and gave plenty of initial assistance but I could not nail the issue.</p>
<p>Contribution to the SQL community is a mainstay of the SQL MVP ethos. It appears to me that maintaining the balance between supporting the community, reputation, earning money and family time &amp; free time would be very challenging; especially in an age where many people in your network will be working and needing peer support across multiple time zones.</p>
<p>Forums have been around for a long while and I posted my issue on SQL Server Central.com. This is a great resource, but either my problem was too complicated or too obvious for anyone to address.</p>
<p>And I need to apologise to Andy and Jamie here, because it was all to easy for me to fire tweets and DM’s at them when I could not get through this issue.</p>
<p>However, this is where Twitter and to a degree forums have parallels to the software dummy process. I expect Jamie and Andy made appropriate noises whilst getting on with their own business, perhaps thinking, “I hope he works this out soon”. But for me knowing these guys would respond with a quick answer if there was one and the act of writing down the issues, especially the long dialog I had with the sqlservercentral.com forum, helped me work out the problem. It made me review and examine my testing method (remember I had it working and then lost it). It is this methodology of being able to repeat a process in a controlled manner that is essential to debugging.</p>
<p>So whilst the internet and always on exposes the mentor to potentially more and potentially constant interruption, the virtual team relationship can be managed, kept at arms length. The mentor can dip in and out of the work environment to suit their own needs. The coder needing help has a virtual sounding board with many people listening and I am willing to bet that the software dummy effect is alive and well and a member of every software developers team.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/06/05/rtfm-and-the-software-dummy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The DBA&#8217;s addiction</title>
		<link>http://www.ericjlawson.co.uk/2012/05/02/the-dbas-addiction/</link>
		<comments>http://www.ericjlawson.co.uk/2012/05/02/the-dbas-addiction/#comments</comments>
		<pubDate>Wed, 02 May 2012 16:28:15 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[disks]]></category>
		<category><![CDATA[HD]]></category>
		<category><![CDATA[IT Crowd]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[VM]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=153</guid>
		<description><![CDATA[I confess. I have an addiction. Recently I have observed the symptoms in a work associate (also a DBA). I cautiously broached the subject with him and he confessed with a big leering grin on his face. Absolutely hooked. The particular vice I refer to is the marvelleous hard disk drive. They are wonderful things, [...]]]></description>
				<content:encoded><![CDATA[<p>I confess. I have an addiction. Recently I have observed the symptoms in a work associate (also a DBA). I cautiously broached the subject with him and he confessed with a big leering grin on his face. Absolutely hooked.</p>
<p>The particular vice I refer to is the marvelleous hard disk drive.</p>
<p>They are wonderful things, these addictive objects of our desire. Shiny, varied, formatable, stackable, raidable and available from a multitude of sources at a price range to suit any budget.</p>
<p>However, like any other addiction, the craving for more comes again, usually sooner than one would like.</p>
<p>I remember once upon a time, when big 8&#8243; floppy disks were still in use, a genuine propellor head techie telling me they used to write disk access routines that would cause the big IBM washing machine sized disk cabinets to wobble walk across the computer room floor. What a hoot. Something to do with achieving the right sort of oscillation. I think the capacity of that early commercial spinning platter was 4 meg. I wish I had thought of the IT Crowd TV concept back then. The comedy material has been there for decades.</p>
<p>Nowadays us techie types are counting our storage in terabytes. Its all very justifiable, high speed connected storage vital to back up and maintain our Virtual Machines, video, music &amp; photo collections etc.</p>
<p>If you are a SQL Server DBA you could be talking orders of magnitude more for testing and evaluation, honing your skills before proposing the next range of improvments at work.</p>
<p>Nothing to do with the coloured LED lights and racing stripes. Whoops, sorry wrong group of people.</p>
<p>Just when we thought spin speeds and storage density were peaking, along came the SSD. Solid State. Has a ring to it doesn’t it. It was stuff of labs and scifi for so long. So was Dick Tracy’s wrist watch (check out the <a title="Pebble" href="http://getpebble.com/" target="_blank">pebble kickstarter project</a>).</p>
<p>None of this waiting around for 1 or 2 minutes for my MacBook Pro to boot up. No way. That’s for the average Joe. I cant afford to wait more that 20 seconds. I need to rush out the door at 5:30pm (<a title="Sheryl Sandberg Leaves Work at 5:30pm" href="http://mashable.com/2012/04/05/sheryl-sandberg-leaves-work-at-530/" target="_blank">its okay, its all right now cos Sheryl says so</a>) so that shutdown or VM suspend needs to fly.</p>
<p>Then there is the equivalent of the guys standing around your car with the hood up, admiring the chrome, the turbo charger, injectors etc.</p>
<p>In the end, it always comes down to speed and grunt. I wonder if neandertals compared their clubs in a similar fashion.</p>
<p>So our other half&#8217;s might think at best its an indulgence, at worst a complete waste of money. However, the skills acquired do pay dividends. Those in this fields who don&#8217;t keep up their professional proficiency risk their careers.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/05/02/the-dbas-addiction/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Developing SSIS solutions on OS X : Part1 &#8211; Introduction</title>
		<link>http://www.ericjlawson.co.uk/2012/04/20/developing-ssis-solutions-on-os-x-part1-introduction/</link>
		<comments>http://www.ericjlawson.co.uk/2012/04/20/developing-ssis-solutions-on-os-x-part1-introduction/#comments</comments>
		<pubDate>Fri, 20 Apr 2012 13:45:20 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[ETL]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Bootcamp]]></category>
		<category><![CDATA[COBOL]]></category>
		<category><![CDATA[Lion]]></category>
		<category><![CDATA[Matrox]]></category>
		<category><![CDATA[OS X]]></category>
		<category><![CDATA[SSIS]]></category>
		<category><![CDATA[VMWare Fusion]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=158</guid>
		<description><![CDATA[I need to state from the outset that this series of articles is not intended to sway anyone into choosing the Apple platform for SSIS development. Nor do I wish to enter into any debate about Apple hardware versus Windows based hardware. Been there, done that and I regularly lose on many fronts. If you [...]]]></description>
				<content:encoded><![CDATA[<p>I need to state from the outset that this series of articles is not intended to sway anyone into choosing the Apple platform for SSIS development. Nor do I wish to enter into any debate about Apple hardware versus Windows based hardware. Been there, done that and I regularly lose on many fronts.</p>
<p>If you are considering the switch or already have, then hopefully I can save you some time and provide some guidance on how to proceed with tooling up for SSIS (or general Windows) development on this platform.</p>
<p>My reasons for choosing the MacBook Pro and subsequently using it as a Windows / SSIS development platform are simple. I really like it. It’s quality equipment, performs well and reliably and despite a few annoying niggles, its very productive.</p>
<p>So, if you have decided to go down this route, or are considering the switch, then read on.</p>
<p>The iPhone and iPad are driving the enormous growth in Mac sales via the ‘halo’ effect. At the other end of this discussion, Microsoft is absolutely dominating the ETL space with SSIS. If you have arrived at this blog post through a route other than SSIS, ETL means Extract, Transform &amp; Load and SQL Server Integration Services (SSIS) is Microsoft’s solution.</p>
<p>In a nutshell, ETL is all about moving, consolidating &amp; transforming data to make it more useful and valuable.</p>
<p>I used Windows based PC’s to do my ETL consultancy work (COBOL/SQL) right up until Apple decided to use Intel chips in their hardware platforms. I had always wanted to have a Mac but couldnt justify have two separate platforms. Enter Bootcamp.</p>
<p>Bootcamp allows a Mac OS X user (Tiger 10.4 onwards) to boot their system into Windows. This was a big step forward. I started with a Core 2 Duo, white Macbook with 4gig of RAM. Six years later I am using a 15&#8243; MacBook Pro with 16gig of RAM, a 256gig SSD boot disk with a 750gb 7200rpm Momentus Hybrid hard drive in the DVD drive bay. This is all driven by OS X Lion (10.7) and allows me to run mulitple Windows XP and Windows 7 virtual machines on VMWare Fusion 4.1.1.</p>
<p>I also connect it to an Apple 27“ Thunderbolt display in my home office and to a 24” monitor (via a VGA adaptor) at a client site. The Apple Cinema / Thunderbolt displays are excellent. The resolution and image quality is so good. The Dell monitor is also fine, but not as good quality. However, there is a hefty price difference.</p>
<p>I was using two 24“ Apple Cinema displays connected via a Matrox DualHead2Go graphics device. This was great for parking SSMS in one screen and Visual Studio / BIDS in the other, with a bit of room for email and file explorer near the bottom. However, with the release of the Thunderbolt 27” monitor and its built in Ethernet, Thunderbolt and USB ports, I have gone back to the single, larger monitor. I keep the MBP screen open at the client site, using the 24&#8243; monitor, so I still have a parking area for Mail and file Explorer. The Matrox device works with the new Thunderbolt comms port, but it did bring with it a few issues reviving the external monitor from sleep conditions. Matrox are fairly good with releasing firmware and software updates. I also make heavy use of Spaces, enabling fast switching between Windows 7, Windows XP and OS X Lion.</p>
<p>So thats the platform I have evolved over the last few years.</p>
<p>I think this is superb SSIS development option. As a data architect, communicating ideas, stratagies and plans are as important as the design and build process. This dual OS platform gives me the best of both worlds.</p>
<p><strong>Next installment &#8211; Equipment and software list</strong></p>
<div class="woo-sc-twitter left"><a href="http://twitter.com/share" class="twitter-share-button" data-via="@ericjlawson"data-count="vertical">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script></div>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/04/20/developing-ssis-solutions-on-os-x-part1-introduction/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>SQL Server leads the (verti)paq</title>
		<link>http://www.ericjlawson.co.uk/2012/04/16/sql-server-leads-the-vertipaq/</link>
		<comments>http://www.ericjlawson.co.uk/2012/04/16/sql-server-leads-the-vertipaq/#comments</comments>
		<pubDate>Mon, 16 Apr 2012 11:00:34 +0000</pubDate>
		<dc:creator>eric</dc:creator>
				<category><![CDATA[BI]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[columnstore]]></category>
		<category><![CDATA[in-memory]]></category>
		<category><![CDATA[olap]]></category>
		<category><![CDATA[ralph kimball]]></category>
		<category><![CDATA[vertipaq]]></category>

		<guid isPermaLink="false">http://www.ericjlawson.co.uk/?p=131</guid>
		<description><![CDATA[Over the last few years, many large organisations have very likely been taking a serious look at the Microsoft Business Intelligence product stack. It’s a no brainer to give it a try as in general the licensing is free, but there are other offerings. Some big players out there simply attract attention because of their [...]]]></description>
				<content:encoded><![CDATA[<p>Over the last few years, many large organisations have very likely been taking a serious look at the Microsoft Business Intelligence product stack. It’s a no brainer to give it a try as in general the licensing is free, but there are other offerings.</p>
<p>Some big players out there simply attract attention because of their high price tags (it must be good right?). Informatica, Business Objects, Cognos, SAP and plenty of others offer the tantalising promise of end user BI solutions.</p>
<p>Recently there appears to have been significant activity from newer outfits deploying very compelling BI solutions, interfacing into SQL Server. They don&#8217;t all attempt to be all singing and dancing, but target specific needs within the BI arena. <a title="Varigence" href="http://www.varigence.com/products.html" target="_blank">Varigence</a> is one such example.</p>
<p>Whilst Oracle has been sitting on a <a title="System Change Number bug" href="http://www.infoworld.com/d/security/fundamental-oracle-flaw-revealed-184163-0" target="_blank">ticking bug</a>, possibly set to rival Y2K, IBM are what they are and niche vendors like Tandem NonStop SQL evoke questions like “who”, Microsoft has continued to roll out significant product improvements that genuinely raise the bar in corporate database solutions.</p>
<p>During a course I attended nearly a year ago, Ralph Kimball mentioned his “big data” white paper he had been commissioned to do by Informatic, casually dropping a few <strong>BIG</strong> company names and quoting outrageously <strong>BIGGER</strong> data volumes. It began to get airplay a few weeks later. Since then every vendor (and their best mate) have been dropping the “big data” buzzword.</p>
<p>Microsoft’s response has been to promote Vertipaq from SSAS into the core database engine. This is one of the biggest upgrades to the basic RDBMS infrasturcture in a very long. Previous enhancements across the industry have been focused on faster disk hardware, comms channels, RAM increases, very clever caching, DBA focused performance and tuning, SSD&#8217;s, virtualisation etc. I am sure there are plenty of other techniques and other vendors solutions won&#8217;t have been static.</p>
<p>But, ColumnStore goes right to the core of processing mind blowing datasets that are now becoming commonplace. Big data handling frameworks like MapReduce &amp; then Hadoop kick started the trend, but Microsoft has delivered it into the heart of BI. Clearly they have been cooking this up for some time. If you want to read up on it then get yourself the <a title="Introducing Microsoft SQL Server 2012" href="http://www.google.co.uk/url?sa=t&amp;rct=j&amp;q=sql%20server%202012%20ebook&amp;source=web&amp;cd=7&amp;ved=0CFwQFjAG&amp;url=http%3A%2F%2Fdownload.microsoft.com%2Fdownload%2FF%2FF%2F6%2FFF62CAE0-CE38-4228-9025-FBF729312698%2FMicrosoft_Press_eBook_Introducing_Microsoft_SQL_Server_2012_PDF.pdf&amp;ei=eleIT6nNLoP98QPtkPz5CQ&amp;usg=AFQjCNHknz_H0bgdMSS6f9BpV_IxmIxcQg&amp;sig2=6kyfBmpFX0OVT03xKERi2A" target="_blank">free ebook</a> released by Microsoft last week.</p>
<p><a title="Dave Campbell" href="http://www.linkedin.com/pub/david-campbell/4/33/10a" target="_blank">Dave Campbell</a>, a Microsoft Technical Fellow, dropped this article on us early last week about “The coming in-memory database tipping point”. It was absolutely fascinating reading, especially for someone who “grew up” on b-trees, block splits, file reorgs, file contention measurement etc, on a system with only 512meg of memory per each of 8 CPU’s (and some of that had to be allocated as disk cache). I still can&#8217;t believe we are prepping a BI server with 96gig of RAM.</p>
<p>From what David has indicated, we have a lot more developments in this area to look forward to over the coming decade. This is one snippet, lifted directly from his article, of whats in store</p>
<blockquote><p><em><a title="Article" href="http://blogs.technet.com/b/dataplatforminsider/archive/2012/04/09/the-coming-in-memory-database-tipping-point.aspx" target="_blank">Microsoft is also investing in other in-memory database technologies which will ship as the technology and opportunities mature. As a taste of what’s to come, we’re working on an in-memory database solution in our lab and building our real-world scenarios to demonstrate the potential. One such scenario, based upon one of Microsoft’s online services businesses, contains a fact table of 100 billion rows. In this scenario we can perform three calculations per fact – 300 billion calculations in total, with a query response time of 1/3 of a second. There are no user defined aggregations in this implementation; we actually scan over the compressed column store in real time.</a></em></p></blockquote>
<p>As John would say, &#8220;Thats awesome dude&#8221;.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<h2 id="withsqlserver2012hotoffthepressithinktheprospectsfortheyearaheadareexcellent."></h2>
]]></content:encoded>
			<wfw:commentRss>http://www.ericjlawson.co.uk/2012/04/16/sql-server-leads-the-vertipaq/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
