QuicksearchContentsthe tree![]() Blog Administration |
Saturday, April 19. 2008
DiffEngineX Used To Compare Over ... Posted by Martin
in DiffEngineX at
14:31
Comments (0) Trackbacks (0) DiffEngineX Used To Compare Over 750, 000 Rows
Last week a customer used DiffEngineX to compare two Excel 2007 spreadsheets each with at least 750, 000 rows. The comparison took approximately 4 hours 30 minutes to complete. The customer confirmed they were pleased with the results. Align Rows was turned on. We generally recommend the data should be pre-sorted in Excel using functionality available from its Data tab or menu before a comparison, if necessary.
We have compared spreadsheets with a million rows ourselves and seen results in a couple of minutes. So why do some comparisons take hours and some only minutes? The answer is that DiffEngineX is much faster when the number of differences buried in a million rows is small, rather than large. With a large number of differences, more blank rows have to be inserted to get the data to line up and this is a time-consuming, Excel mediated operation. This is the first report of DiffEngineX being used with such a large amount of data. The key points are that the comparison may take hours and that you should ensure the Align Rows feature is selected. Unless your data is guaranteed to be in sorted order, you should get Excel to sort it before a comparison. (This does not apply when comparing formulae based models.) In the days of Excel 2003, one customer told us he used DiffEngineX to compare two worksheets containing 50, 000 rows a piece. With such large amounts of data, it is probably difficult to spot the differences in the colour highlighted worksheets. That is why we recommend using the Extra dialog to turn on the Hide Matching Rows feature. Friday, April 18. 2008The Case For Upgrading To The Latest Version
When we updated DiffEngineX to work with Windows Vista and Excel 2007, we internally re-factored the software to be faster and to take up less memory. We needed to do this because of the greatly increased upper row limit in Excel 2007.
However as a consequence it made DiffEngineX faster at comparing spreadsheets when run on Windows XP with previous versions of Excel as well. If your spreadsheets contain tens of thousands of rows you will notice the difference. Since the first release of DiffEngineX we have added the following features:
We have also added a whole host of other small changes. Although in general we offer free updates, if you are using Version 1.28 or lower and wish to upgrade to the latest version we make a small charge. Thursday, December 20. 2007VeriSign Code Signing Digital ID
We purchased a 3 year code signing digital id from VeriSign this week. Our previous one was about to expire.
Digital ids allow software publishers to digitally sign the applications they produce. Code signing provides a guarantee that the software has not been tampered with or altered in transit and that the software is produced by who says they produce it. Windows Vista warns users that unsigned software comes from an unknown publisher when installing it. Wednesday, November 7. 2007Powerful DiffEngineX Options
DiffEngineX's Options dialog contains several useful features.
The two most powerful are Color in red precisely the parts of formulae and text constants that differ and Compact like changes with contiguous. The effects of turning the first option on are shown below. ![]() The differences between two spreadsheet cells are highlighted using the colour red on the difference report. Additionally cells will be made larger so that lengthy amounts of text can be easily read. Compact like changes with contiguous is intended for workbooks that contain blocks of equivalent formulae. Equivalent formulae are different in the sense that they reference different cells, but the different cells are the same relative offsets away. Financial models often contain blocks of tens to hundreds of equivalent formulae which are modified as a whole unit. When this option is turned on, only one change will be reported on the difference report, instead of hundreds of individual changes. Sunday, September 30. 2007
DiffEngineX Also Available As Excel ... Posted by Martin
in DiffEngineX at
14:02
Comments (0) Trackbacks (0) DiffEngineX Also Available As Excel Add-In
The main version of DiffEngineX is a stand-alone executable that runs outside of Microsoft Excel. This is what the vast majority of users download.
DiffEngineX is also available as an Excel add-in. It has slightly different functionality. Over the past year, one customer needed DiffEngineX to compare two Excel workbooks, which use a third-party add-in to dynamically fetch database information. When the executable form of DiffEngineX compared the workbooks, it did so without loading the third-party add-in and as such did not give the desired results. However the add-in version of DiffEngineX did load the third-party add-in before comparing the worksheets. It gave the desired results. The add-in version is available on request. Friday, February 2. 2007
Making DiffEngineX Work With Excel 2007 Posted by Martin
in DiffEngineX at
14:06
Comments (0) Trackbacks (0) Making DiffEngineX Work With Excel 2007
One of the major differences between Excel 2007 and 2003, it that the maximum number of rows has increased from 65, 536 to 1, 048, 576. The maximum number of columns has increased from 256 to 16, 384.
We have to assume that at some point a user will try to compare two spreadsheets containing a million rows. I compared two sample spreadsheets each containing a million rows with a slightly modified copy of DiffEngineX (Version 1.27). A computer with 3.5 Gb of physical memory was used. The old version of DiffEngineX worked away until it ran out of memory! To solve this problem, I made the internal data structures used by DiffEngineX more memory efficient. Aligning a million rows is on a very different scale to aligning 50, 000 rows and so at the same time I made the algorithms faster as well. One small part of DiffEngineX was taking quadratic time and its inefficiency was made glaringly obvious when trying to deal with a million rows. Fortunately it was very easy to convert it to take linear time. The new version of DiffEngineX can compare a million rows in a time measured in minutes, not hours. However the actual insertion of a blank row in order to line up existing rows requires DiffEngineX to make a call to Excel. This is intrinsically slow and so there may be pathological cases that will take longer. As a consequence of making DiffEngineX work with Excel 2007, it now works faster with Excel 2003 as well. I should point out that for most users a comparison of even a million rows should take minutes. If it is taking hours then it is a fair bet that the user is missing out a step. If you are comparing rows of data, as opposed to formulae based models, then you must get Excel to sort your data first in both workbooks. Sort functionality can be found under Excel's Data menu or tab. Then re-save your Excel workbooks. When you start up DiffEngineX make sure the Align Rows box is checked. I should point out that a prior sorting step is not always needed. DiffEngineX will insert blank rows to get existing rows to line up (and therefore not be flagged as different), but it will not reorder rows. That is why they must be sorted first. Data imported into Excel from databases and third party applications will likely need sorting first. If the data originates in the form of Excel workbooks, it probably does not require sorting. Please refer to the below tutorial for full details. Tutorial: How to Compare Two Excel Lists Friday, February 2. 2007
Making DiffEngineX Work With Windows ... Posted by Martin
in DiffEngineX at
10:32
Comments (0) Trackbacks (0) Making DiffEngineX Work With Windows Vista
Windows XP allows the creation of both standard and administrative users. When logged in as a standard user, to some extent your computer is protected machine wide modifications, some of which may be harmful.
However most Windows XP users always log in as an administrator and so the advantages of running as a standard user are lost. Windows Vista runs applications by default as a standard process, even when the user is logged in as an administrator. If an application or control panel requires administrative privileges the user has to explicitly allow it. Thus the computer is more protected against harmful machine wide changes. Software has to be modified so that when it runs on Windows Vista it does so as a good citizen and does so explicitly rather than taking advantage of Vista's backwards compatibility. To this end, a manifest has to be added. This manifest tells Vista that the application is aware of the change in policy and also what security level it should be run at. DiffEngineX could already run as a standard process and so the changes it needed to run on Vista were minimal. DiffEngineX was submitted to VeriTest for testing and was awarded the Microsoft Certified for Windows Vista logo. Application List: Certified for Windows Vista ![]() In order to gain the logo, DiffEngineX had to pass many tests. One of which is that its uninstaller has to contain a dialog box that informs the user that DiffEngineX cannot be uninstalled if it is currently being used! Unfortunately Visual Studio 2005 Professional does not add this dialog box by default. The information on how to add this dialog box was generously provided on one Microsoft employee blog.
« previous page
(Page 1 of 1, totaling 7 entries)
next page »
|
Calendar
ArchivesCategoriesSyndicate This Blog |
|||||||||||||||||||||||||||||||||||||||||||||||||

