Dec 21

Non-English Excel Exports

In case you are also a non-English user, you may be interested in the following information:

This is a great bit of software, thanks. I’m using the 1.3.3 installer. The release notes for 1.3.0 mention that some characters display strangely in Excel. I saw this in spades when importing French and German text into Excel 97, but with an up-to-date version of OpenOffice Calc everything was fine, provided I set the character set to UTF-8 on import.
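
For readers who post-process an export themselves, here is a minimal Python sketch of one common workaround: writing the CSV as UTF-8 with a byte-order mark, which newer Excel versions generally use to detect the encoding. This is not how DaleyKlippings writes its files; the helper names, the file name, and the sample rows are all made up for illustration, and very old versions such as Excel 97 may still show the text incorrectly.

```python
import csv
import os
import tempfile

# Made-up sample rows with French and German characters.
rows = [
    ["Book", "Text"],
    ["Les Misérables", "Il dort. Quoique le sort fût pour lui bien étrange"],
    ["Faust", "Grau, teurer Freund, ist alle Theorie"],
]


def write_csv_with_bom(path, rows):
    """Write rows as UTF-8 CSV; 'utf-8-sig' prepends a byte-order mark."""
    with open(path, "w", encoding="utf-8-sig", newline="") as handle:
        csv.writer(handle).writerows(rows)


def read_csv(path):
    """Read the CSV back; 'utf-8-sig' strips the byte-order mark if present."""
    with open(path, "r", encoding="utf-8-sig", newline="") as handle:
        return list(csv.reader(handle))


# Hypothetical output location in a temporary folder.
path = os.path.join(tempfile.mkdtemp(), "clippings.csv")
write_csv_with_bom(path, rows)
print(read_csv(path) == rows)
```

In spreadsheet programs that ask for a character set on import, such as OpenOffice Calc, choosing UTF-8 by hand remains the safer route.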

Jun 20

DaleyKlippings v1.3.2

    This version includes several core improvements to better support other operating systems (especially OSX) and several minor fixes to new features in 1.3.1:

    • CSV Export uses the Date Interpreter setting to format output
    • All of the default patterns have been reviewed and any issues corrected. This should not affect working patterns.
    • Settings dialog will default to the ‘Application’ page
    • In Settings dialog, the default Import and Export Patterns will be automatically displayed
    • The program will now place ‘settings.txt’ in a User’s Application Data folder for non-portable installations:
      • If you’re migrating from a previous version, your current settings should be copied automatically.
      • The precise location of this folder varies by OS.
      • This will improve support for systems (like OSX) that do not allow an application to write to the folder that contains the program executable.
    • On the back-end, PyQt4 has been replaced by PySide.
      • PySide can be installed with pip, which improves compatibility with virtualenv.
      • As an added bonus, it provides a more flexible licensing arrangement.
      • If you experience anything odd with the new UI, please report it!  I don’t expect any major issues, but I have a brief window where I can reverse this change if it causes widespread problems.
    • DaleyKlippings icon has been implemented in more places (installer, uninstaller, main window, taskbar, etc.)
    May 21

    DaleyKlippings v1.3.1

      This version includes several minor fixes to new features in 1.3.0:

      • Dates will now display (not just edit) in the local format
      • When the date language is changed, the editor (and corrected view) will now update
      May 05

      DaleyKlippings v1.3.0

        This version includes several significant improvements:

        • Import Localization – this version significantly improves support for non-English versions of the Kindle.
          • The date-interpreter is now able to accept localized dates (i.e. where months and days-of-week are in a local language).  Under Settings, use the “Date Interpreter” dropdown to select the language you wish to use.  NOTE:  In some cases, date patterns need to be updated to ignore utility words, like the Spanish ‘de’, that may appear in the date.
          • Some localized Kindles do not use the hyphen to indicate a page range. Instead, they use a connecting word, as in “152 a 167”.  The “Range Divider” option in the settings permits users to specify a custom connector for location and page ranges.  NOTE:  The default pattern for Page and Location only matches hyphens and numbers.  Additional characters must be added to this pattern if you are using this feature.
          • Sample patterns for Spanish and German are included.  NOTE:  These patterns will only function correctly if all of the language settings are updated.
          • The Date Language setting is also used to determine the way date and time are displayed in the main table.
        • CSV Import/Export – This version offers CSV import and export using a fully featured CSV module instead of the Import/Export Pattern system.
          • CSV Export was always possible, but this should improve the stability and reliability of the process.
          • CSV Import was difficult or impossible using an Import Pattern.  This new approach should address most of the potential issues.
          • This system has been tested with Excel 2010 and is likely to work with Excel 2007 or newer.  Some older versions of Excel do not correctly handle the file type used. Users report that the newest version of OpenOffice Calc does not exhibit these issues.
          • Even in newer versions, some characters displayed strangely in Excel.  In my testing, the characters were not corrupted and looked fine once the file was imported back into DaleyKlippings (even if other parts of the file were changed).
          • While this feature should be a significant improvement on the existing system, it’s impossible to anticipate or test every circumstance.  If you experience any issues with the system, please contact us.
        • Upgrade Preserves Custom Settings – this version uses separate files for default and personalized settings.  This allows you to install a new version of DaleyKlippings over an old version without losing your customizations.  In previous versions, you had to manually backup and restore your settings file.
          • A side-effect of this change is that Patterns are no longer permanently deleted.  If you have not created a new pattern with the same name, it can be recovered by manually modifying the “settings.txt” file.
          • NOTE:  If you create another pattern with the same name, the old pattern will get overwritten.
        • Backup Settings – To minimize the odds of data loss, this version writes both a “settings.txt” and “settings.bak”.  Even if one of the files gets corrupted during an abnormal exit, the other should not.
        • New Version Numbering – From this point forward, DaleyKlippings will use three version numbers: <major>.<minor>.<release>.
          • Major versions would involve significant (and usually not-backwards-compatible) changes to the architecture.  This is unlikely to occur while this remains a side project of mine.
          • Minor versions indicate additional features and are recommended.  The mailing list will get notifications about these versions.
          • Release versions are reserved primarily for bug fixes.  If you aren’t experiencing any of the bugs included in the release, upgrading isn’t critical. The mailing list will only get a notification for these releases if the bug is impacting a large number of users (whether or not the users realize it).
          • To ensure that all systems recognize this version as an upgrade, it uses the version number 1.3.0.
        • Bug Fixes – This version includes a few minor bug fixes
          • The Author column should no longer use the Type dropdown
          • The Type column should correctly use the Type dropdown
          • Both the Location and Page fields should enforce the location patterns
          • Fixed default Delimiter to use “==========” instead of “\r\n” to better reflect Kindle defaults.
          • If you have localized the terms for “Highlight” and/or “Note”, matching between highlights and notes should be fully functional.
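
The Range Divider note in the Import Localization item can be illustrated with a small Python sketch. It builds a Page or Location group that accepts a custom connecting word between two numbers. The `range_pattern` helper and the sample strings are hypothetical; the shipped default patterns use `[\d-]*`, and this is only one possible way to extend them.

```python
import re


def range_pattern(field, divider="-"):
    """Hypothetical helper: named group matching 'N' or 'N <divider> M'."""
    return r"(?P<%s>\d+(?:\s*%s\s*\d+)?)" % (field, re.escape(divider))


# A localized range that uses the connecting word "a" instead of a hyphen.
page_re = re.compile(range_pattern("Page", "a"))
page_match = page_re.search("en la página 152 a 167 | Añadido")
print(page_match.group("Page"))

# The default hyphen divider still works with the same helper.
loc_re = re.compile(range_pattern("Location"))
loc_match = loc_re.search("Location 152-167")
print(loc_match.group("Location"))
```

A group built this way could replace the `(?P<Page>[\d-]*)` part of a custom import pattern, but test it against your own clippings file before relying on it.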
        Jan 05

        DaleyKlippings v1.02

          This version includes several improvements and bug fixes:

          • Fixed a bug preventing the system from importing a Note that includes multiple lines.
          • Fixed an error in CommaSafe
          • Upgraded import patterns to better handle titles with parentheses (followed by authors with parentheses).  Titles with parentheses but no author will continue to be processed incorrectly as DaleyKlippings will assume the parentheses surround the author.
          • Improved Author Match
          • Updated several instances where the program still showed the old (Klippings) name.
          Jan 15

          DaleyKlippings v1.0

            With over 100 downloads of v0.7 and no new issues reported, I am removing the “beta” tag from the program.  I’d also like to extend a special thanks to everyone who has donated!

            • There are no functional changes in v1.0
            • Kindle Paperwhite Import Patterns are included

            If you want the Paperwhite patterns and don’t want to bother with the new software (since it currently overwrites preferences), the pattern can be viewed at the Import Pattern for Kindle Paperwhite post.

            Jan 03

            Import Pattern for Kindle Paperwhite 5.3.1

            This pattern addresses another date-time issue.  A file from the Paperwhite was provided that includes date-times in the following format:

            Added on Monday, 23 April 12 22:51:41

            While this is roughly the same date-time pattern found in the previous GMT example, it was necessary to make more significant changes to the expression around the Date tag.  In previous patterns we could depend on AM/PM or GMT to clearly indicate the last characters in the date-time.  This pattern has no obvious terminator.  Instead, we have the Date tag capture everything up to the end-of-line characters “\r\n”.  The regex character class [^…] matches any single character except those listed in place of the ellipsis, so [^\r\n]* consumes everything up to, but not including, the line break.
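
To see the end-of-line capture in isolation, here is a short Python sketch that applies only the Date part of the expression to a made-up clipping line. The sample text is an assumption for illustration and is not taken from a real file.

```python
import re

# Made-up clipping fragment; the date has no AM/PM or GMT terminator.
sample = (
    "- Your Highlight on Page 12 | Added on Monday, 23 April 12 22:51:41\r\n"
    "\r\n"
    "Some highlighted text\r\n"
)

# [^\r\n]* keeps consuming characters until it reaches a carriage return
# or a line feed, so the capture stops at the end of the line.
match = re.search(r"Added\ on\ (?P<Date>[^\r\n]*)", sample)
print(match.group("Date"))
```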

            Notes delimiter:

            ==========

            Notes pattern:

            # Import notes and highlights from "My Clippings.txt" and ignore
            # bookmarks. Warnings with information on ignored bookmarks
            # will be added to the log - this is the app normal behaviour
            
            # Note that VERBOSE and UNICODE options are always on
            
            ^\s* #
            (?P<Book>.*?) # Book name
            (\s*\((?P<Author>[^\(]*)\))? # Author name (optional)
            \s*-\ Your\ #
            (?P<Type>(Highlight|Note|Bookmark)) # Clipping type - 'Highlight', 'Note' or 'Bookmark'
            (\ on\ Page\ #
            (?P<Page>[\d-]*)\ \|)? # Page (optional)
            (.*(Location|Loc\.)\ #
            (?P<Location>[\d-]*))? # Location (optional)
            .*?Added\ on\ #
            (?P<Date>([^\r\n]*)) # Date & time
            \s* #
            (?P<Text>.*?) # Text
            \s*$ #

            Date Format:

            %A, %d %B %y %H:%M:%S
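
As a quick check of the Date Format above, this Python sketch parses the sample date-time with the standard library. It assumes an English locale, since %A and %B are matched against the current locale's day and month names, and it says nothing about how the Date Interpreter setting handles other languages.

```python
from datetime import datetime

DATE_FORMAT = "%A, %d %B %y %H:%M:%S"

# The sample date-time quoted earlier in this post.
parsed = datetime.strptime("Monday, 23 April 12 22:51:41", DATE_FORMAT)
print(parsed.isoformat())
```

The two-digit %y is read as a year in the current century here, so “12” becomes 2012.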

            Encoding:

            UTF-8 (all languages)
            Dec 21

            Import Pattern for Kindle 3

            The original Kindle 3 pattern had a bug so this improved pattern was included in v0.7.  Note that this version excludes bookmarks.

            Notes delimiter:

            ==========

            Notes pattern:

            # Import notes and highlights from "My Clippings.txt" and ignore
            # bookmarks. Warnings with information on ignored bookmarks
            # will be added to the log - this is the app normal behaviour
            # Note that VERBOSE and UNICODE options are always on
            ^\s*                         #
            (?P<Book>.*?)                # Book name
            (\s*\((?P<Author>[^\(]*)\))? # Author name (optional)
            \s*-\                        #
            (?P<Type>(Highlight|Note))   # Clipping type - 'Highlight' or 'Note'
            (\ on\ Page\                 #
            (?P<Page>[\d-]*)\ \|)?       # Page (optional)
            (.*(Location|Loc\.)\         #
            (?P<Location>[\d-]*))?       # Location (optional)
            .*?Added\ on\                #
            (?P<Date>(.*)(AM|PM))        # Date & time
            \s*                          #
            (?P<Text>.*?)                # Text
            \s*$                         #
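
For anyone who wants to try the pattern outside DaleyKlippings, here is a minimal Python sketch that runs it against a made-up clipping. The VERBOSE and UNICODE flags follow the comment in the pattern; whether the program sets any other flags is not known, and the sample text is invented for illustration.

```python
import re

KINDLE3_PATTERN = r"""
^\s*                         #
(?P<Book>.*?)                # Book name
(\s*\((?P<Author>[^\(]*)\))? # Author name (optional)
\s*-\                        #
(?P<Type>(Highlight|Note))   # Clipping type - 'Highlight' or 'Note'
(\ on\ Page\                 #
(?P<Page>[\d-]*)\ \|)?       # Page (optional)
(.*(Location|Loc\.)\         #
(?P<Location>[\d-]*))?       # Location (optional)
.*?Added\ on\                #
(?P<Date>(.*)(AM|PM))        # Date & time
\s*                          #
(?P<Text>.*?)                # Text
\s*$                         #
"""

# Made-up clipping, already split on the "==========" delimiter.
SAMPLE = (
    "My Book (Jane Doe)\r\n"
    "- Highlight Loc. 152-67 | Added on Friday, December 21, 2012, 10:15 PM\r\n"
    "\r\n"
    "Some highlighted text\r\n"
)

match = re.match(KINDLE3_PATTERN, SAMPLE, re.VERBOSE | re.UNICODE)
names = ("Book", "Author", "Type", "Page", "Location", "Date", "Text")
fields = {name: match.group(name) for name in names}
print(fields)
```

Optional groups that do not take part in the match, such as Page in this sample, come back as None.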

            Date Format – This field is left empty because the default matching pattern works with everything we tested.

            
            

            Encoding – We select utf-8 even though most files aren’t encoded this way.  As of v0.7, the system automatically tries utf-16 and windows-1252 if the configured encoding fails, so selecting utf-8 catches any file that happens to be encoded in utf-8 before falling back on utf-16 and windows-1252.

            UTF-8 (all languages)
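
The fallback order described under Encoding can be sketched in Python as below. This is an illustration of the idea only, not the program's actual import code; the `decode_with_fallback` name and its parameters are hypothetical. One caveat: a plain utf-16 attempt can succeed on bytes that are not really UTF-16 and return garbage, so a real implementation would likely also check for a byte-order mark.

```python
def decode_with_fallback(data, preferred="utf-8",
                         fallbacks=("utf-16", "windows-1252")):
    """Hypothetical helper: try the preferred encoding, then each fallback.

    Returns the decoded text together with the encoding that worked.
    """
    for name in (preferred,) + tuple(fallbacks):
        try:
            return data.decode(name), name
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings could decode the data")


# Made-up byte strings, one per encoding in the fallback chain.
print(decode_with_fallback("héllo".encode("utf-8")))
print(decode_with_fallback("héllo".encode("utf-16")))
print(decode_with_fallback("café!".encode("windows-1252")))
```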
            Dec 21

            DaleyKlippings v0.7

              This version includes several improvements and bug fixes:

              • Kindle 3 pattern upgraded to use all new fields
                • Bug in old Kindle 3 pattern eliminated as a result
              • Improved feedback for Note-Highlight matches
              • Improved logic for file imports which should reduce/eliminate BOM errors
              • Improved error reporting when the default file encoding (for imports) is not used