Typology of Text Manipulation (for Center for Literary Computing, WVU):   
    Alan Sondheim 
    sondheim at panix.com
       
    Mon Jun 14 05:26:17 CEST 2004
    
    
  
Typology of Text Manipulation (for Center for Literary Computing, WVU):
Thanks Sandy Baldwin -
We need to implement with a simple interface if possible:
Please note many of the following overlap -
Input: Text of any sort
          keep spacings / eliminate spacings
          keep tabs / eliminate tabs
Input placed within: single file
                     array
                     hash
Output: single file of manipulated text
         with original file intact
        "core dump" of file with protocols / processes attached
Manipulations: substitution (similar to awk)
               word for word
               letter for letter
               line for line
               etc.
               In other words: Y for X
               reverse line (similar to rev)
               reverse file
               reverse words (words in reverse "reverse in words")
               reversed words ("sdrow ni esrever")
               reversed lines (etc.) (similar to tac)
Eliminations:  first instance word / letter / line < rest eliminated
Functions (Doublings etc.): Given X, then f(X)
Fields: Reordering fields (similar to awk)
Filtering lines / words / letters: (similar to grep)
Randomizing: Generating (lines / words / letters) > texts
              Various Grammars
              Various Lists (nouns / verbs / adverbs / etc.)
Randomizing: Filtering (ability to set parameters - for example:
              randomizing field order
              randomizing elimination order)
              etc.
Topological: Folding Texts
             Splitting Texts
             Joining Texts (end to end / side by side)
             Internal block removal (see emacs)
             "Crumpling" Texts
Numeric and code: Translating texts into hex, binary, octal, digital
                    f(X) on hex, binary, octal, digital
                  Translating full text file / inodes etc.
                  Codes - see /usr/games Unix/Linux directory:
                    Morse / figlets
                    Caesar (rot 13 as special example)
                    pig / bcd / ppt / banner
Later: Embedded Babel etc. translation
       Pictographic translation (Dongba, hieroglyphic, etc.)
_
    
    
More information about the Syndicate
mailing list