lukestein’s Twitter Archive—№ 5,752

  1. TIL about @Stata’s strL (long string) variable type. WOW! Allows great storage savings when a var contains different-length strings (or many missing), especially if you also let -compress- “coalesce” repeated values: “recast strL yourvar”, then “compress”. 1/2 @JesseWursten/1222195621617995788
    1. …in reply to @lukestein
      E.g., first names from the 1900 census: “recast strL namefrst” shrinks data 26%; “recast strL namefrst” + “compress” shrinks data 31%. Note “compress” doesn’t find these savings unless you’ve manually recast to strL first, but *will* convert back to str if that saves space. 2/2
      1. …in reply to @lukestein
        btw I wound up finding this so useful that I made a little @Stata command that does this on one line: -strcompress- Installs in one line; see @lukestein/1222309851168010241