• refalo@programming.dev
    link
    fedilink
    arrow-up
    48
    ·
    edit-2
    3 days ago

    If like me you were wondering if MS actually provided their own parsers for their Office file formats… they did not.

    It seems to just be a bunch of random pyxyz 3rd-party support libraries all mashed together.

    • mormund@feddit.org
      link
      fedilink
      arrow-up
      11
      ·
      2 days ago

      What do you mean by parser? Office docs are just zipped XML files. They are trivial to parse. The hard part is all the quirks the document renderers have, which makes it impossible to perfectly match the output. But markdown can’t handle any complex formatting anyway

    • Sibbo@sopuli.xyz
      link
      fedilink
      arrow-up
      6
      ·
      3 days ago

      Maybe the people that wrote their parser have left the company? Typical big software corp problem.

      • GissaMittJobb@lemmy.ml
        link
        fedilink
        arrow-up
        4
        ·
        3 days ago

        I mean, the parser would still be there even if the people left the company, right? The source code remains.

        • Creat
          link
          fedilink
          arrow-up
          3
          ·
          2 days ago

          It might also be somewhere, but nobody knows where.