About dict.fromkeys, key from filename, values inside file, using Regex












1















Well, I'm learning Python, so I'm working on a project that consists in passing some numbers of PDF files to xlsx and placing them in their corresponding columns, rows determined according to row heading.
The idea that came to me to carry it out is to convert the PDF files to txt and make a dictionary with the txt files, whose key is a part of the file name (because it contains a part of the row header) and the values be the numbers I need.
I have already managed to convert the txt files, now i'm dealing with the script to carry the dictionary. at the moment look like this:



import os
import re


p = re.compile(r'w+f+')
'''
I'm not entirely sure at the moment how the .compile of regular expressions works, but I know I'm missing something to indicate that what I want is immediately to the right, I'm also not sure if the keywords will be ignored, I just want take out the numbers
'''
m = p.match('Theese are the keywords' or 'That are immediately to the left' or 'The numbers I want')


def IsinDict(txtDir):
ToData = ()
if txtDir == "": txtDir = os.getcwd() + "\"
for txt in os.listdir(txtDir):
ToKey = txt[9:21]
if ToKey == (r"w+"):
Data = open(txt, "r")
for string in Data:
ToData += m.group()

Diccionary = dict.fromkeys(ToKey, ToData)
return Diccionary

txtDir = "Absolute/Path/OfTheText/Files"

IsinDict(txtDir)


Any contribution is welcome, thanks for your attention.










share|improve this question





























    1















    Well, I'm learning Python, so I'm working on a project that consists in passing some numbers of PDF files to xlsx and placing them in their corresponding columns, rows determined according to row heading.
    The idea that came to me to carry it out is to convert the PDF files to txt and make a dictionary with the txt files, whose key is a part of the file name (because it contains a part of the row header) and the values be the numbers I need.
    I have already managed to convert the txt files, now i'm dealing with the script to carry the dictionary. at the moment look like this:



    import os
    import re


    p = re.compile(r'w+f+')
    '''
    I'm not entirely sure at the moment how the .compile of regular expressions works, but I know I'm missing something to indicate that what I want is immediately to the right, I'm also not sure if the keywords will be ignored, I just want take out the numbers
    '''
    m = p.match('Theese are the keywords' or 'That are immediately to the left' or 'The numbers I want')


    def IsinDict(txtDir):
    ToData = ()
    if txtDir == "": txtDir = os.getcwd() + "\"
    for txt in os.listdir(txtDir):
    ToKey = txt[9:21]
    if ToKey == (r"w+"):
    Data = open(txt, "r")
    for string in Data:
    ToData += m.group()

    Diccionary = dict.fromkeys(ToKey, ToData)
    return Diccionary

    txtDir = "Absolute/Path/OfTheText/Files"

    IsinDict(txtDir)


    Any contribution is welcome, thanks for your attention.










    share|improve this question



























      1












      1








      1








      Well, I'm learning Python, so I'm working on a project that consists in passing some numbers of PDF files to xlsx and placing them in their corresponding columns, rows determined according to row heading.
      The idea that came to me to carry it out is to convert the PDF files to txt and make a dictionary with the txt files, whose key is a part of the file name (because it contains a part of the row header) and the values be the numbers I need.
      I have already managed to convert the txt files, now i'm dealing with the script to carry the dictionary. at the moment look like this:



      import os
      import re


      p = re.compile(r'w+f+')
      '''
      I'm not entirely sure at the moment how the .compile of regular expressions works, but I know I'm missing something to indicate that what I want is immediately to the right, I'm also not sure if the keywords will be ignored, I just want take out the numbers
      '''
      m = p.match('Theese are the keywords' or 'That are immediately to the left' or 'The numbers I want')


      def IsinDict(txtDir):
      ToData = ()
      if txtDir == "": txtDir = os.getcwd() + "\"
      for txt in os.listdir(txtDir):
      ToKey = txt[9:21]
      if ToKey == (r"w+"):
      Data = open(txt, "r")
      for string in Data:
      ToData += m.group()

      Diccionary = dict.fromkeys(ToKey, ToData)
      return Diccionary

      txtDir = "Absolute/Path/OfTheText/Files"

      IsinDict(txtDir)


      Any contribution is welcome, thanks for your attention.










      share|improve this question
















      Well, I'm learning Python, so I'm working on a project that consists in passing some numbers of PDF files to xlsx and placing them in their corresponding columns, rows determined according to row heading.
      The idea that came to me to carry it out is to convert the PDF files to txt and make a dictionary with the txt files, whose key is a part of the file name (because it contains a part of the row header) and the values be the numbers I need.
      I have already managed to convert the txt files, now i'm dealing with the script to carry the dictionary. at the moment look like this:



      import os
      import re


      p = re.compile(r'w+f+')
      '''
      I'm not entirely sure at the moment how the .compile of regular expressions works, but I know I'm missing something to indicate that what I want is immediately to the right, I'm also not sure if the keywords will be ignored, I just want take out the numbers
      '''
      m = p.match('Theese are the keywords' or 'That are immediately to the left' or 'The numbers I want')


      def IsinDict(txtDir):
      ToData = ()
      if txtDir == "": txtDir = os.getcwd() + "\"
      for txt in os.listdir(txtDir):
      ToKey = txt[9:21]
      if ToKey == (r"w+"):
      Data = open(txt, "r")
      for string in Data:
      ToData += m.group()

      Diccionary = dict.fromkeys(ToKey, ToData)
      return Diccionary

      txtDir = "Absolute/Path/OfTheText/Files"

      IsinDict(txtDir)


      Any contribution is welcome, thanks for your attention.







      python regex






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Nov 13 '18 at 8:39









      boandriy

      231214




      231214










      asked Nov 13 '18 at 8:13









      Oscar AsecasOscar Asecas

      62




      62
























          0






          active

          oldest

          votes











          Your Answer






          StackExchange.ifUsing("editor", function () {
          StackExchange.using("externalEditor", function () {
          StackExchange.using("snippets", function () {
          StackExchange.snippets.init();
          });
          });
          }, "code-snippets");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "1"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53276533%2fabout-dict-fromkeys-key-from-filename-values-inside-file-using-regex%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes
















          draft saved

          draft discarded




















































          Thanks for contributing an answer to Stack Overflow!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53276533%2fabout-dict-fromkeys-key-from-filename-values-inside-file-using-regex%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Florida Star v. B. J. F.

          Danny Elfman

          Retrieve a Users Dashboard in Tumblr with R and TumblR. Oauth Issues