Convert multi-byte characters to single-byte in Python












I need to convert English letters and digits (0-9) that are represented as multi-byte characters into their single-byte equivalents. All other characters must remain unchanged. I can already do this with a combination of Python and a shell script, but I need to achieve the same result in Python alone, without calling any shell script.




Input: 1MORE , 360FLY , BCジャパン , デイテル・ジャパン



Output: 1MORE , 360FLY ,BCジャパン , デイテル・ジャパン
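To make the requested mapping concrete, here is a minimal sketch. It assumes that "multi byte" refers to the Unicode fullwidth forms of the digits and Latin letters (U+FF10 to U+FF19, U+FF21 to U+FF3A, U+FF41 to U+FF5A), which sit at a fixed offset of 0xFEE0 from their ASCII counterparts. The names `FULLWIDTH_TO_ASCII` and `to_halfwidth` are hypothetical and chosen only for illustration.

```python
# Assumption: the "multi byte" English characters are Unicode fullwidth forms.
# Each fullwidth digit/letter is exactly 0xFEE0 code points above its ASCII twin.
_RANGES = [
    (0xFF10, 0xFF19),  # fullwidth digits 0-9
    (0xFF21, 0xFF3A),  # fullwidth uppercase A-Z
    (0xFF41, 0xFF5A),  # fullwidth lowercase a-z
]
FULLWIDTH_TO_ASCII = {
    cp: cp - 0xFEE0 for lo, hi in _RANGES for cp in range(lo, hi + 1)
}


def to_halfwidth(text):
    """Replace fullwidth digits and letters; leave every other character as-is."""
    return text.translate(FULLWIDTH_TO_ASCII)


# The sample is written with escapes so the fullwidth characters are unambiguous.
sample = "\uff11\uff2d\uff2f\uff32\uff25 , \uff22\uff23ジャパン"
print(to_halfwidth(sample))
```

Because the table only contains digits and letters, katakana and punctuation such as "・" pass through untouched.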




The Python script calls the shell script once for every character it encounters.



Python script:



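import os
import subprocess
import shlex

ipfile = open('Brands.csv', 'r')
# Opening in 'w' mode truncates the output file; the shell script appends to it.
opfile = open('japan_tv_weekly_converted.csv', 'w', encoding='utf-8')
for line in ipfile:
    for character in line:
        utf8Character = character
        if utf8Character == '"':
            # A double quote has to be wrapped in single quotes for the shell.
            os.system('sh iconv_command.sh \'' + utf8Character + '\' \'' + character + '\'')
        else:
            os.system('sh iconv_command.sh "' + utf8Character + '" "' + character + '"')
    # Terminate the output line after each input line.
    os.system('printf "\n">>japan_tv_weekly_converted.csv')
opfile.close()
ipfile.close()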


Shell script:



#!/bin/bash
# $1: character to transliterate; $2: fallback written when iconv yields "?"
x=`echo -n $1|iconv -f utf-8 -t ascii//translit`
if [ "$x" != "?" ]; then
    echo -n $1|iconv -f utf-8 -t ascii//translit>>japan_tv_weekly_converted.csv
else
    echo -n $2>>japan_tv_weekly_converted.csv
fi
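The per-character logic of the shell script above (try an ASCII conversion, otherwise keep the original character) can be sketched in Python alone. This sketch substitutes `unicodedata.normalize("NFKC", ...)` for `iconv -t ascii//translit`. That is an assumption, not an exact equivalent: NFKC folds compatibility characters such as fullwidth letters and digits, but it does not transliterate accented letters the way iconv may. The function names `to_single_byte` and `convert_file` are hypothetical.

```python
import unicodedata


def to_single_byte(ch):
    """Return the NFKC form of ch if it is pure ASCII, otherwise ch unchanged."""
    folded = unicodedata.normalize("NFKC", ch)
    if all(ord(c) < 128 for c in folded):
        return folded
    return ch  # e.g. katakana: keep the original character


def convert_file(src_path, dst_path):
    """Copy src_path to dst_path, converting character by character."""
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write("".join(to_single_byte(ch) for ch in line))
```

With the file names from the question, the call would be `convert_file('Brands.csv', 'japan_tv_weekly_converted.csv')`. Checking each character separately keeps Japanese text untouched, at the cost of also folding other compatibility characters whose NFKC form is ASCII.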


Please help!










Tags: python, iconv, multibyte






asked Nov 15 '18 at 9:04 by Vivek
edited Nov 15 '18 at 9:07 by Sagar Zala























