I want to extract data from this file. It has 59 bug reports, and I want to extract all of them with all their information, such as the date and sentences. This is a sample of one BugReport ID=5…
I tried using BeautifulSoup, but it is not working. How can we do this?
Preprocess the text file containing XML code
Hey @Rashmi007,
Can you please show 2 or 3 more such samples and point out exactly what you want to extract from this bug report?
Thank you.
This is the kind of text file I have. I want to extract the bug ID. Each bug contains a title, date, and sentences.
I want to extract each bug ID with all the other features and create a CSV.
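A minimal sketch of that kind of extraction, using only the Python standard library. The sample text and the tag names (`BugReport`, `Title`, `Turn`, `Date`, `Sentence`) are assumptions made for illustration, since the real file is not shown in the thread, so they would need adjusting to match it.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical sample: the tag and attribute names are assumptions,
# not taken from the real bug report file.
SAMPLE = """<BugReports>
  <BugReport ID="5">
    <Title>Crash on startup</Title>
    <Turn>
      <Date>2009-01-02</Date>
      <Sentence ID="5.1">The app crashes when opened.</Sentence>
      <Sentence ID="5.2">It worked in the previous build.</Sentence>
    </Turn>
  </BugReport>
</BugReports>"""

FIELDS = ["bug_id", "title", "date", "sentence_id", "sentence"]


def extract_rows(xml_text):
    """Return one dict per sentence, tagged with its bug ID, title and date."""
    root = ET.fromstring(xml_text)
    rows = []
    for bug in root.iter("BugReport"):
        bug_id = bug.get("ID")
        title = (bug.findtext("Title") or "").strip()
        for turn in bug.iter("Turn"):
            date = (turn.findtext("Date") or "").strip()
            for sentence in turn.iter("Sentence"):
                rows.append({
                    "bug_id": bug_id,
                    "title": title,
                    "date": date,
                    "sentence_id": sentence.get("ID"),
                    "sentence": (sentence.text or "").strip(),
                })
    return rows


def rows_to_csv(rows):
    """Serialise the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = extract_rows(SAMPLE)
print(rows_to_csv(rows))
```

If the text file holds several `BugReport` blocks without a single enclosing element, wrapping the file contents in one root tag before calling `ET.fromstring` is usually enough to make it parse.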
Can you please send a text file containing a single sample, like the one you provided in the image above?
Hey @Rashmi007,
Kindly provide access to the bug report file on Drive so that I can view it, and send a message here when it is done.
Access provided. Please have a look…
Hey @Rashmi007,
Have a look at the JSON file here.
I have added the Python script used to create it. Have a look at it too; if there is something you don't understand, you can ask me.
Thanks a lot…
Yes, I have seen that file. It will help me a lot. Actually, I want this data in CSV format for further processing.
Or, if I want to apply TF-IDF, can I continue with the JSON format?
You can create a CSV file from it directly, for example with pandas: try loading it with pd.read_json and writing it out with DataFrame.to_csv. It might work, or it might raise a format error.
Otherwise, I have provided you with the code; you can change it any way you like.
If anything is confusing, I am here to help.
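If the direct pandas route does fail on nested records, flattening the JSON by hand is one fallback. This is only a sketch: the JSON layout and the key names (`id`, `title`, `sentences`, `date`, `text`) are hypothetical, because the actual file from the thread is not shown here.

```python
import csv
import io
import json

# Hypothetical JSON layout; the real file's keys may differ.
SAMPLE_JSON = """[
  {"id": "5", "title": "Crash on startup",
   "sentences": [
     {"date": "2009-01-02", "text": "The app crashes when opened."},
     {"date": "2009-01-03", "text": "It worked in the previous build."}
   ]}
]"""

FIELDS = ["bug_id", "title", "date", "sentence"]


def flatten(bugs):
    """Turn nested bug records into one flat row per sentence."""
    rows = []
    for bug in bugs:
        for sentence in bug.get("sentences", []):
            rows.append({
                "bug_id": bug.get("id"),
                "title": bug.get("title", ""),
                "date": sentence.get("date", ""),
                "sentence": sentence.get("text", ""),
            })
    return rows


def to_csv_text(rows):
    """Write the flat rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


flat_rows = flatten(json.loads(SAMPLE_JSON))
csv_text = to_csv_text(flat_rows)
print(csv_text)
```

For TF-IDF, the sentence strings are the only input that matters, so they could probably be taken from either the JSON records or the CSV column.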

