Preprocee the text file containing XML code

I want to do extract data from this file…Like it has 59 bug reports and i want to extract all these bugs with all the information like Date Sentences…This is the sample of one BugReport ID=5…
I tried using Soup.but it is not working.How we can do this…

"(402115) Mozilla - opening and closing menulist in edit bookmark popup dismisses popup but shouldn t" '2007-11-01 14:29:46' 'mcdavis941' user-agent: mozilla/5.0 (windows windows nt 5.1 en-us rv:1.9a9pre) gecko/2007103103 minefield/3.0a9pre build identifier: mozilla/5.0 (windows windows nt 5.1 en-us rv:1.9a9pre) gecko/2007103103 minefield/3.0a9pre if you double-click the places star button, the edit bookmarks popup comes up if you use the menulist to pick a directory, the popup goes away, but it should wait until the user clicks done or delete reproducible: always '2007-11-01 15:00:55' '' confirming the first time i click on the menu list drop down button (within the bookmark popup) it opens and i see all the choices

hey @Rashmi007 ,
can you please show 2 or 3 more such samples and point out what exactly you want to extract from this bug report.

Thank You .


This is the kind of text file.I want to extract Bug id…Each but contained Titile ,Date,Sentences…
I want to extract each bugID with all the other features and want to create a CSV.

can you please send a text file containing a single sample , the way you have provided me in the image above

BugReport

hey @Rashmi007 ,
kindly provide access to the drive bug report file so that i can view it. and do send a message here when you get it done.

access provided.Please have a look…

hey @Rashmi007 ,
have a look at the json file here
i have added the python script to to get create it. Have a look at it too , if there is something you can understand you can surely ask me.

Thanks a lot…
yeah i have seen that file.will help me a lot .Actually i want this data in CSV format for further processing.
Or if I want to apply Tf-IDf ,can I continue with JSon format??

you directly create a csv file from it , like using pandas try implementing pd.DataFrame.from_json function. It might work or also might raise a format error.
Else, i have provided you with the code , you can change it in the way you like.

If there is some , confusion then i am here to help you.